-
Notifications
You must be signed in to change notification settings - Fork 0
MaLPi Task List
Bleyddyn edited this page Jul 8, 2023
·
1 revision
This is my very tentative list of features to add to MaLPi. Although they are listed in numbered order, only the first one is 'real'. If/when I finish the first task I'll re-asses the list, possibly moving things around or dropping some or adding new ones.
- ✓ Pre-train VAE on images from all but one track
- Add auxiliary outputs: CTE, track #, steering, throttle
- Collect more training data from all tracks
- ✓ Train separate policies on each track with the VAE as an image embedder
- Testing script needs to output a single value with standard deviation
- Implement Pre-training paper
- Switch to RNN policy
- Add a small DNC as a working memory
- Add more tasks
- Cone/Human/dog/etc detector (bounding box as output)
- Drive to goal with goal given as an image
- Sketches as inputs (goals) and/or outputs
- NLP description of a scene or task trajectory as output
- Switch to more formal multi-task and/or meta-learning and/or lifelong learning
- Add some form of Aggregate memory that includes all previous experience
- Add an IMU and learn to detect crashes/bumps
Finished
- ✓ VAE based DonkeyCar pilot
- Switch from fastai to Lightning
- Train a single policy on multiple tracks/tasks
Model Predictive Control is a control method based on planning n-steps into the future to find the best path, as currently determined by the model. Take the first step. Then re-plan, take the new first step. Repeat until done.