MaLPi Task List

This is my very tentative list of features to add to MaLPi. Although they are listed in numbered order, only the first one is 'real'. If/when I finish the first task I'll re-asses the list, possibly moving things around or dropping some or adding new ones.

✓ Pre-train VAE on images from all but one track
- Add auxiliary outputs: CTE, track #, steering, throttle
- Collect more training data from all tracks
✓ Train separate policies on each track with the VAE as an image embedder
- Testing script needs to output a single value with standard deviation
Implement Pre-training paper
Switch to RNN policy
Add a small DNC as a working memory
Add more tasks
- Cone/Human/dog/etc detector (bounding box as output)
- Drive to goal with goal given as an image
- Sketches as inputs (goals) and/or outputs
- NLP description of a scene or task trajectory as output
Switch to more formal multi-task and/or meta-learning and/or lifelong learning
Add some form of Aggregate memory that includes all previous experience
Add an IMU and learn to detect crashes/bumps

Finished

✓ VAE based DonkeyCar pilot
Switch from fastai to Lightning
Train a single policy on multiple tracks/tasks

Model Predictive Control is a control method based on planning n-steps into the future to find the best path, as currently determined by the model. Take the first step. Then re-plan, take the new first step. Repeat until done.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MaLPi Task List

Clone this wiki locally