Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Refractor - Rodent Imitation/Online RL - Distributed Training - Visio…
…n Encoder - Intention Network Arch (#2) * ignore data folder * conda env creation script for TF cuda support * added requirement.txt * modify the environment before the warpper. * injected rodent class * ignore checkpoint file * modify for rodent * added my note * render expet * modify to work with rodent env * ignore logging * initialized * modified * modified * ignore the training * update notes * added flag todo * chagne hyperparam * rendering * added wandb logging for learner + configuration file support * attempt to do Rodent SingleClipTreacking * added config file & logging support * ready for merge * added preprocessing step * change to vast * move to vast * distributed ray support * ready for merge * Support rodent imitation now * scheduling strategies * distributed training support from flybody code * update my notes * update progress on genearlist training * update my notes * distributed training * ready for merge * black formatted * support intention networks with wrapper on the observation space * generalist training ready * update my notes * make network smaller * update notes * kickstarting training * added back fly and humanoid imitation, load mocap data set, and added intention network * finish kickstarting and general training * added notes * support multiple replay servers for both multi-task/single task training. * bug * support wandb rendering for rollout during training * humanoid tracking working * clean up imitation learning pipeline * curriculum learning dev in progress * fix api between online & imitation * tuning for imitaion learning * added average report of the evaluator * updates my note * update * clean up repo * debug decoder arch * added checkpointer for decoder * added better logging terms * support custom reward scaling term * add option to remove skin * clean up unused imitation visnet * undo remove skin -> modify xml file * implement encoder snapshotter * more robust rendering * formats * refractor + fix bugs in eval * Simplify API * added iterative lifting * modify default action std * rename & fix NN shape issue * refractor intention network API - more general * adapt logger API * API Changes * add reward composition rendering * expose reward composition weight to config * added upright reward & contact terminations * support mixed experience replay server * reward rendering util * implemented visual intention network * explicit froze_decoder option when loading checkpoints * remove redundant comments * implement reward term logging for online tasks + reward termination logics * added config file * renamed folder + package name * rename package reference name * delete unused rodent file * added config option for discount * add hierarchical config file for better readability * added reward termination/more reward terms for bowl escape * implemented multiple layer of encoder * change reward condition * updates readme --------- Co-authored-by: KevinBian107 <[email protected]>
- Loading branch information