Merge mw v2 examples #2287

ML1 launchers and ML10 launchers Add rl2 with stddev clipping and neg log liklihood entroy and shared std and mean network. Use ryan's tanh clipping in the gaussian gru model/policy to maintain stability in the KL_before metric. Use a Linear baseline in MAML instead of the a mlp baseline, which had trouble fitting to observations between the different tasks All MT10 Launchers TE update to log named params Modify Pearl to add logging of train tasks add pearl train logging and also increase the train tasks to be large enough for ML10 in pearl_ml10 launcher Update Launchers Metaworld v1 mt10 experiments Modify launchers with final hparams and launching rate limiter RL2 tuned for old metaworld ML10 launchers for v1 environments ML10 old v1 launchers Final V2 ML10 MT10 launchers Launchers Used for final MT10 and ML10 Results Launchers round3 Updated door envs in metaworld for ML10 environments Final ML10 MT10 Launchers used for the Paper Initial Commit MT1 launchers Extend RL2 MTPPO MTTRPO experiments update metaworld to master ML1 rl2 round2 Increase rl2 ppo ml10 steps Rl2 Ml1 fixes decrease ml1-rl2-ppo eval epochs Reduce Logging epochs for rl2_ppo ml10 MT10 5e8 steps Downgrade torch TE PPO increase timesteps MT1 Experiments with fixed MT1 Adjust TE to log all tasks Switch back to epoch cycles Update MTPPO and MTTRPO params MTPPO MT50 MTTRPO MT50 launcher TE PPO MT50 MTSAC MT50 MAML and RL2 ML45 New hparams ml1 PEARL Debugging MAML ML45

- remove the torch linear feature baseline wrapper - add param docs to all new functions and launchers - delete unused docker and gcp launch files - mt1 examples do not work yet

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Merge mw v2 examples #2287

Merge mw v2 examples #2287

Commits on Jun 13, 2021

Commits on Jun 24, 2021