Merge mw v2 examples #2287
Commits on Jun 13, 2021
-
ML Launchers, Modified RL2 and MAML
- ML1 launchers and ML10 launchers
- Add RL2 with stddev clipping, negative log-likelihood entropy, and a shared std and mean network.
- Use Ryan's tanh clipping in the Gaussian GRU model/policy to maintain stability in the KL_before metric.
- Use a linear baseline in MAML instead of an MLP baseline, which had trouble fitting observations across the different tasks
- All MT10 launchers
- TE update to log named params
- Modify PEARL to add logging of train tasks
- Add PEARL train logging and increase the number of train tasks to be large enough for ML10 in the pearl_ml10 launcher
- Update launchers
- Metaworld v1 MT10 experiments
- Modify launchers with final hparams and launching rate limiter
- RL2 tuned for old Metaworld
- ML10 launchers for v1 environments
- ML10 old v1 launchers
- Final v2 ML10 MT10 launchers
- Launchers used for final MT10 and ML10 results
- Launchers round 3
- Updated door envs in Metaworld for ML10 environments
- Final ML10 MT10 launchers used for the paper
- Initial commit MT1 launchers
- Extend RL2 MTPPO MTTRPO experiments
- Update Metaworld to master
- ML1 RL2 round 2
- Increase RL2 PPO ML10 steps
- RL2 ML1 fixes
- Decrease ml1-rl2-ppo eval epochs
- Reduce logging epochs for rl2_ppo ML10
- MT10 5e8 steps
- Downgrade torch
- TE PPO increase timesteps
- MT1 experiments with fixed MT1
- Adjust TE to log all tasks
- Switch back to epoch cycles
- Update MTPPO and MTTRPO params
- MTPPO MT50
- MTTRPO MT50 launcher
- TE PPO MT50
- MTSAC MT50
- MAML and RL2 ML45
- New hparams ML1 PEARL
- Debugging MAML ML45
Avnish Narayan committed Jun 13, 2021 (4c39bf7)
-
Avnish Narayan committed Jun 13, 2021 (ea44917)
-
Update to use new dev containers
Avnish Narayan committed Jun 13, 2021 (13dd0dd)
-
PEARL ML1 with Kate's recommendations
Avnish Narayan committed Jun 13, 2021 (ea5d9d7)
-
Avnish Narayan committed Jun 13, 2021 (c2a2c4a)
-
Update to sample new tasks after a certain number of epochs
Avnish Narayan committed Jun 13, 2021 (766044d)
-
Round 8 MTTRPO MTPPO update to address task sampling not happening
Avnish Narayan committed Jun 13, 2021 (08272cf)
Commits on Jun 24, 2021
-
Move metaworld examples, other refactors
- remove the torch linear feature baseline wrapper
- add param docs to all new functions and launchers
- delete unused docker and gcp launch files
- mt1 examples do not work yet
Avnish Narayan committed Jun 24, 2021 (0849200)