Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merge mw v2 examples #2287

Closed
wants to merge 8 commits into from
Closed

Merge mw v2 examples #2287

wants to merge 8 commits into from

Commits on Jun 13, 2021

  1. ML Launchers, Modified RL2 and MAML

    ML1 launchers and ML10 launchers
    
    Add rl2 with stddev clipping and neg log liklihood entroy and shared std and mean network.
    Use ryan's tanh clipping in the gaussian gru model/policy to maintain
    stability in the KL_before metric.
    
    Use a Linear baseline in MAML instead of the a mlp baseline,
    which had trouble fitting to observations between the different
    tasks
    
    All MT10 Launchers
    
    TE update to log named params
    
    Modify Pearl to add logging of train tasks
    
    add pearl train logging and also increase
    
    the train tasks to be large enough for ML10
    in pearl_ml10 launcher
    
    Update Launchers
    
    Metaworld v1 mt10 experiments
    
    Modify launchers with final hparams and launching rate limiter
    
    RL2 tuned for old metaworld
    
    ML10 launchers for v1 environments
    
    ML10 old v1 launchers
    
    Final V2 ML10 MT10 launchers
    
    Launchers Used for final MT10 and ML10 Results
    
    Launchers round3
    
    Updated door envs in metaworld for ML10 environments
    
    Final ML10 MT10 Launchers used for the Paper
    
    Initial Commit MT1 launchers
    
    Extend RL2 MTPPO MTTRPO experiments update metaworld to master
    
    ML1 rl2 round2
    
    Increase rl2 ppo ml10 steps
    
    Rl2 Ml1 fixes
    
    decrease ml1-rl2-ppo eval epochs
    
    Reduce Logging epochs for rl2_ppo ml10
    
    MT10 5e8 steps
    
    Downgrade torch
    
    TE PPO increase timesteps
    
    MT1 Experiments with fixed MT1
    
    Adjust TE to log all tasks
    
    Switch back to epoch cycles
    
    Update MTPPO and MTTRPO params
    
    MTPPO MT50
    
    MTTRPO MT50 launcher
    
    TE PPO MT50
    
    MTSAC MT50
    
    MAML and RL2 ML45
    
    New hparams ml1 PEARL
    
    Debugging MAML ML45
    Avnish Narayan committed Jun 13, 2021
    Configuration menu
    Copy the full SHA
    4c39bf7 View commit details
    Browse the repository at this point in the history
  2. Fixed MAML ML45

    Avnish Narayan committed Jun 13, 2021
    Configuration menu
    Copy the full SHA
    ea44917 View commit details
    Browse the repository at this point in the history
  3. Update to use new dev containers

    Avnish Narayan committed Jun 13, 2021
    Configuration menu
    Copy the full SHA
    13dd0dd View commit details
    Browse the repository at this point in the history
  4. PEARL ML1 with kate's recommendations

    Avnish Narayan committed Jun 13, 2021
    Configuration menu
    Copy the full SHA
    ea5d9d7 View commit details
    Browse the repository at this point in the history
  5. Update PEARL sampling params

    Avnish Narayan committed Jun 13, 2021
    Configuration menu
    Copy the full SHA
    c2a2c4a View commit details
    Browse the repository at this point in the history
  6. Update to sample new tasks after a certain number of epochs

    Avnish Narayan committed Jun 13, 2021
    Configuration menu
    Copy the full SHA
    766044d View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    08272cf View commit details
    Browse the repository at this point in the history

Commits on Jun 24, 2021

  1. Move metaworld examples, other refactors

    - remove the torch linear feature baseline wrapper
    - add param docs to all new functions and launchers
    - delete unused docker and gcp launch files
    
    - mt1 examples do not work yet
    Avnish Narayan committed Jun 24, 2021
    Configuration menu
    Copy the full SHA
    0849200 View commit details
    Browse the repository at this point in the history