1. 07 9月, 2021 1 次提交
  2. 06 9月, 2021 5 次提交
    • Y
      feature(zym): add offlineRL algo CQL; add offlineRL env D4RL (#37) · 69828ed5
      Yinmin.Zhang 提交于
      * feature(zym): add pybullet env info; add entropy type in sac.
      
      * feature(zym): add cql; add serial entry for offlineRL.
      
      * feature/polish(zym): add generation entry in mujoco env for offlineRL; polish cql/serial entry for offlineRL.
      
      * feature(lj): add d4rl env for offlineRL.
      
      * polish(zym): polish cql.
      
      * feature/polish(zym): add dataset registry; polish offlineRL pipeline.
      
      * fix(zym): fix bug in d4rl/mujoco config; fix bug in dataset for offlineRL.
      
      * style(zym): add pybulletgym and d4rl requirements in setup.
      
      * fix/polish(zym): support str in NaiveRLDataset; polish cql.
      
      * polish(zym): polish command policy.
      
      * feature(zym): add cql in pendulum env; add unittest/algotest for cql.
      
      * fix(zym): fix cql bug in unittest/algotest for cql.
      69828ed5
    • N
      style(nyz): add algorithm list in README · 110d4063
      niuyazhe 提交于
      110d4063
    • N
      style(nyz): add algorithm list in README · 12a727cd
      niuyazhe 提交于
      12a727cd
    • W
      enable user to use any expert model for sqil(#44) · 5fbc9453
      Will-Nie 提交于
      * enable user to use any model generated here
      
      * delete irelevant package
      
      * add test
      
      * bash format.sh to reformat style
      5fbc9453
    • fix(pu): fix r2d2 bug (#36) · c8dac674
      蒲源 提交于
      * test rnd
      
      * fix mz config
      
      * fix config
      
      * fix(pu): fix r2d2
      
      * feature(puyuan): add minigrid r2d2 config
      
      * polish minigrid config
      
      * modified as review
      
      * fix(pu): fix bugffor compatibility
      
      * polish(pu): add annotations and polish slice operation
      
      * style(pu): run format.sh
      
      * style(pu): correct yapf format
      c8dac674
  3. 03 9月, 2021 3 次提交
  4. 02 9月, 2021 4 次提交
  5. 31 8月, 2021 1 次提交
  6. 27 8月, 2021 4 次提交
  7. 26 8月, 2021 1 次提交
  8. 25 8月, 2021 2 次提交
  9. 24 8月, 2021 3 次提交
  10. 23 8月, 2021 3 次提交
  11. 20 8月, 2021 1 次提交
    • W
      SQIL (#25) · 9929dc37
      Will-Nie 提交于
      * add sqil
      
      * conceal all the personal info
      
      * revise according to the comments
      
      * correct_format
      
      * add_comment to hardcodes part
      
      * pass flake8
      
      * add force_reproducibility = True; device, ex_model
      
      * check format
      9929dc37
  12. 19 8月, 2021 2 次提交
  13. 13 8月, 2021 1 次提交
  14. 11 8月, 2021 2 次提交
  15. 10 8月, 2021 1 次提交
    • G
      add overcooked environment (#20) · c1d22458
      garyzhang99 提交于
      * init runable ppo
      
      * init overcooked env
      
      * overcooked ppo in place
      
      * runable ppo with shaped rewards
      
      * modified config
      
      * feature(nyz): modify win rate calculation with draws
      
      * remove redundant code, modified baseline model
      
      * Update __init__.py
      
      * Update config.py
      
      * modify temp_config_file.close() position in config.py to work in windows os
      
      * remove redundant comments and rename files
      
      * fix name bug and use namedlist
      
      * add simple readme and remove redundant comments from copies
      
      * resolve threads
      
      * remove debug comments
      Co-authored-by: Nniuyazhe <niuyazhe314@outlook.com>
      c1d22458
  16. 06 8月, 2021 1 次提交
  17. 03 8月, 2021 5 次提交