1. 08 Sep, 2021 4 commits
    • feature(nyz): add supervised learning image classification training demo (#27) · 11cc97e8
      Swain committed
      * feature(nyz): add resnet for cv sl task
      
      * feature(nyz): add imagenet classification dataset and adapt compile config for sl
      
      * feature(nyz): add naive image training entry demo
      
      * style(nyz): polish image cls train log
      
      * polish(nyz): polish multi gpu training setting
      
      * feature(nyz): add nn training bp and update async execution
      
      * feature(nyz): add distributed sampler for different dist backend
      
      * fix(nyz): fix compile config collector and buffer compatibility problem
      
      * style(nyz): correct yapf format
      
      * fix(nyz): fix env manager compile config compatibility bug
      
      * refactor(nyz): abstract ISerialEvaluator and rename serial evaluation implementation
      
      * refactor(nyz): refactor collector name
      
      * feature(nyz): add metric evaluator and image cls acc metric eval demo
      
      * fix(nyz): fix cuda and multi gpu bug in image cls demo
    • style(nyz): update sparse reward badge in env table · 5e52c1a0
      Swain committed
    • style(nyz): polish env table in README · 1439e22f
      niuyazhe committed
    • style(wyh): add env information in readme (#46) · fa453ef0
      Weiyuhong-1998 committed
      * env-list
      
      * env-list-fix-grammar
      
      * env-only-test
      
      * modify-gif
      
      * modify-gif-pendulum
      
      * modify-gif-delete-maze
  2. 07 Sep, 2021 2 commits
  3. 06 Sep, 2021 5 commits
    • feature(zym): add offlineRL algo CQL; add offlineRL env D4RL (#37) · 69828ed5
      Yinmin.Zhang committed
      * feature(zym): add pybullet env info; add entropy type in sac.
      
      * feature(zym): add cql; add serial entry for offlineRL.
      
      * feature/polish(zym): add generation entry in mujoco env for offlineRL; polish cql/serial entry for offlineRL.
      
      * feature(lj): add d4rl env for offlineRL.
      
      * polish(zym): polish cql.
      
      * feature/polish(zym): add dataset registry; polish offlineRL pipeline.
      
      * fix(zym): fix bug in d4rl/mujoco config; fix bug in dataset for offlineRL.
      
      * style(zym): add pybulletgym and d4rl requirements in setup.
      
      * fix/polish(zym): support str in NaiveRLDataset; polish cql.
      
      * polish(zym): polish command policy.
      
      * feature(zym): add cql in pendulum env; add unittest/algotest for cql.
      
      * fix(zym): fix cql bug in unittest/algotest for cql.
    • style(nyz): add algorithm list in README · 110d4063
      niuyazhe committed
    • style(nyz): add algorithm list in README · 12a727cd
      niuyazhe committed
    • enable user to use any expert model for sqil (#44) · 5fbc9453
      Will-Nie committed
      * enable user to use any model generated here
      
      * delete irrelevant package
      
      * add test
      
      * bash format.sh to reformat style
    • fix(pu): fix r2d2 bug (#36) · c8dac674
      蒲源 committed
      * test rnd
      
      * fix mz config
      
      * fix config
      
      * fix(pu): fix r2d2
      
      * feature(puyuan): add minigrid r2d2 config
      
      * polish minigrid config
      
      * modified as review
      
      * fix(pu): fix bug for compatibility
      
      * polish(pu): add annotations and polish slice operation
      
      * style(pu): run format.sh
      
      * style(pu): correct yapf format
  4. 03 Sep, 2021 3 commits
  5. 02 Sep, 2021 4 commits
  6. 31 Aug, 2021 1 commit
  7. 27 Aug, 2021 4 commits
  8. 26 Aug, 2021 1 commit
  9. 25 Aug, 2021 2 commits
  10. 24 Aug, 2021 3 commits
  11. 23 Aug, 2021 3 commits
  12. 20 Aug, 2021 1 commit
    • SQIL (#25) · 9929dc37
      Will-Nie committed
      * add sqil
      
      * conceal all the personal info
      
      * revise according to the comments
      
      * correct_format
      
      * add_comment to hardcodes part
      
      * pass flake8
      
      * add force_reproducibility = True; device, ex_model
      
      * check format
  13. 19 Aug, 2021 2 commits
  14. 13 Aug, 2021 1 commit
  15. 11 Aug, 2021 2 commits
  16. 10 Aug, 2021 1 commit
    • add overcooked environment (#20) · c1d22458
      garyzhang99 committed
      * init runnable ppo
      
      * init overcooked env
      
      * overcooked ppo in place
      
      * runnable ppo with shaped rewards
      
      * modified config
      
      * feature(nyz): modify win rate calculation with draws
      
      * remove redundant code, modified baseline model
      
      * Update __init__.py
      
      * Update config.py
      
      * modify temp_config_file.close() position in config.py to work in windows os
      
      * remove redundant comments and rename files
      
      * fix name bug and use namedlist
      
      * add simple readme and remove redundant comments from copies
      
      * resolve threads
      
      * remove debug comments
      Co-authored-by: niuyazhe <niuyazhe314@outlook.com>
  17. 06 Aug, 2021 1 commit