1. 08 Sep, 2021 3 commits
    • feature(nyz): add supervised learning image classification training demo (#27) · 11cc97e8
      Swain committed
      * feature(nyz): add resnet for cv sl task
      
      * feature(nyz): add imagenet classification dataset and adapt compile config for sl
      
      * feature(nyz): add naive image training entry demo
      
      * style(nyz): polish image cls train log
      
      * polish(nyz): polish multi gpu training setting
      
      * feature(nyz): add nn training bp and update async execution
      
      * feature(nyz): add distributed sampler for different dist backend
      
      * fix(nyz): fix compile config collector and buffer compatibility problem
      
      * style(nyz): correct yapf format
      
      * fix(nyz): fix env manager compile config compatibility bug
      
      * refactor(nyz): abstract ISerialEvaluator and rename serial evaluation implementation
      
      * refactor(nyz): refactor collector name
      
      * feature(nyz): add metric evaluator and image cls acc metric eval demo
      
      * fix(nyz): fix cuda and multi gpu bug in image cls demo
    • style(nyz): polish env table in README · 1439e22f
      niuyazhe committed
    • style(wyh): add env information in readme (#46) · fa453ef0
      Weiyuhong-1998 committed
      * env-list
      
      * env-list-fix-grammmer
      
      * env-only-test
      
      * modify-gif
      
      * modify-gif-pendulum
      
      * modify-gif-delect-maze
  2. 07 Sep, 2021 2 commits
  3. 06 Sep, 2021 2 commits
    • feature(zym): add offlineRL algo CQL; add offlineRL env D4RL (#37) · 69828ed5
      Yinmin.Zhang committed
      * feature(zym): add pybullet env info; add entropy type in sac.
      
      * feature(zym): add cql; add serial entry for offlineRL.
      
      * feature/polish(zym): add generation entry in mujoco env for offlineRL; polish cql/serial entry for offlineRL.
      
      * feature(lj): add d4rl env for offlineRL.
      
      * polish(zym): polish cql.
      
      * feature/polish(zym): add dataset registry; polish offlineRL pipeline.
      
      * fix(zym): fix bug in d4rl/mujoco config; fix bug in dataset for offlineRL.
      
      * style(zym): add pybulletgym and d4rl requirements in setup.
      
      * fix/polish(zym): support str in NaiveRLDataset; polish cql.
      
      * polish(zym): polish command policy.
      
      * feature(zym): add cql in pendulum env; add unittest/algotest for cql.
      
      * fix(zym): fix cql bug in unittest/algotest for cql.
    • fix(pu): fix r2d2 bug (#36) · c8dac674
      蒲源 committed
      * test rnd
      
      * fix mz config
      
      * fix config
      
      * fix(pu): fix r2d2
      
      * feature(puyuan): add minigrid r2d2 config
      
      * polish minigrid config
      
      * modified as review
      
      * fix(pu): fix bug for compatibility
      
      * polish(pu): add annotations and polish slice operation
      
      * style(pu): run format.sh
      
      * style(pu): correct yapf format
  4. 02 Sep, 2021 2 commits
    • hotfix(nyz): fix cartpole ppg value buffer sample typo · da19fdbd
      niuyazhe committed
    • League Metric (#22) · d24f1f3d
      Swain committed
      * feature(nyz): add trueskill as league metric, naive elo calculator, fix game_env info bug
      
      * fix(nyz): fix league player mutate bug
      
      * fix(nyz): fix league unittest bug
      
      * feature(nyz): add elo ranking in league metric env
      
      * polish(nyz): modify fixed eval policy and trueskill init
      
      * feature(nyz): add init main player in evaluation and fix stop_value bug
      
      * style(nyz): rename test_league_metric to avoid pyc cache bug
  5. 27 Aug, 2021 2 commits
  6. 25 Aug, 2021 1 commit
  7. 24 Aug, 2021 2 commits
  8. 23 Aug, 2021 2 commits
  9. 20 Aug, 2021 1 commit
    • SQIL (#25) · 9929dc37
      Will-Nie committed
      * add sqil
      
      * conceal all the personal info
      
      * revise according to the comments
      
      * correct_format
      
      * add_comment to hardcodes part
      
      * pass flake8
      
      * add force_reproducibility = True; device, ex_model
      
      * check format
  10. 19 Aug, 2021 1 commit
  11. 13 Aug, 2021 1 commit
  12. 11 Aug, 2021 2 commits
  13. 10 Aug, 2021 1 commit
    • add overcooked environment (#20) · c1d22458
      garyzhang99 committed
      * init runnable ppo
      
      * init overcooked env
      
      * overcooked ppo in place
      
      * runnable ppo with shaped rewards
      
      * modified config
      
      * feature(nyz): modify win rate calculation with draws
      
      * remove redundant code, modified baseline model
      
      * Update __init__.py
      
      * Update config.py
      
      * modify temp_config_file.close() position in config.py to work in windows os
      
      * remove redundant comments and rename files
      
      * fix name bug and use namedlist
      
      * add simple readme and remove redundant comments from copies
      
      * resolve threads
      
      * remove debug comments
      Co-authored-by: niuyazhe <niuyazhe314@outlook.com>
  14. 03 Aug, 2021 4 commits
  15. 01 Aug, 2021 2 commits
    • add ACER algorithm(szj) (#14) · dd4de1a0
      simonat2011 committed
      * add enduro env config. add enduro's ppo, dqn, drdqn, rainbow, impala config.
      
      * modified as reviewer mentions
      
      * add qacd network
      
      * fix bugs
      
      * fix bugs
      
      * update acer algorithm
      
      * update ACER code
      
      * update acer config
      
      * fix bug
      
      * update pong acer's config
      
      * edit commit
      
      * update code as mention
      
      * fix the comment table and trust region
      
      * fix format
      
      * fix typing lint
      
      * fix format,flake8
      
      * fix format
      
      * fix whitespace problem
      
      * test(nyz): add acer unittest and algotest
      
      * style(nyz): correct flake8 style
      Co-authored-by: shenziju <simonshen2011@foxmail.com>
      Co-authored-by: Swain <niuyazhe314@outlook.com>
    • add pybullet env (#16) · dc161ea5
      Yinmin.Zhang committed
      * add pybullet envs.
      
      * add td3/ddpg/sac/ppo configs for pybullet.
      
      * update td3/ddpg/sac/ppo configs for pybullet.
      
      * update td3 configs; remove td3 model.
  16. 29 Jul, 2021 2 commits
  17. 23 Jul, 2021 1 commit
  18. 22 Jul, 2021 1 commit
  19. 21 Jul, 2021 3 commits
  20. 19 Jul, 2021 2 commits
  21. 17 Jul, 2021 1 commit
  22. 16 Jul, 2021 2 commits