1. 29 10月, 2021 1 次提交
    • S
      feature(nyz): add PADDPG for hybrid action space as baseline (#109) · d2f79536
      Swain 提交于
      * fix(nyz): fix gym_hybrid env not scale action bug
      
      * feature(nyz): add PADDPG basic implementation for hybrid action space
      
      * fix(nyz): fix td3/d4pg comatibility bug with new modifications
      
      * fix(nyz): fix hybrid ddpg action type grad bug and update config
      
      * feature(nyz): add eps greedy + multinomial wrapper and gym_hybrid ddpg convergence config
      
      * style(nyz): update PADDPG in README
      
      * test_model_hybrid_qac
      
      * fix_typo_in_README
      
      * test_policy_hybrid_qac
      
      * polish(nyz): polish hybrid action space to dict structure and polish unittest
      
      * fix(nyz): fix td3bc compatibility bug
      Co-authored-by: N李可 <like2@CN0014008466M.local>
      d2f79536
  2. 28 10月, 2021 1 次提交
    • S
      feature(nyz): add gobigger baseline (#95) · a8fec8bb
      Swain 提交于
      * feature(nyz): add gobigger baseline
      
      * style(nyz): add gobigger env infor
      
      * feature(nyz): add ignore prefix in default collate
      
      * feautre(nyz): add vsbot training baseline
      
      * fix(nyz): fix to_tensor empty list bug and polish gobigger baseline
      
      * style(nyz): split gobigger baseline code
      a8fec8bb
  3. 22 10月, 2021 1 次提交
    • Y
      feature(zym): add offlineRL algo td3_bc and polish policy comments(#88) · 7c1b5e95
      Yinmin.Zhang 提交于
      * feature(zym): add offlineRL algo td3_bc.
      
      * feature(zym): add offlineRL algo td3_bc.
      
      * feature(zym): add offlineRL algo td3_bc.
      
      * polish(zym): polish some annotations in td3/ddpg/sac/ppo; polish `_forward_collect` and `_foward_eval`.
      
      * fix(lj): fix dimension bug in cql for continuous env.
      
      * fix(zym): fix dimension bug in cql for continuous env.
      
      * fix(zym): fix dimension bug in cql for continuous env.
      
      * polish(zym): update README.md.
      7c1b5e95
  4. 21 10月, 2021 1 次提交
    • K
      feature(lk): add gym-soccer (HFO) env (#94) · 8f47f4cb
      Ke Li 提交于
      * add_soccer_env
      
      * add_info
      
      * close
      
      * format
      
      * test_gym_soccer
      
      * rm_torch
      
      * replay_log
      
      * format_style
      
      * add_gym_soccer_to_readme
      
      * separate render_func
      
      * add_gif_file
      
      * scale_action
      
      * flake_style_format
      
      * resolve_review_comments
      
      * add branch info for gym hybrid
      8f47f4cb
  5. 19 10月, 2021 1 次提交
  6. 16 10月, 2021 1 次提交
    • W
      feature(nyp): add DQfD algorithm (#48) · e2ca8738
      Will-Nie 提交于
      * add_dqfd
      
      * Is_expert to is_expert
      
      * modify according to the last commnets
      
      * value_gamma; done; marginloss; sqil compatibility
      
      * finally shorten the code, revise config
      
      * revise config, style
      
      * add_readme/two_more_config
      
      * correct format
      Co-authored-by: Nniuyazhe <niuyazhe@sensetime.com>
      e2ca8738
  7. 12 10月, 2021 1 次提交
  8. 08 10月, 2021 1 次提交
    • L
      feature(zlx): add vs bot training and self-play training with slime volley env (#23) · dbf432cd
      LuciusMos 提交于
      * slime volley env in dizoo, first commit
      
      * fix bug in slime volley env
      
      * modify volley env to satisfy ding 1v1 requirements; add naive self-play and league training pipeline(evaluator is not finished, now use a very naive one)
      
      * adopt volley builtin ai as default eval opponent
      
      * polish(nyz): polish slime_volley_env and its test
      
      * feature(nyz): add slime_volley vs bot ppo demo
      
      * feature(nyz): add battle_sample_serial_collector and adapt abnormal check in subprocess env manager
      
      * feature(nyz): add slime volley self-play demo
      
      * style(nyz): add slime_volleyball env gif and split MARL and selfplay label
      
      * feature(nyz): add save replay function in slime volleyball env
      Co-authored-by: Nzlx-sensetime <zhaoliangxuan@sensetime.com>
      Co-authored-by: Nniuyazhe <niuyazhe@sensetime.com>
      dbf432cd
  9. 01 10月, 2021 1 次提交
  10. 30 9月, 2021 4 次提交
  11. 24 9月, 2021 1 次提交
  12. 23 9月, 2021 1 次提交
  13. 17 9月, 2021 2 次提交
  14. 14 9月, 2021 1 次提交
  15. 08 9月, 2021 3 次提交
  16. 06 9月, 2021 2 次提交
  17. 03 9月, 2021 1 次提交
  18. 26 8月, 2021 1 次提交
  19. 03 8月, 2021 1 次提交
  20. 14 7月, 2021 2 次提交
  21. 13 7月, 2021 1 次提交
  22. 12 7月, 2021 2 次提交
  23. 11 7月, 2021 1 次提交
  24. 10 7月, 2021 1 次提交
  25. 08 7月, 2021 3 次提交