- 02 9月, 2021 4 次提交
-
-
-
由 niuyazhe 提交于
-
由 Swain 提交于
* feature(nyz): add trueskill as league metric, naive elo calculator, fix game_env info bug * fix(nyz): fix league player mutate bug * fix(nyz): fix league unittest bug * feature(nyz): add elo ranking in league metric env * polish(nyz): modify fixed eval policy and trueskill init * feature(nyz): add init main player in evaluation and fix stop_value bug * style(nyz): rename test_league_metric to avoid pyc cache bug
-
由 niuyazhe 提交于
-
- 31 8月, 2021 1 次提交
-
-
由 niuyazhe 提交于
-
- 27 8月, 2021 4 次提交
-
-
-
由 niuyazhe 提交于
-
由 Yinmin.Zhang 提交于
-
由 niuyazhe 提交于
-
- 26 8月, 2021 1 次提交
-
-
由 niuyazhe 提交于
-
- 25 8月, 2021 2 次提交
- 24 8月, 2021 3 次提交
- 23 8月, 2021 3 次提交
- 20 8月, 2021 1 次提交
-
-
由 Will-Nie 提交于
* add sqil * conceal all the personal info * revise according to the comments * correct_format * add_comment to hardcodes part * pass flake8 * add force_reproducibility = True; device, ex_model * check format
-
- 19 8月, 2021 2 次提交
-
-
由 Weiyuhong-1998 提交于
-
由 niuyazhe 提交于
-
- 13 8月, 2021 1 次提交
-
-
由 simonat2011 提交于
* fix weight bug * update acer qbert result * fix flake8 format problem * update space qbert config * update as review Co-authored-by: Nshenziju <simonshen2011@foxmail.com>
-
- 11 8月, 2021 2 次提交
- 10 8月, 2021 1 次提交
-
-
由 garyzhang99 提交于
* init runable ppo * init overcooked env * overcooked ppo in place * runable ppo with shaped rewards * modified config * feature(nyz): modify win rate calculation with draws * remove redundant code, modified baseline model * Update __init__.py * Update config.py * modify temp_config_file.close() position in config.py to work in windows os * remove redundant comments and rename files * fix name bug and use namedlist * add simple readme and remove redundant comments from copies * resolve threads * remove debug comments Co-authored-by: Nniuyazhe <niuyazhe314@outlook.com>
-
- 06 8月, 2021 1 次提交
-
-
由 niuyazhe 提交于
-
- 03 8月, 2021 7 次提交
-
-
由 niuyazhe 提交于
-
由 niuyazhe 提交于
-
由 niuyazhe 提交于
-
由 niuyazhe 提交于
-
由 niuyazhe 提交于
-
由 niuyazhe 提交于
-
由 Swain 提交于
* feature(nyz): add naive 1v1 two player demo * feature(nyz): add 1v1 evaluator and 2 rule-based policy for evaluation * feature(nyz): modify game env and adjust hyper-param * feature(nyz): add naive league training multi player demo * feature(nyz): enable force snapshot to support init historical league player; finish league demo basic code * feature(nyz): modify selfplay demo and add two type game env * style(nyz): correct format style * polish(nyz): correct format style and adapt league demo main * feature(nyz): add league payoff viz and enable payoff update in league demo * feature(nyz): modify win rate calculation with draws * test(nyz): fix one vs one league test compatibility bug * test(nyz): add selfplay and league demo into unittest and algotest * style(nyz): correct format * hotfix(nyz): fix ppo continuous comatibility bug
-
- 01 8月, 2021 2 次提交
-
-
由 simonat2011 提交于
* add endoro env config. add enduro's ppo,dqn,drdqn,rainbow,impala config. * modified as reviewer mentions * add qacd network * fix bugs * fix bugs * update acer algorithm * update ACER code * update acer config * fix bug * update pong acer's config * edit commit * update code as mention * fix the comment table and trust region * fix format * fix typing lint * fix format,flake8 * fix format * fix whitespace problem * test(nyz): add acer unittest and algotest * style(nyz): correct flake8 style Co-authored-by: Nshenziju <simonshen2011@foxmail.com> Co-authored-by: NSwain <niuyazhe314@outlook.com>
-
由 Yinmin.Zhang 提交于
* add pybullet envs. * add td3/ddpg/sac/ppo configs for pybullet. * update td3/ddpg/sac/ppo configs for pybullet. * update td3 configs; remove td3 model.
-
- 29 7月, 2021 4 次提交
-
-
由 niuyazhe 提交于
-
由 simonat2011 提交于
* add endoro env config. add enduro's ppo,dqn,drdqn,rainbow,impala config. * modified as reviewer mentions * add qacd network * fix bugs * update dizoo readme * add README.md about max reward result * update dqn config and update README Co-authored-by: Nshenziju <simonshen2011@foxmail.com> Co-authored-by: Nsimon shen <simon@simondeMacBook-Air.local> Co-authored-by: NSwain <niuyazhe314@outlook.com>
-
由 niuyazhe 提交于
-
由 Swain 提交于
on policy ppo (#9)
-
- 23 7月, 2021 1 次提交
-
-
https://github.com/opendilab/DI-engine由 zhangyinmin 提交于
# Conflicts: # ding/policy/common_utils.py # ding/policy/ppo.py
-