Committed by Swain on Aug 03, 2021

* feature(nyz): add naive 1v1 two player demo

* feature(nyz): add 1v1 evaluator and 2 rule-based policy for evaluation

* feature(nyz): modify game env and adjust hyper-param

* feature(nyz): add naive league training multi player demo

* feature(nyz): enable force snapshot to support init historical league player; finish league demo basic code

* feature(nyz): modify selfplay demo and add two type game env

* style(nyz): correct format style

* polish(nyz): correct format style and adapt league demo main

* feature(nyz): add league payoff viz and enable payoff update in league demo

* feature(nyz): modify win rate calculation with draws

* test(nyz): fix one vs one league test compatibility bug

* test(nyz): add selfplay and league demo into unittest and algotest

* style(nyz): correct format

* hotfix(nyz): fix ppo continuous compatibility bug

73295c22

selfplay_demo_ppo_main.py 4.4 KB

OpenDILab Open-Source Decision Intelligence Platform / DI-engine (last synced more than 2 years ago)
