    serial training league demo (#12) · 73295c22
    Committed by Swain
    * feature(nyz): add naive 1v1 two player demo
    
    * feature(nyz): add 1v1 evaluator and 2 rule-based policy for evaluation
    
    * feature(nyz): modify game env and adjust hyper-param
    
    * feature(nyz): add naive league training multi player demo
    
    * feature(nyz): enable forced snapshot to support initializing historical league players; finish basic league demo code
    
    * feature(nyz): modify selfplay demo and add two type game env
    
    * style(nyz): correct format style
    
    * polish(nyz): correct format style and adapt league demo main
    
    * feature(nyz): add league payoff viz and enable payoff update in league demo
    
    * feature(nyz): modify win rate calculation with draws
    
    * test(nyz): fix one vs one league test compatibility bug
    
    * test(nyz): add selfplay and league demo into unittest and algotest
    
    * style(nyz): correct format
    
    * hotfix(nyz): fix ppo continuous compatibility bug
selfplay_demo_ppo_main.py 4.4 KB