1. 03 8月, 2021 1 次提交
    • S
      serial training league demo (#12) · 73295c22
      Swain 提交于
      * feature(nyz): add naive 1v1 two player demo
      
      * feature(nyz): add 1v1 evaluator and 2 rule-based policy for evaluation
      
      * feature(nyz): modify game env and adjust hyper-param
      
      * feature(nyz): add naive league training multi player demo
      
      * feature(nyz): enable force snapshot to support init historical league player; finish league demo basic code
      
      * feature(nyz): modify selfplay demo and add two type game env
      
      * style(nyz): correct format style
      
      * polish(nyz): correct format style and adapt league demo main
      
      * feature(nyz): add league payoff viz and enable payoff update in league demo
      
      * feature(nyz): modify win rate calculation with draws
      
      * test(nyz): fix one vs one league test compatibility bug
      
      * test(nyz): add selfplay and league demo into unittest and algotest
      
      * style(nyz): correct format
      
      * hotfix(nyz): fix ppo continuous comatibility bug
      73295c22