add ACER algorithm(szj) (#14)
* add endoro env config. add enduro's ppo,dqn,drdqn,rainbow,impala config. * modified as reviewer mentions * add qacd network * fix bugs * fix bugs * update acer algorithm * update ACER code * update acer config * fix bug * update pong acer's config * edit commit * update code as mention * fix the comment table and trust region * fix format * fix typing lint * fix format,flake8 * fix format * fix whitespace problem * test(nyz): add acer unittest and algotest * style(nyz): correct flake8 style Co-authored-by: Nshenziju <simonshen2011@foxmail.com> Co-authored-by: NSwain <niuyazhe314@outlook.com>
Showing
ding/model/template/acer.py
0 → 100644
ding/policy/acer.py
0 → 100644
此差异已折叠。
ding/rl_utils/acer.py
0 → 100644
ding/rl_utils/retrace.py
0 → 100644
想要评论请 注册 或 登录