- 14 7月, 2020 1 次提交
-
-
由 rical730 提交于
-
- 13 7月, 2020 1 次提交
-
-
由 rical730 提交于
-
- 06 3月, 2020 1 次提交
-
-
由 Bo Zhou 提交于
* fix paddle version bug * add gym dependence (introduced by MADDPG) * recall
-
- 13 8月, 2019 1 次提交
-
-
由 Bo Zhou 提交于
* add learning curve for ES * add learning curve for ES * support new APIs of the cluster * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * rename learner.py * Update README.md * Update README.md * Update README.cn.md * Update README.md * Update README.cn.md * Update README.md
-
- 12 8月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* ES example * refine settings * fix yapf * refine documentation; remove csv logger * fix bug * merge learner.py and train.py; add version requirements of gym and atari_py * unify actor num
-
- 02 8月, 2019 1 次提交
-
-
由 fuyw 提交于
* first pr * start a worker when the master is started. * First PR & Fix logger bugs. * update docs for a2c, impala and ga3c * update doc * yapf modification * update logger * yapf correct * yapf * setup.py * old setup.py * worker 86
-
- 26 7月, 2019 1 次提交
-
-
由 Bo Zhou 提交于
* fix the compatibility issue * fix the comment issue * support paddle 1.5.1 and replace PE with compiler * yapf©right * yapf * fix the teamcity problem * fix the teamcity problem * fix comment * only support paddle 1.5.1 * Cmake * fix comment
-
- 24 7月, 2019 1 次提交
-
-
由 Bo Zhou 提交于
* remove previous algorithms; use the forward function in parl.Model * remove abundant lines * yapf
-
- 05 7月, 2019 1 次提交
-
-
由 Bo Zhou 提交于
* Update README.cn.md * Update README.md * Update README.md * Update README.cn.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md
-
- 17 4月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* add IMPALA algorithm and some common utils * update README.md * refactor files structure of impala algorithm; seperate numpy utils from utils * add hyper parameter scheduler module; add entropy and lr scheduler in impala * clip reward in atari wrapper instead of learner side; fix codestyle * add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers * Update README.md * add a3c algorithm, A2C example and rl_utils * require training in single gpu/cpu * only check cpu/gpu num in learner * refine Readme * update impala benchmark picture; update Readme * add benchmark result of A2C * move get_params/set_params in agent_base * add GA3C example * Update README.md * Update README.md * Update README.md * Update README.md * refine Readme * add benchmark * add default safe eps in numpy logp calculation * refine document; make unittest stable
-
- 15 4月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* add IMPALA algorithm and some common utils * update README.md * refactor files structure of impala algorithm; seperate numpy utils from utils * add hyper parameter scheduler module; add entropy and lr scheduler in impala * clip reward in atari wrapper instead of learner side; fix codestyle * add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers * Update README.md * add a3c algorithm, A2C example and rl_utils * require training in single gpu/cpu * only check cpu/gpu num in learner * refine Readme * update impala benchmark picture; update Readme * add benchmark result of A2C * move get_params/set_params in agent_base * fix shell script cannot run in ubuntu * refine comment and document * Update README.md * Update README.md
-
- 08 4月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* add IMPALA algorithm and some common utils * update README.md * refactor files structure of impala algorithm; seperate numpy utils from utils * add hyper parameter scheduler module; add entropy and lr scheduler in impala * clip reward in atari wrapper instead of learner side; fix codestyle * add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers * Update README.md
-