- 25 3月, 2020 1 次提交
-
-
由 Hongsheng Zeng 提交于
* fix a2c cannot run in paddle 1.6.0 * fix impala compatibility * yapf
-
- 02 8月, 2019 1 次提交
-
-
由 fuyw 提交于
* first pr * start a worker when the master is started. * First PR & Fix logger bugs. * update docs for a2c, impala and ga3c * update doc * yapf modification * update logger * yapf correct * yapf * setup.py * old setup.py * worker 86
-
- 26 7月, 2019 1 次提交
-
-
由 Bo Zhou 提交于
* fix the compatibility issue * fix the comment issue * support paddle 1.5.1 and replace PE with compiler * yapf©right * yapf * fix the teamcity problem * fix the teamcity problem * fix comment * only support paddle 1.5.1 * Cmake * fix comment
-
- 25 7月, 2019 1 次提交
-
-
由 Bo Zhou 提交于
* fix the compatibility issue * fix the comment issue
-
- 24 7月, 2019 1 次提交
-
-
由 Bo Zhou 提交于
* intra-version: move parl.framework into parl.core.fluid * add folder: parl.core * remove former test folders * yapf * yapf0.24
-
- 18 6月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* refine A2C example * fix unittest in python2; fix codestyle * fix codestyle * refine comment
-
- 15 4月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* add IMPALA algorithm and some common utils * update README.md * refactor files structure of impala algorithm; seperate numpy utils from utils * add hyper parameter scheduler module; add entropy and lr scheduler in impala * clip reward in atari wrapper instead of learner side; fix codestyle * add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers * Update README.md * add a3c algorithm, A2C example and rl_utils * require training in single gpu/cpu * only check cpu/gpu num in learner * refine Readme * update impala benchmark picture; update Readme * add benchmark result of A2C * move get_params/set_params in agent_base * fix shell script cannot run in ubuntu * refine comment and document * Update README.md * Update README.md
-
- 08 4月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* add IMPALA algorithm and some common utils * update README.md * refactor files structure of impala algorithm; seperate numpy utils from utils * add hyper parameter scheduler module; add entropy and lr scheduler in impala * clip reward in atari wrapper instead of learner side; fix codestyle * add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers * Update README.md
-