- 14 Jul, 2020 1 commit
  - Committed by rical730
- 13 Jul, 2020 1 commit
  - Committed by rical730
- 29 May, 2020 1 commit
  - Committed by Bo Zhou
    * replace tensorboard with summary to support VDL in the future
    * unittest
    * rename keys for record
    * yapf
- 25 Mar, 2020 1 commit
  - Committed by Hongsheng Zeng
    * fix a2c cannot run in paddle 1.6.0
    * fix impala compatibility
    * yapf
- 23 Mar, 2020 1 commit
  - Committed by Bo Zhou
    * fix compatibility issue with the newest paddle
    * remove logging lines
    * resolve the compatibility issue with the newest paddle
    * yapf

    Co-authored-by: Nrobot <zenghongsheng@baidu.com>
- 06 Mar, 2020 1 commit
  - Committed by Bo Zhou
    * fix paddle version bug
    * add gym dependence (introduced by MADDPG)
    * recall
- 04 Dec, 2019 1 commit
  - Committed by LI Yunxiang
    * Update train.py: remove create_actors thread in train.py
    * Update GA3C train.py
- 17 Sep, 2019 1 commit
  - Committed by Hongsheng Zeng
    * Limit impala to single GPU training
    * refine comment of scheduler
    * refine comment
- 13 Aug, 2019 1 commit
  - Committed by Bo Zhou
    * add learning curve for ES
    * add learning curve for ES
    * support new APIs of the cluster
    * Update README.md
    * Update README.md
    * Update README.md
    * Update README.md
    * Update README.md
    * rename learner.py
    * Update README.md
    * Update README.md
    * Update README.cn.md
    * Update README.md
    * Update README.cn.md
    * Update README.md
- 12 Aug, 2019 1 commit
  - Committed by Hongsheng Zeng
    * ES example
    * refine settings
    * fix yapf
    * refine documentation; remove csv logger
    * fix bug
    * merge learner.py and train.py; add version requirements of gym and atari_py
    * unify actor num
- 02 Aug, 2019 1 commit
  - Committed by fuyw
    * first pr
    * start a worker when the master is started.
    * First PR & Fix logger bugs.
    * update docs for a2c, impala and ga3c
    * update doc
    * yapf modification
    * update logger
    * yapf correct
    * yapf
    * setup.py
    * old setup.py
    * worker 86
- 26 Jul, 2019 1 commit
  - Committed by Bo Zhou
    * fix the compatibility issue
    * fix the comment issue
    * support paddle 1.5.1 and replace PE with compiler
    * yapf & copyright
    * yapf
    * fix the teamcity problem
    * fix the teamcity problem
    * fix comment
    * only support paddle 1.5.1
    * Cmake
    * fix comment
- 24 Jul, 2019 2 commits
- 05 Jul, 2019 1 commit
  - Committed by Bo Zhou
    * Update README.cn.md
    * Update README.md
    * Update README.md
    * Update README.cn.md
    * Update README.md
    * Update README.md
    * Update README.md
    * Update README.md
    * Update README.md
- 18 Apr, 2019 1 commit
  - Committed by Hongsheng Zeng
    * fix typo
    * Update README.md
    * Update README.md
    * Update README.md
    * soft depend on fluid; add module to monitor client status
    * improve performance of IMPALA example
    * fix bug of some client cannot exit normally
    * refine comment
    * .
- 17 Apr, 2019 1 commit
  - Committed by Hongsheng Zeng
    * add IMPALA algorithm and some common utils
    * update README.md
    * refactor files structure of impala algorithm; separate numpy utils from utils
    * add hyper parameter scheduler module; add entropy and lr scheduler in impala
    * clip reward in atari wrapper instead of learner side; fix codestyle
    * add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers
    * Update README.md
    * add a3c algorithm, A2C example and rl_utils
    * require training in single gpu/cpu
    * only check cpu/gpu num in learner
    * refine Readme
    * update impala benchmark picture; update Readme
    * add benchmark result of A2C
    * move get_params/set_params in agent_base
    * add GA3C example
    * Update README.md
    * Update README.md
    * Update README.md
    * Update README.md
    * refine Readme
    * add benchmark
    * add default safe eps in numpy logp calculation
    * refine document; make unittest stable
- 15 Apr, 2019 1 commit
  - Committed by Hongsheng Zeng
    * add IMPALA algorithm and some common utils
    * update README.md
    * refactor files structure of impala algorithm; separate numpy utils from utils
    * add hyper parameter scheduler module; add entropy and lr scheduler in impala
    * clip reward in atari wrapper instead of learner side; fix codestyle
    * add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers
    * Update README.md
    * add a3c algorithm, A2C example and rl_utils
    * require training in single gpu/cpu
    * only check cpu/gpu num in learner
    * refine Readme
    * update impala benchmark picture; update Readme
    * add benchmark result of A2C
    * move get_params/set_params in agent_base
    * fix shell script cannot run in ubuntu
    * refine comment and document
    * Update README.md
    * Update README.md
- 08 Apr, 2019 1 commit
  - Committed by Hongsheng Zeng
    * add IMPALA algorithm and some common utils
    * update README.md
    * refactor files structure of impala algorithm; separate numpy utils from utils
    * add hyper parameter scheduler module; add entropy and lr scheduler in impala
    * clip reward in atari wrapper instead of learner side; fix codestyle
    * add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers
    * Update README.md