- 29 May, 2020 (1 commit)
  Committed by Bo Zhou
  * replace tensorboard with summary to support VDL in the future
  * unittest
  * rename keys for record
  * yapf
- 30 Apr, 2020 (1 commit)
  Committed by LI Yunxiang
  * state to obs
  * yapf & update softlink in offline-q-learning
- 28 Apr, 2020 (1 commit)
  Committed by LI Yunxiang
  * add simple dqn
  * Update README.md
  * Update train.py
  * update
  * update image in README
  * update readme
  * simplify
  * yapf
  * Update README.md
  * Update README.md
  * Update README.md
  * Update train.py
  * yapf
- 29 Oct, 2019 (1 commit)
  Committed by LI Yunxiang
- 24 Oct, 2019 (1 commit)
  Committed by LI Yunxiang
  * add Double & Dueling DQN
  * yapf......................
  * update
  * Update train.py
- 24 Jul, 2019 (1 commit)
  Committed by Bo Zhou
  * intra-version: move parl.framework into parl.core.fluid
  * add folder: parl.core
  * remove former test folders
  * yapf
  * yapf0.24
- 11 Mar, 2019 (1 commit)
  Committed by Bo Zhou
  * Update README.md
  * Update train.py
  * Update README.md
  * Update agent_base.py
  * Update train.py
  * Update train.py
  * Update train.py
- 24 Jan, 2019 (1 commit)
  Committed by Hongsheng Zeng
  * add more dqn benchmark result; unify train scripts
  * resize benchmark picture
  * resize benchmark picture, refine comments of args
  * change dependence, mujoco only support python3 now
- 15 Jan, 2019 (1 commit)
  Committed by Hongsheng Zeng
  * NeurIPS2018-AI-for-Prosthetics-Challenge training code
  * remove model_zoo, provide download link
  * remove model_zoo, provide download link
  * add restore_from_one_head api, refine README, fix logger bug
  * fix test bug
  * fix rpm bug, refine ddpg train script
  * fix rpm bug, refine Readme
- 15 Dec, 2018 (1 commit)
  Committed by Hongsheng Zeng
  * add DDPG example, fix some tiny bug
  * add license
  * unify code structure
  * unify code structure
  * refine gputils, fix seed in QuickStart
  * use white noise in DDPG
  * fix codestyle
- 07 Dec, 2018 (1 commit)
  Committed by Hongsheng Zeng
  * add QuickStart example, refine DQN example
  * add examples link
  * refine the naming, and add quick start training result
- 04 Dec, 2018 (1 commit)
  Committed by Hongsheng Zeng
  * add DQN example, add Agent unittest
  * refine readme
  * refine code
  * simplify code