- 09 3月, 2020 1 次提交
-
-
由 rical730 提交于
* add maddpg example * format with yapf * fix coding style * fix coding style * unittest without import multiagent env * update maddpg code * update maddpg readme * add copyright comments * update parl.maddpg without import gym * update NeurlIPS2018.gif to NeurlIPS2019.gif * update readme and comments
-
- 03 3月, 2020 1 次提交
-
-
由 Hongsheng Zeng 提交于
* torch benchmark policy gradient * refine comments and use native api
-
- 08 2月, 2020 1 次提交
-
-
由 rical730 提交于
* add maddpg example * format with yapf * fix coding style * fix coding style * unittest without import multiagent env * update maddpg code * update maddpg readme * add copyright comments
-
- 30 12月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
* add sac
-
- 11 12月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* Training pipeline of NeurIPS2019-Learn-to-Move-Challenge * fix grammar mistakes * release 1.2.1 * copyright * fix bug * refine README * refine README * fix typo * Update README.md * Update README.md
-
- 27 11月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
-
- 22 11月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
* add TD3 * update * yapf..... * Update train.py
-
- 06 11月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
* add pytorch a2c * add set/get_weights test & copyright * yapf.... * Update model_base_test_torch.py * update * Delete banma.py * Update model_base_test_torch.py * update * Update model.py * update torch tests * Update model_base_test_torch.py
-
- 24 10月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
* add Double & Dueling DQN * yapf...................... * update * Update train.py
-
- 25 9月, 2019 1 次提交
-
-
由 fuyw 提交于
* git commit -m torchdqn * yapf * fix bugs * fix bugs * fix bugs * yapf * remove fstring format * torch_test yapf * yapf * Add torch in unittest.requirements * update torch_unittest * Torch and FLUID conflict problem in __init__.py * Unittest fail for torch when both torch and fluid exists. * cluster_test fail in the unittest, add timeout seconds. * Torch backend for PARL * add sleep time for unit test send_job_test.py * Unit test for send_job_test.py * use multiple try for unit test * Fix compatibility for python2.7. * fix send_job_test.py bugs * check file exist before send_job_test.py * Modify send_job_test.py
-
- 12 8月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* fix bug of ParamAttr * refine imports of unittest
-
- 24 7月, 2019 2 次提交
- 15 4月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* add IMPALA algorithm and some common utils * update README.md * refactor files structure of impala algorithm; seperate numpy utils from utils * add hyper parameter scheduler module; add entropy and lr scheduler in impala * clip reward in atari wrapper instead of learner side; fix codestyle * add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers * Update README.md * add a3c algorithm, A2C example and rl_utils * require training in single gpu/cpu * only check cpu/gpu num in learner * refine Readme * update impala benchmark picture; update Readme * add benchmark result of A2C * move get_params/set_params in agent_base * fix shell script cannot run in ubuntu * refine comment and document * Update README.md * Update README.md
-
- 08 4月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* add IMPALA algorithm and some common utils * update README.md * refactor files structure of impala algorithm; seperate numpy utils from utils * add hyper parameter scheduler module; add entropy and lr scheduler in impala * clip reward in atari wrapper instead of learner side; fix codestyle * add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers * Update README.md
-
- 04 1月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* add PPO example * Update Readme * Update Readme * fix codestyle * Update Readme * refine action mapping * add more unitest case * remove unnecessary params initialize, add more comments, add benchmark result * rename * remove PARL dependence in readme of examples
-
- 15 12月, 2018 1 次提交
-
-
由 Hongsheng Zeng 提交于
* add DDPG example, fix some tiny bug * add license * unify code structure * unify code structure * refine gputils, fix seed in QuickStart * use white noise in DDPG * fix codestyle
-
- 07 12月, 2018 1 次提交
-
-
由 Hongsheng Zeng 提交于
* add QuickStart example, refine DQN example * add examples link * refine the naming, and add quick start training result
-
- 04 12月, 2018 1 次提交
-
-
由 Hongsheng Zeng 提交于
* add DQN example, add Agent unittest * refine readme * refine code * simplify code
-