- 23 3月, 2020 3 次提交
-
-
由 Bo Zhou 提交于
* fix compatibility issue with the newest paddle * remove logging lines * resolve the compatibility issue with the newest paddle * yapf Co-authored-by: Nrobot <zenghongsheng@baidu.com>
-
由 rical730 提交于
* add SGD and Adam Optimizer for DeepES * update deepes readme * add warning when input different size in the same param update() * add error return in update(), add optimizer.cc * separate SGD and Adam, optimizer type in config is not case sensitive * delete optimizer.cc * config optimizer in deepes.proto * more readable * update maddpg readme, fixed gym version
-
由 Bo Zhou 提交于
* add tutorial of deepes, written with numpy, less than 100lines * modify learning_rate as an arugment of Agent
-
- 22 3月, 2020 1 次提交
-
-
由 Bo Zhou 提交于
* fix compatibility issue with the newest paddle * remove logging lines Co-authored-by: Nrobot <zenghongsheng@baidu.com>
-
- 20 3月, 2020 1 次提交
-
-
由 Hongsheng Zeng 提交于
* mac compatibility * refine import trick
-
- 18 3月, 2020 2 次提交
-
-
由 Hongsheng Zeng 提交于
* liftsim a2c baseline * update readme * compatible with different os * empty * refine comments * remove unnecessary assertion; add tensorboard guide * remove unnecessary assertion * update parl dependence of A2C
-
由 Bo Zhou 提交于
* add deepES & a demo that is compatible with torch * add copyright & update protoc file path * add copyright * rm useless files * update dependency on libtorch * add the demonstration gif * update gif * Create README.md * Update README.md * Update README.md * Update README.md * update scripts * update scripts#2 * update torch_predictor
-
- 16 3月, 2020 1 次提交
-
-
由 Bo Zhou 提交于
* update comments for ES * check dependence on paddle or torch * update readme * update readme#2 * users can still use parl.remote when no DL framework was found * yapf
-
- 09 3月, 2020 1 次提交
-
-
由 rical730 提交于
* add maddpg example * format with yapf * fix coding style * fix coding style * unittest without import multiagent env * update maddpg code * update maddpg readme * add copyright comments * update parl.maddpg without import gym * update NeurlIPS2018.gif to NeurlIPS2019.gif * update readme and comments
-
- 06 3月, 2020 1 次提交
-
-
由 Bo Zhou 提交于
* fix paddle version bug * add gym dependence (introduced by MADDPG) * recall
-
- 03 3月, 2020 1 次提交
-
-
由 Hongsheng Zeng 提交于
* torch benchmark policy gradient * refine comments and use native api
-
- 08 2月, 2020 1 次提交
-
-
由 rical730 提交于
* add maddpg example * format with yapf * fix coding style * fix coding style * unittest without import multiagent env * update maddpg code * update maddpg readme * add copyright comments
-
- 15 1月, 2020 1 次提交
-
-
由 Bo Zhou 提交于
-
- 14 1月, 2020 1 次提交
-
-
由 LI Yunxiang 提交于
* add offline q learning * Update README.md * update * yapf
-
- 30 12月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
* add sac
-
- 23 12月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
-
- 21 12月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
-
- 17 12月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
-
- 11 12月, 2019 2 次提交
-
-
由 Hongsheng Zeng 提交于
* Training pipeline of NeurIPS2019-Learn-to-Move-Challenge * fix grammar mistakes * release 1.2.1 * copyright * fix bug * refine README * refine README * fix typo * Update README.md * Update README.md
-
由 Bo Zhou 提交于
-
- 09 12月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
* Update reward calculation in QuickStart * update * yapf
-
- 04 12月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
* Update train.py remove create_actors thread in train.py * Update GA3C train.py
-
- 27 11月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
-
- 22 11月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
* add TD3 * update * yapf..... * Update train.py
-
- 18 11月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
* update dqn readme * update merge.png
-
- 16 11月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* make job run task in a separate process * fix typo * add more debug info in xparl client * refine control flow of different processes in xparl job * refine control flow of different processes in xparl job * remove tsinghua source * remove tsinghua source * remove unnecessary logic * fix typo * refine comments and some logic * fix bug, `decay=0` means totally synchronize weights of source model to target model
-
- 15 11月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
-
- 11 11月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
* add save_param in docs and quickstart * Update train.py
-
- 06 11月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
* add pytorch a2c * add set/get_weights test & copyright * yapf.... * Update model_base_test_torch.py * update * Delete banma.py * Update model_base_test_torch.py * update * Update model.py * update torch tests * Update model_base_test_torch.py
-
- 04 11月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* final submit models of NeurIPS2019 challenge * update readme * fix yapf * refine comment
-
- 30 10月, 2019 2 次提交
- 29 10月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
-
- 24 10月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
* add Double & Dueling DQN * yapf...................... * update * Update train.py
-
- 22 10月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
-
- 16 10月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
-
- 08 10月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
* torch test env * Update build.sh * update torch unit test
-
- 25 9月, 2019 2 次提交
-
-
由 fuyw 提交于
* git commit -m torchdqn * yapf * fix bugs * fix bugs * fix bugs * yapf * remove fstring format * torch_test yapf * yapf * Add torch in unittest.requirements * update torch_unittest * Torch and FLUID conflict problem in __init__.py * Unittest fail for torch when both torch and fluid exists. * cluster_test fail in the unittest, add timeout seconds. * Torch backend for PARL * add sleep time for unit test send_job_test.py * Unit test for send_job_test.py * use multiple try for unit test * Fix compatibility for python2.7. * fix send_job_test.py bugs * check file exist before send_job_test.py * Modify send_job_test.py
-
由 LI Yunxiang 提交于
* add dygraph pg * update acc. comments * update comments
-
- 17 9月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* Limit impala to single GPU training * refine comment of scheduler * refine comment
-