- 14 Jul, 2020 1 commit
Committed by rical730
- 13 Jul, 2020 1 commit
Committed by rical730
- 01 Jul, 2020 1 commit
Committed by rical730
* fix self.alg * torch agent initialization * remove definition of self.alg in PPO * replace self.algorithm with self.alg * remove unnecessary definition of self.alg * fix cn readme * unittest * yapf
- 24 Jun, 2020 1 commit
Committed by Bo Zhou
- 23 Jun, 2020 1 commit
Committed by rical730
* add tutorials * yapf * yapf * copyright * yapf * update tutorial lesson5 * delete drawing code * yapf * remove action_mapping * update dqn and add README * update * update * yapf * add tutorials homework
- 16 Jun, 2020 1 commit
Committed by rical730
* update tutorials
- 10 Jun, 2020 2 commits
- 29 May, 2020 1 commit
Committed by Bo Zhou
* replace tensorboard with summary to support VDL in the future * unittest * rename keys for record * yapf
- 07 May, 2020 1 commit
Committed by LI Yunxiang
* update ddpg * Update train.py
- 06 May, 2020 1 commit
Committed by LI Yunxiang
* use cartpole-v0 in dqn * Update README.md
- 30 Apr, 2020 1 commit
Committed by LI Yunxiang
* state to obs * yapf & update softlink in offline-q-learning
- 28 Apr, 2020 1 commit
Committed by LI Yunxiang
* add simple dqn * Update README.md * Update train.py * update * update image in README * update readme * simplify * yapf * Update README.md * Update README.md * Update README.md * Update train.py * yapf
- 27 Apr, 2020 1 commit
Committed by LI Yunxiang
* remove version 1.3 warnings * update * yapf * add algorithms test * Update algs_test.py * Update algs_test.py add SAC DDPG TD3 tests * yapf
- 30 Mar, 2020 1 commit
Committed by Hongsheng Zeng
* release v1.2.3 * change dep of liftsim a2c
- 25 Mar, 2020 1 commit
Committed by Hongsheng Zeng
* fix a2c cannot run in paddle 1.6.0 * fix impala compatibility * yapf
- 24 Mar, 2020 1 commit
Committed by Hongsheng Zeng
- 23 Mar, 2020 3 commits
Committed by Bo Zhou
* fix compatibility issue with the newest paddle * remove logging lines * resolve the compatibility issue with the newest paddle * yapf Co-authored-by: Nrobot <zenghongsheng@baidu.com>
Committed by rical730
* add SGD and Adam Optimizer for DeepES * update deepes readme * add warning when input different size in the same param update() * add error return in update(), add optimizer.cc * separate SGD and Adam, optimizer type in config is not case sensitive * delete optimizer.cc * config optimizer in deepes.proto * more readable * update maddpg readme, fixed gym version
Committed by Bo Zhou
* add tutorial of deepes, written with numpy, less than 100 lines * modify learning_rate as an argument of Agent
- 18 Mar, 2020 1 commit
Committed by Hongsheng Zeng
* liftsim a2c baseline * update readme * compatible with different os * empty * refine comments * remove unnecessary assertion; add tensorboard guide * remove unnecessary assertion * update parl dependence of A2C
- 16 Mar, 2020 1 commit
Committed by Bo Zhou
* update comments for ES * check dependence on paddle or torch * update readme * update readme#2 * users can still use parl.remote when no DL framework was found * yapf
- 06 Mar, 2020 1 commit
Committed by Bo Zhou
* fix paddle version bug * add gym dependence (introduced by MADDPG) * recall
- 08 Feb, 2020 1 commit
Committed by rical730
* add maddpg example * format with yapf * fix coding style * fix coding style * unittest without import multiagent env * update maddpg code * update maddpg readme * add copyright comments
- 14 Jan, 2020 1 commit
Committed by LI Yunxiang
* add offline q learning * Update README.md * update * yapf
- 30 Dec, 2019 1 commit
Committed by LI Yunxiang
* add sac
- 21 Dec, 2019 1 commit
Committed by Hongsheng Zeng
- 17 Dec, 2019 1 commit
Committed by Hongsheng Zeng
- 11 Dec, 2019 1 commit
Committed by Hongsheng Zeng
* Training pipeline of NeurIPS2019-Learn-to-Move-Challenge * fix grammar mistakes * release 1.2.1 * copyright * fix bug * refine README * refine README * fix typo * Update README.md * Update README.md
- 09 Dec, 2019 1 commit
Committed by LI Yunxiang
* Update reward calculation in QuickStart * update * yapf
- 04 Dec, 2019 1 commit
Committed by LI Yunxiang
* Update train.py remove create_actors thread in train.py * Update GA3C train.py
- 22 Nov, 2019 1 commit
Committed by LI Yunxiang
* add TD3 * update * yapf..... * Update train.py
- 18 Nov, 2019 1 commit
Committed by LI Yunxiang
* update dqn readme * update merge.png
- 16 Nov, 2019 1 commit
Committed by Hongsheng Zeng
* make job run task in a separate process * fix typo * add more debug info in xparl client * refine control flow of different processes in xparl job * refine control flow of different processes in xparl job * remove tsinghua source * remove tsinghua source * remove unnecessary logic * fix typo * refine comments and some logic * fix bug, `decay=0` means totally synchronize weights of source model to target model
- 11 Nov, 2019 1 commit
Committed by LI Yunxiang
* add save_param in docs and quickstart * Update train.py
- 04 Nov, 2019 1 commit
Committed by Hongsheng Zeng
* final submit models of NeurIPS2019 challenge * update readme * fix yapf * refine comment
- 29 Oct, 2019 1 commit
Committed by LI Yunxiang
- 24 Oct, 2019 1 commit
Committed by LI Yunxiang
* add Double & Dueling DQN * yapf...................... * update * Update train.py
- 25 Sep, 2019 1 commit
Committed by LI Yunxiang
* add dygraph pg * update acc. comments * update comments
- 17 Sep, 2019 1 commit
Committed by Hongsheng Zeng
* Limit impala to single GPU training * refine comment of scheduler * refine comment