- 18 3月, 2020 1 次提交
-
-
由 Hongsheng Zeng 提交于
* liftsim a2c baseline * update readme * compatible with different os * empty * refine comments * remove unnecessary assertion; add tensorboard guide * remove unnecessary assertion * update parl dependence of A2C
-
- 16 3月, 2020 1 次提交
-
-
由 Bo Zhou 提交于
* update comments for ES * check dependence on paddle or torch * update readme * update readme#2 * users can still use parl.remote when no DL framework was found * yapf
-
- 06 3月, 2020 1 次提交
-
-
由 Bo Zhou 提交于
* fix paddle version bug * add gym dependence (introduced by MADDPG) * recall
-
- 08 2月, 2020 1 次提交
-
-
由 rical730 提交于
* add maddpg example * format with yapf * fix coding style * fix coding style * unittest without import multiagent env * update maddpg code * update maddpg readme * add copyright comments
-
- 14 1月, 2020 1 次提交
-
-
由 LI Yunxiang 提交于
* add offline q learning * Update README.md * update * yapf
-
- 30 12月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
* add sac
-
- 21 12月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
-
- 17 12月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
-
- 11 12月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* Training pipeline of NeurIPS2019-Learn-to-Move-Challenge * fix grammar mistakes * release 1.2.1 * copyright * fix bug * refine README * refine README * fix typo * Update README.md * Update README.md
-
- 09 12月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
* Update reward calculation in QuickStart * update * yapf
-
- 04 12月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
* Update train.py remove create_actors thread in train.py * Update GA3C train.py
-
- 22 11月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
* add TD3 * update * yapf..... * Update train.py
-
- 18 11月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
* update dqn readme * update merge.png
-
- 16 11月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* make job run task in a separate process * fix typo * add more debug info in xparl client * refine control flow of different processes in xparl job * refine control flow of different processes in xparl job * remove tsinghua source * remove tsinghua source * remove unnecessary logic * fix typo * refine comments and some logic * fix bug, `decay=0` means totally synchronize weights of source model to target model
-
- 11 11月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
* add save_param in docs and quickstart * Update train.py
-
- 04 11月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* final submit models of NeurIPS2019 challenge * update readme * fix yapf * refine comment
-
- 29 10月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
-
- 24 10月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
* add Double & Dueling DQN * yapf...................... * update * Update train.py
-
- 25 9月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
* add dygraph pg * update acc. comments * update comments
-
- 17 9月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* Limit impala to single GPU training * refine comment of scheduler * refine comment
-
- 26 8月, 2019 1 次提交
-
-
由 Bo Zhou 提交于
* fix minor probmels in the docs * typo * remove pip source * fix monitor * add performance of A2C * Update README.md * modify logger for GPU detection
-
- 13 8月, 2019 1 次提交
-
-
由 Bo Zhou 提交于
* add learning curve for ES * add learning curve for ES * support new APIs of the cluster * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * rename learner.py * Update README.md * Update README.md * Update README.cn.md * Update README.md * Update README.cn.md * Update README.md
-
- 12 8月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* ES example * refine settings * fix yapf * refine documentation; remove csv logger * fix bug * merge learner.py and train.py; add version requirements of gym and atari_py * unify actor num
-
- 06 8月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
* add new_alg.rst * rename LiftSim_demo as LiftSim_baseline * Update new_alg.rst * Update new_alg.rst
-
- 05 8月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
* add liftsim baseline * yapf * yapf... * modify acc. comments * yapf * yapf.......... * yapf! why is yapf on paddle different from that on my mac!!!!!
-
- 02 8月, 2019 1 次提交
-
-
由 fuyw 提交于
* first pr * start a worker when the master is started. * First PR & Fix logger bugs. * update docs for a2c, impala and ga3c * update doc * yapf modification * update logger * yapf correct * yapf * setup.py * old setup.py * worker 86
-
- 01 8月, 2019 1 次提交
-
-
由 Bo Zhou 提交于
* new feature: save params * add unittest for save()/retore() * add an example demonstrating the usage * rename the variable * yapf * fix comment
-
- 29 7月, 2019 1 次提交
-
-
由 Bo Zhou 提交于
* fix some problems of tensorboard * yapf
-
- 26 7月, 2019 1 次提交
-
-
由 Bo Zhou 提交于
* fix the compatibility issue * fix the comment issue * support paddle 1.5.1 and replace PE with compiler * yapf©right * yapf * fix the teamcity problem * fix the teamcity problem * fix comment * only support paddle 1.5.1 * Cmake * fix comment
-
- 25 7月, 2019 1 次提交
-
-
由 Bo Zhou 提交于
* fix the compatibility issue * fix the comment issue
-
- 24 7月, 2019 2 次提交
- 10 7月, 2019 1 次提交
-
-
由 Bo Zhou 提交于
* make the quickstart more compact * remove args in the main function * yapf * add gif * remove render * Update README.md * Update README.md * Update README.md
-
- 05 7月, 2019 1 次提交
-
-
由 Bo Zhou 提交于
* Update README.cn.md * Update README.md * Update README.md * Update README.cn.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md
-
- 18 6月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* refine A2C example * fix unittest in python2; fix codestyle * fix codestyle * refine comment
-
- 23 4月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
-
- 19 4月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* add A2C benchmark; add more information in PyPI homepage * filter picture in PyPI homepage
-
- 18 4月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* fix typo * Update README.md * Update README.md * Update README.md * soft depend on fluid; add module to monitor client status * improve performance of IMPALA example * fix bug of some client cannot exit normally * refine comment * .
-
- 17 4月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* add IMPALA algorithm and some common utils * update README.md * refactor files structure of impala algorithm; seperate numpy utils from utils * add hyper parameter scheduler module; add entropy and lr scheduler in impala * clip reward in atari wrapper instead of learner side; fix codestyle * add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers * Update README.md * add a3c algorithm, A2C example and rl_utils * require training in single gpu/cpu * only check cpu/gpu num in learner * refine Readme * update impala benchmark picture; update Readme * add benchmark result of A2C * move get_params/set_params in agent_base * add GA3C example * Update README.md * Update README.md * Update README.md * Update README.md * refine Readme * add benchmark * add default safe eps in numpy logp calculation * refine document; make unittest stable
-
- 15 4月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* add IMPALA algorithm and some common utils * update README.md * refactor files structure of impala algorithm; seperate numpy utils from utils * add hyper parameter scheduler module; add entropy and lr scheduler in impala * clip reward in atari wrapper instead of learner side; fix codestyle * add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers * Update README.md * add a3c algorithm, A2C example and rl_utils * require training in single gpu/cpu * only check cpu/gpu num in learner * refine Readme * update impala benchmark picture; update Readme * add benchmark result of A2C * move get_params/set_params in agent_base * fix shell script cannot run in ubuntu * refine comment and document * Update README.md * Update README.md
-