提交 · 452050a0383e2cb1bc14a97bb5f4afe4c5c53b4d · PaddlePaddle / PARL

08 4月, 2019 1 次提交

implement of IMPALA with the newest parallel design (#60) · b28289ac

由 Hongsheng Zeng 提交于 4月 08, 2019

* add IMPALA algorithm and some common utils

* update README.md

* refactor files structure of impala algorithm; seperate numpy utils from utils

* add hyper parameter scheduler module; add entropy and lr scheduler in impala

* clip reward in atari wrapper instead of learner side; fix codestyle

* add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers

* Update README.md

b28289ac

26 3月, 2019 1 次提交

add api set_params/get_params in Model (#56) · 7346a23d

由 Hongsheng Zeng 提交于 3月 26, 2019

* add api set_params/get_params in Model; add Interface of Network and LayerFunc to solve circular imports; refine parameter_names api of Model

* remove licence in third party code; remove interface of Network and LayerFunc; move get_parameter_pairs and get_parameter_names api to Network

* refine comment

* refine commment

7346a23d

11 3月, 2019 1 次提交

update documents (#58) · d8449b74

由 Bo Zhou 提交于 3月 11, 2019

* Update README.md

* Update train.py

* Update README.md

* Update agent_base.py

* Update train.py

* Update train.py

* Update train.py

d8449b74

01 3月, 2019 1 次提交
- B
  Update some docs. (#51) · 46188cd4
  由 Bo Zhou 提交于 3月 01, 2019
```
* Update model_base.py

* Update README.md

* Update README.md
```
  46188cd4
14 2月, 2019 1 次提交

fix PPO bug; add more benchmark result (#47) · 65ad2a4e

由 Hongsheng Zeng 提交于 2月 14, 2019

* fix PPO bug; add more benchmark result

* refine code

* update benchmark of PPO, after fix bug

* refine code

65ad2a4e

24 1月, 2019 1 次提交

Add more dqn benchmark result and unify train scripts (#46) · 6fdf4448

由 Hongsheng Zeng 提交于 1月 24, 2019

* add more dqn benchmark result; unify train scripts

* resize benchmark picture

* resize benchmark picture, refine comments of args

* change dependence, mujoco only support python3 now

6fdf4448

18 1月, 2019 1 次提交

Refine documents of PARL (#43) · 7a7583ab

由 Hongsheng Zeng 提交于 1月 18, 2019

* remove not used files, add benchmark for DQN and DDPG, add Parameters management Readme

* Update README.md

* Update README.md

* add parl dependence in examples, use np shuffle instead of sklean

* fix codestyle

* refine readme of nips example

* fix bug

* fix code style

* Update README.md

* Update README.md

* Update README.md

* refine document and remove outdated design doc

* Update README.md

* Update README.md

* refine comment

* release version 1.0

* gif of examples

* Update README.md

* update Readme

7a7583ab

15 1月, 2019 2 次提交

B
update readme for competition folder (#42) · 4163d732
由 Bo Zhou 提交于 1月 15, 2019
```
* Update README.md

* add experimental results
```
4163d732

NeurIPS2018-AI-for-Prosthetics-Challenge training code (#40) · cdb50056

由 Hongsheng Zeng 提交于 1月 15, 2019

* NeurIPS2018-AI-for-Prosthetics-Challenge training code

* remove model_zoo, provide download link

* remove model_zoo, provide download link

* add restore_from_one_head api, refine README, fix logger bug

* fix test bug

* fix rpm bug, refine ddpg train script

* fix rpm bug, refine Readme

cdb50056

04 1月, 2019 1 次提交

add PPO example (#39) · f8de849b

由 Hongsheng Zeng 提交于 1月 04, 2019

* add PPO example

* Update Readme

* Update Readme

* fix codestyle

* Update Readme

* refine action mapping

* add more unitest case

* remove unnecessary params initialize, add more comments, add benchmark result

* rename

* remove PARL dependence in readme of examples

f8de849b

15 12月, 2018 1 次提交

Add DDPG example (#36) · 53c94787

由 Hongsheng Zeng 提交于 12月 15, 2018

* add DDPG example, fix some tiny bug

* add license

* unify code structure

* unify code structure

* refine gputils, fix seed in QuickStart

* use white noise in DDPG

* fix codestyle

53c94787

12 12月, 2018 1 次提交
- D
  add a episode in quick start to show the final test reward (#37) · 58e8fe28
  由 Davanoffi Liang 提交于 12月 12, 2018
```
* add a episode to show the final test reward

* make code more clear
```
  58e8fe28
07 12月, 2018 1 次提交

Add QuickStart example (#35) · cdd4622a

由 Hongsheng Zeng 提交于 12月 06, 2018

* add QuickStart example, refine DQN example

* add examples link

* refine the naming, and add quick start training result

cdd4622a

04 12月, 2018 1 次提交

DQN example (#33) · 4a4366a5

由 Hongsheng Zeng 提交于 12月 04, 2018

* add DQN example, add Agent unittest

* refine readme

* refine  code

* simplify code

4a4366a5

30 11月, 2018 1 次提交

add testing module of NeurIPS2018-AI-for-Prosthetics-Challenge (#32) · b249dee3

由 Hongsheng Zeng 提交于 11月 30, 2018

* add testing module of NeurIPS2018-AI-for-Prosthetics-Challenge, add dependencies of setup

* add copyright

* add google drive link

* fix depedencie

* refine setup

b249dee3

30 9月, 2018 1 次提交
- T
  add IARL directory for camera-ready paper of CoRL2018 · b32f8940
  由 TomorrowIsAnOtherDay 提交于 9月 30, 2018
```
code will be released before 31.October (#20)
```
  b32f8940