提交 · fa93980e8306c9984f840e5e9c253b8ebe29f499 · PaddlePaddle / PARL

14 7月, 2020 1 次提交
- R
  
  update readme (#346) · fa93980e
  由 rical730 提交于 7月 14, 2020
  
  fa93980e
13 7月, 2020 1 次提交
- R
  
  update readme (#344) · fb43d292
  由 rical730 提交于 7月 13, 2020
  
  fb43d292
29 5月, 2020 1 次提交

replace tensorboard with summary to support VDL (#276) · bcface6a

由 Bo Zhou 提交于 5月 29, 2020

* replace tensorboard with summary to support VDL in the future

* unittest

* rename keys for record

* yapf

bcface6a

25 3月, 2020 1 次提交
- H
  fix a2c cannot run in paddle 1.6.0 (#232) · f46ad361
  由 Hongsheng Zeng 提交于 3月 25, 2020
```
* fix a2c cannot run in paddle 1.6.0

* fix impala compatibility

* yapf
```
  f46ad361
23 3月, 2020 1 次提交

resolve the compatibility issue (#226) · aede5aee

由 Bo Zhou 提交于 3月 23, 2020

* fix compatibility issue with the newest paddle

* remove logging lines

* resolve the compatibility issue with the newest paddle

* yapf
Co-authored-by: Nrobot <zenghongsheng@baidu.com>

aede5aee

06 3月, 2020 1 次提交
- B
  fix paddle version bug (#207) · 450a4a34
  由 Bo Zhou 提交于 3月 06, 2020
```
* fix paddle version bug

* add gym dependence (introduced by MADDPG)

* recall
```
  450a4a34
04 12月, 2019 1 次提交

Update train.py (#181) · dbb5931a

由 LI Yunxiang 提交于 12月 04, 2019

* Update train.py

remove create_actors thread in train.py

* Update GA3C train.py

dbb5931a

17 9月, 2019 1 次提交
- H
  Limit impala to single GPU training (#152) · 89c3366b
  由 Hongsheng Zeng 提交于 9月 17, 2019
```
* Limit impala to single GPU training

* refine comment of scheduler

* refine comment
```
  89c3366b
13 8月, 2019 1 次提交

Zhoubo01 es (#127) · 5612ecde

由 Bo Zhou 提交于 8月 13, 2019

* add learning curve for ES

* add learning curve for ES

* support new APIs of the cluster

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* rename learner.py

* Update README.md

* Update README.md

* Update README.cn.md

* Update README.md

* Update README.cn.md

* Update README.md

5612ecde

12 8月, 2019 1 次提交

ES example (#105) · 60d68135

由 Hongsheng Zeng 提交于 8月 12, 2019

* ES example

* refine settings

* fix yapf

* refine documentation; remove csv logger

* fix bug

* merge learner.py and train.py; add version requirements of gym and atari_py

* unify actor num

60d68135

02 8月, 2019 1 次提交

first pr (#113) · b29a1ec1

由 fuyw 提交于 8月 02, 2019

* first pr

* start a worker when the master is started.

* First PR & Fix logger bugs.

* update docs for a2c, impala and ga3c

* update doc

* yapf modification

* update logger

* yapf correct

* yapf

* setup.py

* old setup.py

* worker 86

b29a1ec1

26 7月, 2019 1 次提交

replace PE with compiler(new feature in paddle151). (#99) · d33f3002

由 Bo Zhou 提交于 7月 26, 2019

* fix the compatibility issue

* fix the comment issue

* support paddle 1.5.1 and replace PE with compiler

* yapf&copyright

* yapf

* fix the teamcity problem

* fix the teamcity problem

* fix comment

* only support paddle 1.5.1

* Cmake

* fix comment

d33f3002

24 7月, 2019 2 次提交

B
remove previous algorithms; use the forward function in parl.Model (#96) · 0e174277
由 Bo Zhou 提交于 7月 24, 2019
```
* remove previous algorithms; use the forward function in parl.Model

* remove abundant lines

* yapf
```
0e174277

breaking changes#1 (#95) · 6efa7871

由 Bo Zhou 提交于 7月 24, 2019

* intra-version: move parl.framework into parl.core.fluid

* add folder: parl.core

* remove former test folders

* yapf

* yapf0.24

6efa7871

05 7月, 2019 1 次提交

Documents cn (#85) · 96c58265

由 Bo Zhou 提交于 7月 05, 2019

* Update README.cn.md

* Update README.md

* Update README.md

* Update README.cn.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

96c58265

18 4月, 2019 1 次提交

Refine (#67) · 3556c786

由 Hongsheng Zeng 提交于 4月 18, 2019

* fix typo

* Update README.md

* Update README.md

* Update README.md

* soft depend on fluid; add module to monitor client status

* improve performance of IMPALA example

* fix bug of some client cannot exit normally

* refine comment

* .

3556c786

17 4月, 2019 1 次提交

GA3C example (#63) · 3c511e8f

由 Hongsheng Zeng 提交于 4月 17, 2019

* add IMPALA algorithm and some common utils

* update README.md

* refactor files structure of impala algorithm; seperate numpy utils from utils

* add hyper parameter scheduler module; add entropy and lr scheduler in impala

* clip reward in atari wrapper instead of learner side; fix codestyle

* add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers

* Update README.md

* add a3c algorithm, A2C example and rl_utils

* require training in single gpu/cpu

* only check cpu/gpu num in learner

* refine Readme

* update impala benchmark picture; update Readme

* add benchmark result of A2C

* move get_params/set_params in agent_base

* add GA3C example

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* refine Readme

* add benchmark

* add default safe eps in numpy logp calculation

* refine document; make unittest stable

3c511e8f

15 4月, 2019 1 次提交

A2C example (#62) · 39846831

由 Hongsheng Zeng 提交于 4月 15, 2019

* add IMPALA algorithm and some common utils

* update README.md

* refactor files structure of impala algorithm; seperate numpy utils from utils

* add hyper parameter scheduler module; add entropy and lr scheduler in impala

* clip reward in atari wrapper instead of learner side; fix codestyle

* add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers

* Update README.md

* add a3c algorithm, A2C example and rl_utils

* require training in single gpu/cpu

* only check cpu/gpu num in learner

* refine Readme

* update impala benchmark picture; update Readme

* add benchmark result of A2C

* move get_params/set_params in agent_base

* fix shell script cannot run in ubuntu

* refine comment and document

* Update README.md

* Update README.md

39846831

08 4月, 2019 1 次提交

implement of IMPALA with the newest parallel design (#60) · b28289ac

由 Hongsheng Zeng 提交于 4月 08, 2019

* add IMPALA algorithm and some common utils

* update README.md

* refactor files structure of impala algorithm; seperate numpy utils from utils

* add hyper parameter scheduler module; add entropy and lr scheduler in impala

* clip reward in atari wrapper instead of learner side; fix codestyle

* add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers

* Update README.md

b28289ac