提交 · 6b70b81d110714084ceabb3f756e86f1affdaabe · PaddlePaddle / PARL

18 3月, 2020 1 次提交

由 Hongsheng Zeng 提交于 3月 18, 2020

* liftsim a2c baseline

* update readme

* compatible with different os

* empty

* refine comments

* remove unnecessary assertion; add tensorboard guide

* remove unnecessary assertion

* update parl dependence of A2C

6b70b81d

16 3月, 2020 1 次提交

update comments for ES (#211) · fa420300

由 Bo Zhou 提交于 3月 16, 2020

* update comments for ES

* check dependence on paddle or torch

* update readme

* update readme#2

* users can still use parl.remote when no DL framework was found

* yapf

fa420300

06 3月, 2020 1 次提交
- B
  fix paddle version bug (#207) · 450a4a34
  由 Bo Zhou 提交于 3月 06, 2020
```
* fix paddle version bug

* add gym dependence (introduced by MADDPG)

* recall
```
  450a4a34
08 2月, 2020 1 次提交

add maddpg example (#200) · 9216d941

由 rical730 提交于 2月 08, 2020

* add maddpg example

* format with yapf

* fix coding style

* fix coding style

* unittest without import multiagent env

* update maddpg code

* update maddpg readme

* add copyright comments

9216d941

14 1月, 2020 1 次提交
- L
  add offline q learning (#193) · 6a672c80
  由 LI Yunxiang 提交于 1月 14, 2020
```
* add offline q learning

* Update README.md

* update

* yapf
```
  6a672c80
30 12月, 2019 1 次提交
- L
  add sac (#188) · c070db83
  由 LI Yunxiang 提交于 12月 30, 2019
```
* add sac
```
  c070db83
21 12月, 2019 1 次提交
- H
  
  fix typo; refine readme (#190) · cb4b3852
  由 Hongsheng Zeng 提交于 12月 21, 2019
  
  cb4b3852
17 12月, 2019 1 次提交
- H
  
  fix paddle version bug; refine scripts (#186) · c5a8c2ba
  由 Hongsheng Zeng 提交于 12月 17, 2019
  
  c5a8c2ba
11 12月, 2019 1 次提交

Training pipeline of NeurIPS2019-Learn-to-Move-Challenge (#183) · 7173368e

由 Hongsheng Zeng 提交于 12月 11, 2019

* Training pipeline of NeurIPS2019-Learn-to-Move-Challenge

* fix grammar mistakes

* release 1.2.1

* copyright

* fix bug

* refine README

* refine README

* fix typo

* Update README.md

* Update README.md

7173368e

09 12月, 2019 1 次提交
- L
  Update reward calculation in QuickStart (#182) · 1475ca77
  由 LI Yunxiang 提交于 12月 09, 2019
```
* Update reward calculation in QuickStart

* update

* yapf
```
  1475ca77
04 12月, 2019 1 次提交

Update train.py (#181) · dbb5931a

由 LI Yunxiang 提交于 12月 04, 2019

* Update train.py

remove create_actors thread in train.py

* Update GA3C train.py

dbb5931a

22 11月, 2019 1 次提交
- L
  add TD3 (#175) · 6e7f862e
  由 LI Yunxiang 提交于 11月 22, 2019
```
* add TD3

* update

* yapf.....

* Update train.py
```
  6e7f862e
18 11月, 2019 1 次提交
- L
  update dqn readme (#174) · bb0bf579
  由 LI Yunxiang 提交于 11月 18, 2019
```
* update dqn readme

* update merge.png
```
  bb0bf579
16 11月, 2019 1 次提交

make job run task in a separate process (#170) · 64aebb6d

由 Hongsheng Zeng 提交于 11月 16, 2019

* make job run task in a separate process

* fix typo

* add more debug info in xparl client

* refine control flow of different processes in xparl job

* refine control flow of different processes in xparl job

* remove tsinghua source

* remove tsinghua source

* remove unnecessary logic

* fix typo

* refine comments and some logic

* fix bug, `decay=0` means totally synchronize weights of source model to target model

64aebb6d

11 11月, 2019 1 次提交
- L
  add save_params in docs and quickStart (#172) · 4c98e3fd
  由 LI Yunxiang 提交于 11月 11, 2019
```
* add save_param in docs and quickstart

* Update train.py
```
  4c98e3fd
04 11月, 2019 1 次提交
- H
  Final submitted models of NeurIPS2019 challenge (#168) · 7c406386
  由 Hongsheng Zeng 提交于 11月 04, 2019
```
* final submit models of NeurIPS2019 challenge

* update readme

* fix yapf

* refine comment
```
  7c406386
29 10月, 2019 1 次提交
- L
  
  update dqn lr_scheduler (#164) · d5a8d268
  由 LI Yunxiang 提交于 10月 29, 2019
  
  d5a8d268
24 10月, 2019 1 次提交
- L
  add Double & Dueling DQN (#163) · bb9b78b4
  由 LI Yunxiang 提交于 10月 24, 2019
```
* add Double & Dueling DQN

* yapf......................

* update

* Update train.py
```
  bb9b78b4
25 9月, 2019 1 次提交
- L
  add dygraph pg (#155) · 49b0e706
  由 LI Yunxiang 提交于 9月 25, 2019
```
* add dygraph pg

* update acc. comments

* update comments
```
  49b0e706
17 9月, 2019 1 次提交
- H
  Limit impala to single GPU training (#152) · 89c3366b
  由 Hongsheng Zeng 提交于 9月 17, 2019
```
* Limit impala to single GPU training

* refine comment of scheduler

* refine comment
```
  89c3366b
26 8月, 2019 1 次提交

fix minor problems in the docs (#138) · b6122aa2

由 Bo Zhou 提交于 8月 26, 2019

* fix minor probmels in the docs

* typo

* remove pip source

* fix monitor

* add performance of A2C

* Update README.md

* modify logger for GPU detection

b6122aa2

13 8月, 2019 1 次提交

Zhoubo01 es (#127) · 5612ecde

由 Bo Zhou 提交于 8月 13, 2019

* add learning curve for ES

* add learning curve for ES

* support new APIs of the cluster

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* rename learner.py

* Update README.md

* Update README.md

* Update README.cn.md

* Update README.md

* Update README.cn.md

* Update README.md

5612ecde

12 8月, 2019 1 次提交

ES example (#105) · 60d68135

由 Hongsheng Zeng 提交于 8月 12, 2019

* ES example

* refine settings

* fix yapf

* refine documentation; remove csv logger

* fix bug

* merge learner.py and train.py; add version requirements of gym and atari_py

* unify actor num

60d68135

06 8月, 2019 1 次提交

add new_alg.rst (#123) · a7670972

由 LI Yunxiang 提交于 8月 06, 2019

* add new_alg.rst

* rename LiftSim_demo as LiftSim_baseline

* Update new_alg.rst

* Update new_alg.rst

a7670972

05 8月, 2019 1 次提交

add liftsim baseline (#120) · c1646351

由 LI Yunxiang 提交于 8月 05, 2019

* add liftsim baseline

* yapf

* yapf...

* modify acc. comments

* yapf

* yapf..........

* yapf!

why is yapf on paddle different from that on my mac!!!!!

c1646351

02 8月, 2019 1 次提交

first pr (#113) · b29a1ec1

由 fuyw 提交于 8月 02, 2019

* first pr

* start a worker when the master is started.

* First PR & Fix logger bugs.

* update docs for a2c, impala and ga3c

* update doc

* yapf modification

* update logger

* yapf correct

* yapf

* setup.py

* old setup.py

* worker 86

b29a1ec1

01 8月, 2019 1 次提交

Save params (#107) · 7dafee77

由 Bo Zhou 提交于 8月 01, 2019

* new feature: save params

* add unittest for save()/retore()

* add an example demonstrating the usage

* rename the variable

* yapf

* fix comment

7dafee77

29 7月, 2019 1 次提交
- B
  fix some problems of tensorboard (#100) · 564a3742
  由 Bo Zhou 提交于 7月 29, 2019
```
* fix some problems of tensorboard

* yapf
```
  564a3742
26 7月, 2019 1 次提交

replace PE with compiler(new feature in paddle151). (#99) · d33f3002

由 Bo Zhou 提交于 7月 26, 2019

* fix the compatibility issue

* fix the comment issue

* support paddle 1.5.1 and replace PE with compiler

* yapf&copyright

* yapf

* fix the teamcity problem

* fix the teamcity problem

* fix comment

* only support paddle 1.5.1

* Cmake

* fix comment

d33f3002

25 7月, 2019 1 次提交
- B
  fix the compatibility issue in the A2C example. (#98) · 33516338
  由 Bo Zhou 提交于 7月 25, 2019
```
* fix the compatibility issue

* fix the comment issue
```
  33516338
24 7月, 2019 2 次提交

B
remove previous algorithms; use the forward function in parl.Model (#96) · 0e174277
由 Bo Zhou 提交于 7月 24, 2019
```
* remove previous algorithms; use the forward function in parl.Model

* remove abundant lines

* yapf
```
0e174277

breaking changes#1 (#95) · 6efa7871

由 Bo Zhou 提交于 7月 24, 2019

* intra-version: move parl.framework into parl.core.fluid

* add folder: parl.core

* remove former test folders

* yapf

* yapf0.24

6efa7871

10 7月, 2019 1 次提交

make the quickstart more compact (#88) · 9dc152f0

由 Bo Zhou 提交于 7月 10, 2019

* make the quickstart more compact

* remove args in the main function

* yapf

* add gif

* remove render

* Update README.md

* Update README.md

* Update README.md

9dc152f0

05 7月, 2019 1 次提交

Documents cn (#85) · 96c58265

由 Bo Zhou 提交于 7月 05, 2019

* Update README.cn.md

* Update README.md

* Update README.md

* Update README.cn.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

96c58265

18 6月, 2019 1 次提交

refine A2C example (#80) · 255ef4f7

由 Hongsheng Zeng 提交于 6月 18, 2019

* refine A2C example

* fix unittest in python2; fix codestyle

* fix codestyle

* refine comment

255ef4f7

23 4月, 2019 1 次提交
- H
  
  add benchmark of GA3C (#71) · 858c4f0c
  由 Hongsheng Zeng 提交于 4月 23, 2019
  
  858c4f0c
19 4月, 2019 1 次提交
- H
  add A2C benchmark; add more information in PyPI homepage (#70) · 3b97394e
  由 Hongsheng Zeng 提交于 4月 19, 2019
```
* add A2C benchmark; add more information in PyPI homepage

* filter picture in PyPI homepage
```
  3b97394e
18 4月, 2019 1 次提交

Refine (#67) · 3556c786

由 Hongsheng Zeng 提交于 4月 18, 2019

* fix typo

* Update README.md

* Update README.md

* Update README.md

* soft depend on fluid; add module to monitor client status

* improve performance of IMPALA example

* fix bug of some client cannot exit normally

* refine comment

* .

3556c786

17 4月, 2019 1 次提交

GA3C example (#63) · 3c511e8f

由 Hongsheng Zeng 提交于 4月 17, 2019

* add IMPALA algorithm and some common utils

* update README.md

* refactor files structure of impala algorithm; seperate numpy utils from utils

* add hyper parameter scheduler module; add entropy and lr scheduler in impala

* clip reward in atari wrapper instead of learner side; fix codestyle

* add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers

* Update README.md

* add a3c algorithm, A2C example and rl_utils

* require training in single gpu/cpu

* only check cpu/gpu num in learner

* refine Readme

* update impala benchmark picture; update Readme

* add benchmark result of A2C

* move get_params/set_params in agent_base

* add GA3C example

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* refine Readme

* add benchmark

* add default safe eps in numpy logp calculation

* refine document; make unittest stable

3c511e8f

15 4月, 2019 1 次提交

A2C example (#62) · 39846831

由 Hongsheng Zeng 提交于 4月 15, 2019

* add IMPALA algorithm and some common utils

* update README.md

* refactor files structure of impala algorithm; seperate numpy utils from utils

* add hyper parameter scheduler module; add entropy and lr scheduler in impala

* clip reward in atari wrapper instead of learner side; fix codestyle

* add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers

* Update README.md

* add a3c algorithm, A2C example and rl_utils

* require training in single gpu/cpu

* only check cpu/gpu num in learner

* refine Readme

* update impala benchmark picture; update Readme

* add benchmark result of A2C

* move get_params/set_params in agent_base

* fix shell script cannot run in ubuntu

* refine comment and document

* Update README.md

* Update README.md

39846831