提交 · aede5aee1854164a1c5caa5ef7f32c582338424f · PaddlePaddle / PARL

23 3月, 2020 3 次提交

resolve the compatibility issue (#226) · aede5aee

由 Bo Zhou 提交于 3月 23, 2020

* fix compatibility issue with the newest paddle

* remove logging lines

* resolve the compatibility issue with the newest paddle

* yapf
Co-authored-by: Nrobot <zenghongsheng@baidu.com>

aede5aee

add SGD and Adam Optimizer for DeepES (#222) · b1cabc2d

由 rical730 提交于 3月 23, 2020

* add SGD and Adam Optimizer for DeepES

* update deepes readme

* add warning when input different size in the same param update()

* add error return in update(), add optimizer.cc

* separate SGD and Adam, optimizer type in config is not case sensitive

* delete optimizer.cc

* config optimizer in deepes.proto

* more readable

* update maddpg readme, fixed gym version

b1cabc2d

B
add tutorial of deepes, written with numpy, less than 100 lines (#225) · 7b5c5241
由 Bo Zhou 提交于 3月 23, 2020
```
* add tutorial of deepes, written with numpy, less than 100lines

* modify learning_rate as an arugment of Agent
```
7b5c5241

22 3月, 2020 1 次提交

fix compatibility issue with the newest paddle (#218) · 7a16adc0

由 Bo Zhou 提交于 3月 22, 2020

* fix compatibility issue with the newest paddle

* remove logging lines
Co-authored-by: Nrobot <zenghongsheng@baidu.com>

7a16adc0

20 3月, 2020 1 次提交
- H
  mac compatibility (#219) · d96dba18
  由 Hongsheng Zeng 提交于 3月 20, 2020
```
* mac compatibility

* refine import trick
```
  d96dba18
18 3月, 2020 2 次提交

LiftSim A2C baseline (#209) · 6b70b81d

由 Hongsheng Zeng 提交于 3月 18, 2020

* liftsim a2c baseline

* update readme

* compatible with different os

* empty

* refine comments

* remove unnecessary assertion; add tensorboard guide

* remove unnecessary assertion

* update parl dependence of A2C

6b70b81d

deepES framework & a demo that is compatible with torch (#214) · c848bda2

由 Bo Zhou 提交于 3月 18, 2020

* add deepES & a demo that is compatible with torch

* add copyright & update protoc file path

* add copyright

* rm useless files

* update dependency on libtorch

* add the demonstration gif

* update gif

* Create README.md

* Update README.md

* Update README.md

* Update README.md

* update scripts

* update scripts#2

* update torch_predictor

c848bda2

16 3月, 2020 1 次提交

update comments for ES (#211) · fa420300

由 Bo Zhou 提交于 3月 16, 2020

* update comments for ES

* check dependence on paddle or torch

* update readme

* update readme#2

* users can still use parl.remote when no DL framework was found

* yapf

fa420300

09 3月, 2020 1 次提交

update parl.maddpg without import gym (#208) · 7f2abd56

由 rical730 提交于 3月 09, 2020

* add maddpg example

* format with yapf

* fix coding style

* fix coding style

* unittest without import multiagent env

* update maddpg code

* update maddpg readme

* add copyright comments

* update parl.maddpg without import gym

* update NeurlIPS2018.gif to NeurlIPS2019.gif

* update readme and comments

7f2abd56

06 3月, 2020 1 次提交
- B
  fix paddle version bug (#207) · 450a4a34
  由 Bo Zhou 提交于 3月 06, 2020
```
* fix paddle version bug

* add gym dependence (introduced by MADDPG)

* recall
```
  450a4a34
03 3月, 2020 1 次提交
- H
  torch benchmark policy gradient (#203) · bbcb707b
  由 Hongsheng Zeng 提交于 3月 03, 2020
```
* torch benchmark policy gradient

* refine comments and use native api
```
  bbcb707b
08 2月, 2020 1 次提交

add maddpg example (#200) · 9216d941

由 rical730 提交于 2月 08, 2020

* add maddpg example

* format with yapf

* fix coding style

* fix coding style

* unittest without import multiagent env

* update maddpg code

* update maddpg readme

* add copyright comments

9216d941

15 1月, 2020 1 次提交
- B
  
  Create ICLR_2020.md (#199) · f35200fe
  由 Bo Zhou 提交于 1月 15, 2020
  
  f35200fe
14 1月, 2020 1 次提交
- L
  add offline q learning (#193) · 6a672c80
  由 LI Yunxiang 提交于 1月 14, 2020
```
* add offline q learning

* Update README.md

* update

* yapf
```
  6a672c80
30 12月, 2019 1 次提交
- L
  add sac (#188) · c070db83
  由 LI Yunxiang 提交于 12月 30, 2019
```
* add sac
```
  c070db83
23 12月, 2019 1 次提交
- L
  
  fix compiled_program restore (#192) · 5054efed
  由 LI Yunxiang 提交于 12月 23, 2019
  
  5054efed
21 12月, 2019 1 次提交
- H
  
  fix typo; refine readme (#190) · cb4b3852
  由 Hongsheng Zeng 提交于 12月 21, 2019
  
  cb4b3852
17 12月, 2019 1 次提交
- H
  
  fix paddle version bug; refine scripts (#186) · c5a8c2ba
  由 Hongsheng Zeng 提交于 12月 17, 2019
  
  c5a8c2ba
11 12月, 2019 2 次提交

Training pipeline of NeurIPS2019-Learn-to-Move-Challenge (#183) · 7173368e

由 Hongsheng Zeng 提交于 12月 11, 2019

* Training pipeline of NeurIPS2019-Learn-to-Move-Challenge

* fix grammar mistakes

* release 1.2.1

* copyright

* fix bug

* refine README

* refine README

* fix typo

* Update README.md

* Update README.md

7173368e

B

Add files via upload (#184) · 1768fbc3
由 Bo Zhou 提交于 12月 11, 2019

1768fbc3

09 12月, 2019 1 次提交
- L
  Update reward calculation in QuickStart (#182) · 1475ca77
  由 LI Yunxiang 提交于 12月 09, 2019
```
* Update reward calculation in QuickStart

* update

* yapf
```
  1475ca77
04 12月, 2019 1 次提交

Update train.py (#181) · dbb5931a

由 LI Yunxiang 提交于 12月 04, 2019

* Update train.py

remove create_actors thread in train.py

* Update GA3C train.py

dbb5931a

27 11月, 2019 1 次提交
- L
  
  add torch td3 (#176) · 7f456dc7
  由 LI Yunxiang 提交于 11月 27, 2019
  
  7f456dc7
22 11月, 2019 1 次提交
- L
  add TD3 (#175) · 6e7f862e
  由 LI Yunxiang 提交于 11月 22, 2019
```
* add TD3

* update

* yapf.....

* Update train.py
```
  6e7f862e
18 11月, 2019 1 次提交
- L
  update dqn readme (#174) · bb0bf579
  由 LI Yunxiang 提交于 11月 18, 2019
```
* update dqn readme

* update merge.png
```
  bb0bf579
16 11月, 2019 1 次提交

make job run task in a separate process (#170) · 64aebb6d

由 Hongsheng Zeng 提交于 11月 16, 2019

* make job run task in a separate process

* fix typo

* add more debug info in xparl client

* refine control flow of different processes in xparl job

* refine control flow of different processes in xparl job

* remove tsinghua source

* remove tsinghua source

* remove unnecessary logic

* fix typo

* refine comments and some logic

* fix bug, `decay=0` means totally synchronize weights of source model to target model

64aebb6d

15 11月, 2019 1 次提交
- L
  
  Update common.py (#173) · ee36f15b
  由 LI Yunxiang 提交于 11月 15, 2019
  
  ee36f15b
11 11月, 2019 1 次提交
- L
  add save_params in docs and quickStart (#172) · 4c98e3fd
  由 LI Yunxiang 提交于 11月 11, 2019
```
* add save_param in docs and quickstart

* Update train.py
```
  4c98e3fd
06 11月, 2019 1 次提交

add pytorch a2c (#167) · 4abc0534

由 LI Yunxiang 提交于 11月 06, 2019

* add pytorch a2c

* add set/get_weights test & copyright

* yapf....

* Update model_base_test_torch.py

* update

* Delete banma.py

* Update model_base_test_torch.py

* update

* Update model.py

* update torch tests

* Update model_base_test_torch.py

4abc0534

04 11月, 2019 1 次提交
- H
  Final submitted models of NeurIPS2019 challenge (#168) · 7c406386
  由 Hongsheng Zeng 提交于 11月 04, 2019
```
* final submit models of NeurIPS2019 challenge

* update readme

* fix yapf

* refine comment
```
  7c406386
30 10月, 2019 2 次提交
- B
  
  support not starting any jobs at the command line (#166) · 51fd6169
  由 Bo Zhou 提交于 10月 30, 2019
  
  51fd6169
- B
  
  Update archive.md (#165) · 554f19ad
  由 Bo Zhou 提交于 10月 30, 2019
  
  554f19ad
29 10月, 2019 1 次提交
- L
  
  update dqn lr_scheduler (#164) · d5a8d268
  由 LI Yunxiang 提交于 10月 29, 2019
  
  d5a8d268
24 10月, 2019 1 次提交
- L
  add Double & Dueling DQN (#163) · bb9b78b4
  由 LI Yunxiang 提交于 10月 24, 2019
```
* add Double & Dueling DQN

* yapf......................

* update

* Update train.py
```
  bb9b78b4
22 10月, 2019 1 次提交
- H
  
  release 1.2 (#162) · 4d763f36
  由 Hongsheng Zeng 提交于 10月 22, 2019
  
  4d763f36
16 10月, 2019 1 次提交
- H
  
  fix bug in _get_parameter_names function (#159) · 6596320f
  由 Hongsheng Zeng 提交于 10月 16, 2019
  
  6596320f
08 10月, 2019 1 次提交
- L
  torch test env (#156) · 2ddf4c11
  由 LI Yunxiang 提交于 10月 08, 2019
```
* torch test env

* Update build.sh

* update torch unit test
```
  2ddf4c11
25 9月, 2019 2 次提交

torchdqn (#150) · 757cc391

由 fuyw 提交于 9月 25, 2019

* git commit -m torchdqn

* yapf

* fix bugs

* fix bugs

* fix bugs

* yapf

* remove fstring format

* torch_test yapf

* yapf

* Add torch in unittest.requirements

* update torch_unittest

* Torch and FLUID conflict problem in __init__.py

* Unittest fail for torch when both torch and fluid exists.

* cluster_test fail in the unittest, add timeout seconds.

* Torch backend for PARL

* add sleep time for unit test send_job_test.py

* Unit test for send_job_test.py

* use multiple try for unit test

* Fix compatibility for python2.7.

* fix send_job_test.py bugs

* check file exist before send_job_test.py

* Modify send_job_test.py

757cc391

L
add dygraph pg (#155) · 49b0e706
由 LI Yunxiang 提交于 9月 25, 2019
```
* add dygraph pg

* update acc. comments

* update comments
```
49b0e706

17 9月, 2019 1 次提交
- H
  Limit impala to single GPU training (#152) · 89c3366b
  由 Hongsheng Zeng 提交于 9月 17, 2019
```
* Limit impala to single GPU training

* refine comment of scheduler

* refine comment
```
  89c3366b