提交 · ce152a62316eda2a0557d1aa1618938936c1fb3f · PaddlePaddle / PARL

22 9月, 2020 1 次提交

update lr interface and support training on single gpu (#415) · ce152a62

由 rical730 提交于 9月 22, 2020

* update lr interface and support  training on single gpu

* yapf

* update warning message

* update warning message

ce152a62

21 8月, 2020 1 次提交

add torch coma (#216) · 1cbcfb15

由 rical730 提交于 8月 21, 2020

* add torch coma

* add Apache License comment

* update readme

* update readme for installing sc2 on windows

* update readme

* add new line at the end of shell file

* update readme

* update readme of coma

* fix model_path

* self.algorithm to self.alg
Co-authored-by: NBo Zhou <2466956298@qq.com>

1cbcfb15

27 7月, 2020 1 次提交
- Z
  Add Prioritized DQN (#326) · 3a27f407
  由 Zheyue Tan 提交于 7月 27, 2020
```
- add prioritized dqn
- fix#239
```
  3a27f407
20 7月, 2020 1 次提交
- Z
  
  fix bug of ddqn (#353) · 942e570c
  由 zenghsh3 提交于 7月 20, 2020
  
  942e570c
01 7月, 2020 1 次提交

fix self.alg (#325) · a50793e4

由 rical730 提交于 7月 01, 2020

* fix self.alg

* torch agent initialization

* remove definition of self.alg in PPO

* replace self.algorithm with self.alg

* remove unnecessary definition of self.alg

* fix cn readme

* unittest

* yapf

a50793e4

24 6月, 2020 1 次提交
- B
  
  support paddle 1.8.2 (#317) · 2e56337e
  由 Bo Zhou 提交于 6月 24, 2020
  
  2e56337e
11 6月, 2020 1 次提交
- L
  add torch ppo (#213) · 2deefa8f
  由 LI Yunxiang 提交于 6月 11, 2020
```
* add ppo

* fix bugs

* yapf
```
  2deefa8f
10 6月, 2020 1 次提交
- R
  upgrade DQN's lr interface compatibility (#291) · a9159021
  由 rical730 提交于 6月 10, 2020
```
* upgrade DQN's lr interface compatibility

* yapf

* update example DQN
```
  a9159021
02 6月, 2020 1 次提交

ping the master before connection (#278) · d683f331

由 Bo Zhou 提交于 6月 02, 2020

* ping the master before connection

* yapf

* fix comments

* remove the useless library

* install ping for the docker environment

* remove protobuf intallation

* remove evokit test

d683f331

29 5月, 2020 1 次提交

replace tensorboard with summary to support VDL (#276) · bcface6a

由 Bo Zhou 提交于 5月 29, 2020

* replace tensorboard with summary to support VDL in the future

* unittest

* rename keys for record

* yapf

bcface6a

30 4月, 2020 1 次提交
- L
  state to obs (#256) · de21118e
  由 LI Yunxiang 提交于 4月 30, 2020
```
* state to obs

* yapf & update softlink in offline-q-learning
```
  de21118e
27 4月, 2020 1 次提交

remove version 1.3 warnings (#252) · 0a068653

由 LI Yunxiang 提交于 4月 27, 2020

* remove version 1.3 warnings

* update

* yapf

* add algorithms test

* Update algs_test.py

* Update algs_test.py

add SAC DDPG TD3 tests

* yapf

0a068653

25 3月, 2020 1 次提交
- H
  fix a2c cannot run in paddle 1.6.0 (#232) · f46ad361
  由 Hongsheng Zeng 提交于 3月 25, 2020
```
* fix a2c cannot run in paddle 1.6.0

* fix impala compatibility

* yapf
```
  f46ad361
23 3月, 2020 1 次提交

resolve the compatibility issue (#226) · aede5aee

由 Bo Zhou 提交于 3月 23, 2020

* fix compatibility issue with the newest paddle

* remove logging lines

* resolve the compatibility issue with the newest paddle

* yapf
Co-authored-by: Nrobot <zenghongsheng@baidu.com>

aede5aee

22 3月, 2020 1 次提交

fix compatibility issue with the newest paddle (#218) · 7a16adc0

由 Bo Zhou 提交于 3月 22, 2020

* fix compatibility issue with the newest paddle

* remove logging lines
Co-authored-by: Nrobot <zenghongsheng@baidu.com>

7a16adc0

16 3月, 2020 1 次提交

update comments for ES (#211) · fa420300

由 Bo Zhou 提交于 3月 16, 2020

* update comments for ES

* check dependence on paddle or torch

* update readme

* update readme#2

* users can still use parl.remote when no DL framework was found

* yapf

fa420300

09 3月, 2020 1 次提交

update parl.maddpg without import gym (#208) · 7f2abd56

由 rical730 提交于 3月 09, 2020

* add maddpg example

* format with yapf

* fix coding style

* fix coding style

* unittest without import multiagent env

* update maddpg code

* update maddpg readme

* add copyright comments

* update parl.maddpg without import gym

* update NeurlIPS2018.gif to NeurlIPS2019.gif

* update readme and comments

7f2abd56

03 3月, 2020 1 次提交
- H
  torch benchmark policy gradient (#203) · bbcb707b
  由 Hongsheng Zeng 提交于 3月 03, 2020
```
* torch benchmark policy gradient

* refine comments and use native api
```
  bbcb707b
08 2月, 2020 1 次提交

add maddpg example (#200) · 9216d941

由 rical730 提交于 2月 08, 2020

* add maddpg example

* format with yapf

* fix coding style

* fix coding style

* unittest without import multiagent env

* update maddpg code

* update maddpg readme

* add copyright comments

9216d941

30 12月, 2019 1 次提交
- L
  add sac (#188) · c070db83
  由 LI Yunxiang 提交于 12月 30, 2019
```
* add sac
```
  c070db83
11 12月, 2019 1 次提交

Training pipeline of NeurIPS2019-Learn-to-Move-Challenge (#183) · 7173368e

由 Hongsheng Zeng 提交于 12月 11, 2019

* Training pipeline of NeurIPS2019-Learn-to-Move-Challenge

* fix grammar mistakes

* release 1.2.1

* copyright

* fix bug

* refine README

* refine README

* fix typo

* Update README.md

* Update README.md

7173368e

27 11月, 2019 1 次提交
- L
  
  add torch td3 (#176) · 7f456dc7
  由 LI Yunxiang 提交于 11月 27, 2019
  
  7f456dc7
22 11月, 2019 1 次提交
- L
  add TD3 (#175) · 6e7f862e
  由 LI Yunxiang 提交于 11月 22, 2019
```
* add TD3

* update

* yapf.....

* Update train.py
```
  6e7f862e
06 11月, 2019 1 次提交

add pytorch a2c (#167) · 4abc0534

由 LI Yunxiang 提交于 11月 06, 2019

* add pytorch a2c

* add set/get_weights test & copyright

* yapf....

* Update model_base_test_torch.py

* update

* Delete banma.py

* Update model_base_test_torch.py

* update

* Update model.py

* update torch tests

* Update model_base_test_torch.py

4abc0534

24 10月, 2019 1 次提交
- L
  add Double & Dueling DQN (#163) · bb9b78b4
  由 LI Yunxiang 提交于 10月 24, 2019
```
* add Double & Dueling DQN

* yapf......................

* update

* Update train.py
```
  bb9b78b4
25 9月, 2019 1 次提交

torchdqn (#150) · 757cc391

由 fuyw 提交于 9月 25, 2019

* git commit -m torchdqn

* yapf

* fix bugs

* fix bugs

* fix bugs

* yapf

* remove fstring format

* torch_test yapf

* yapf

* Add torch in unittest.requirements

* update torch_unittest

* Torch and FLUID conflict problem in __init__.py

* Unittest fail for torch when both torch and fluid exists.

* cluster_test fail in the unittest, add timeout seconds.

* Torch backend for PARL

* add sleep time for unit test send_job_test.py

* Unit test for send_job_test.py

* use multiple try for unit test

* Fix compatibility for python2.7.

* fix send_job_test.py bugs

* check file exist before send_job_test.py

* Modify send_job_test.py

757cc391

12 8月, 2019 1 次提交
- H
  fix bug of ParamAttr (#126) · 2ad3c4c0
  由 Hongsheng Zeng 提交于 8月 12, 2019
```
* fix bug of ParamAttr

* refine imports of unittest
```
  2ad3c4c0
24 7月, 2019 2 次提交

B
remove previous algorithms; use the forward function in parl.Model (#96) · 0e174277
由 Bo Zhou 提交于 7月 24, 2019
```
* remove previous algorithms; use the forward function in parl.Model

* remove abundant lines

* yapf
```
0e174277

breaking changes#1 (#95) · 6efa7871

由 Bo Zhou 提交于 7月 24, 2019

* intra-version: move parl.framework into parl.core.fluid

* add folder: parl.core

* remove former test folders

* yapf

* yapf0.24

6efa7871

15 4月, 2019 1 次提交

A2C example (#62) · 39846831

由 Hongsheng Zeng 提交于 4月 15, 2019

* add IMPALA algorithm and some common utils

* update README.md

* refactor files structure of impala algorithm; seperate numpy utils from utils

* add hyper parameter scheduler module; add entropy and lr scheduler in impala

* clip reward in atari wrapper instead of learner side; fix codestyle

* add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers

* Update README.md

* add a3c algorithm, A2C example and rl_utils

* require training in single gpu/cpu

* only check cpu/gpu num in learner

* refine Readme

* update impala benchmark picture; update Readme

* add benchmark result of A2C

* move get_params/set_params in agent_base

* fix shell script cannot run in ubuntu

* refine comment and document

* Update README.md

* Update README.md

39846831

08 4月, 2019 1 次提交

implement of IMPALA with the newest parallel design (#60) · b28289ac

由 Hongsheng Zeng 提交于 4月 08, 2019

* add IMPALA algorithm and some common utils

* update README.md

* refactor files structure of impala algorithm; seperate numpy utils from utils

* add hyper parameter scheduler module; add entropy and lr scheduler in impala

* clip reward in atari wrapper instead of learner side; fix codestyle

* add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers

* Update README.md

b28289ac

04 1月, 2019 1 次提交

add PPO example (#39) · f8de849b

由 Hongsheng Zeng 提交于 1月 04, 2019

* add PPO example

* Update Readme

* Update Readme

* fix codestyle

* Update Readme

* refine action mapping

* add more unitest case

* remove unnecessary params initialize, add more comments, add benchmark result

* rename

* remove PARL dependence in readme of examples

f8de849b

15 12月, 2018 1 次提交

Add DDPG example (#36) · 53c94787

由 Hongsheng Zeng 提交于 12月 15, 2018

* add DDPG example, fix some tiny bug

* add license

* unify code structure

* unify code structure

* refine gputils, fix seed in QuickStart

* use white noise in DDPG

* fix codestyle

53c94787

07 12月, 2018 1 次提交

Add QuickStart example (#35) · cdd4622a

由 Hongsheng Zeng 提交于 12月 06, 2018

* add QuickStart example, refine DQN example

* add examples link

* refine the naming, and add quick start training result

cdd4622a

04 12月, 2018 1 次提交

DQN example (#33) · 4a4366a5

由 Hongsheng Zeng 提交于 12月 04, 2018

* add DQN example, add Agent unittest

* refine readme

* refine  code

* simplify code

4a4366a5