Commits · fa93980e8306c9984f840e5e9c253b8ebe29f499 · PaddlePaddle / PARL

14 Jul, 2020 1 commit

update readme (#346) · fa93980e

Committed by rical730 on Jul 14, 2020
  
  fa93980e
13 Jul, 2020 1 commit

update readme (#344) · fb43d292

Committed by rical730 on Jul 13, 2020
  
  fb43d292
01 Jul, 2020 1 commit

Committed by rical730 on Jul 01, 2020

* fix self.alg

* torch agent initialization

* remove definition of self.alg in PPO

* replace self.algorithm with self.alg

* remove unnecessary definition of self.alg

* fix cn readme

* unittest

* yapf

a50793e4

24 Jun, 2020 1 commit

support paddle 1.8.2 (#317) · 2e56337e

Committed by Bo Zhou on Jun 24, 2020
  
  2e56337e
23 Jun, 2020 1 commit

add toturials homework (#314) · 1bb52b4b

Committed by rical730 on Jun 23, 2020

* add tutorials

* yapf

* yapf

* copyright

* yapf

* update tutorial lesson5

* delete drawing code

* yapf

* remove action_mapping

* update dqn and add README

* update

* update

* yapf

* add toturials homework

1bb52b4b

16 Jun, 2020 1 commit

update tutorials (#298) · e8797bd0

Committed by rical730 on Jun 16, 2020
```
* update tutorials
```
  e8797bd0
10 Jun, 2020 2 commits

add tutorials (#270) · 1cf17a57

Committed by rical730 on Jun 10, 2020

* add tutorials

* yapf

* yapf

* copyright

* yapf

* update tutorial lesson5

* delete drawing code

* yapf

* remove action_mapping

* update dqn and add README

* update

* update

* yapf

1cf17a57

upgrade DQN's lr interface compatibility (#291) · a9159021

Committed by rical730 on Jun 10, 2020
```
* upgrade DQN's lr interface compatibility

* yapf

* update example DQN
```
a9159021

29 May, 2020 1 commit

replace tensorboard with summary to support VDL (#276) · bcface6a

Committed by Bo Zhou on May 29, 2020

* replace tensorboard with summary to support VDL in the future

* unittest

* rename keys for record

* yapf

bcface6a

07 May, 2020 1 commit

update ddpg (#260) · 87e68119

Committed by LI Yunxiang on May 07, 2020
```
* update ddpg

* Update train.py
```
  87e68119
06 May, 2020 1 commit

use cartpole-v0 in dqn (#259) · e5e1685a

Committed by LI Yunxiang on May 06, 2020
```
* use cartpole-v0 in dqn

* Update README.md
```
  e5e1685a
30 Apr, 2020 1 commit

state to obs (#256) · de21118e

Committed by LI Yunxiang on Apr 30, 2020
```
* state to obs

* yapf & update softlink in offline-q-learning
```
  de21118e
28 Apr, 2020 1 commit

add simple dqn demo (#254) · 117b1c38

Committed by LI Yunxiang on Apr 28, 2020

* add simple dqn

* Update README.md

* Update train.py

* update

* update image in README

* update readme

* simplify

* yapf

* Update README.md

* Update README.md

* Update README.md

* Update train.py

* yapf

117b1c38

27 Apr, 2020 1 commit

remove version 1.3 warnings (#252) · 0a068653

Committed by LI Yunxiang on Apr 27, 2020

* remove version 1.3 warnings

* update

* yapf

* add algorithms test

* Update algs_test.py

* Update algs_test.py

add SAC DDPG TD3 tests

* yapf

0a068653

30 Mar, 2020 1 commit

release v1.2.3 (#234) · d18740e5

Committed by Hongsheng Zeng on Mar 30, 2020
```
* release v1.2.3

* change dep of liftsim a2c
```
  d18740e5
25 Mar, 2020 1 commit

fix a2c cannot run in paddle 1.6.0 (#232) · f46ad361

Committed by Hongsheng Zeng on Mar 25, 2020
```
* fix a2c cannot run in paddle 1.6.0

* fix impala compatibility

* yapf
```
  f46ad361
24 Mar, 2020 1 commit

release 1.2.2 (#223) · e9d08fae

Committed by Hongsheng Zeng on Mar 24, 2020
  
  e9d08fae
23 Mar, 2020 3 commits

resolve the compatibility issue (#226) · aede5aee

Committed by Bo Zhou on Mar 23, 2020

* fix compatibility issue with the newest paddle

* remove logging lines

* resolve the compatibility issue with the newest paddle

* yapf

Co-authored-by: Nrobot <zenghongsheng@baidu.com>

aede5aee

add SGD and Adam Optimizer for DeepES (#222) · b1cabc2d

Committed by rical730 on Mar 23, 2020

* add SGD and Adam Optimizer for DeepES

* update deepes readme

* add warning when input different size in the same param update()

* add error return in update(), add optimizer.cc

* separate SGD and Adam, optimizer type in config is not case sensitive

* delete optimizer.cc

* config optimizer in deepes.proto

* more readable

* update maddpg readme, fixed gym version

b1cabc2d

add tutorial of deepes, written with numpy, less than 100 lines (#225) · 7b5c5241

Committed by Bo Zhou on Mar 23, 2020
```
* add tutorial of deepes, written with numpy, less than 100lines

* modify learning_rate as an arugment of Agent
```
7b5c5241

18 Mar, 2020 1 commit

LiftSim A2C baseline (#209) · 6b70b81d

Committed by Hongsheng Zeng on Mar 18, 2020

* liftsim a2c baseline

* update readme

* compatible with different os

* empty

* refine comments

* remove unnecessary assertion; add tensorboard guide

* remove unnecessary assertion

* update parl dependence of A2C

6b70b81d

16 Mar, 2020 1 commit

update comments for ES (#211) · fa420300

Committed by Bo Zhou on Mar 16, 2020

* update comments for ES

* check dependence on paddle or torch

* update readme

* update readme#2

* users can still use parl.remote when no DL framework was found

* yapf

fa420300

06 Mar, 2020 1 commit

fix paddle version bug (#207) · 450a4a34

Committed by Bo Zhou on Mar 06, 2020
```
* fix paddle version bug

* add gym dependence (introduced by MADDPG)

* recall
```
  450a4a34
08 Feb, 2020 1 commit

add maddpg example (#200) · 9216d941

Committed by rical730 on Feb 08, 2020

* add maddpg example

* format with yapf

* fix coding style

* fix coding style

* unittest without import multiagent env

* update maddpg code

* update maddpg readme

* add copyright comments

9216d941

14 Jan, 2020 1 commit

add offline q learning (#193) · 6a672c80

Committed by LI Yunxiang on Jan 14, 2020
```
* add offline q learning

* Update README.md

* update

* yapf
```
  6a672c80
30 Dec, 2019 1 commit

add sac (#188) · c070db83

Committed by LI Yunxiang on Dec 30, 2019
```
* add sac
```
  c070db83
21 Dec, 2019 1 commit

fix typo; refine readme (#190) · cb4b3852

Committed by Hongsheng Zeng on Dec 21, 2019
  
  cb4b3852
17 Dec, 2019 1 commit

fix paddle version bug; refine scripts (#186) · c5a8c2ba

Committed by Hongsheng Zeng on Dec 17, 2019
  
  c5a8c2ba
11 Dec, 2019 1 commit

Training pipeline of NeurIPS2019-Learn-to-Move-Challenge (#183) · 7173368e

Committed by Hongsheng Zeng on Dec 11, 2019

* Training pipeline of NeurIPS2019-Learn-to-Move-Challenge

* fix grammar mistakes

* release 1.2.1

* copyright

* fix bug

* refine README

* refine README

* fix typo

* Update README.md

* Update README.md

7173368e

09 Dec, 2019 1 commit

Update reward calculation in QuickStart (#182) · 1475ca77

Committed by LI Yunxiang on Dec 09, 2019
```
* Update reward calculation in QuickStart

* update

* yapf
```
  1475ca77
04 Dec, 2019 1 commit

Update train.py (#181) · dbb5931a

Committed by LI Yunxiang on Dec 04, 2019

* Update train.py

remove create_actors thread in train.py

* Update GA3C train.py

dbb5931a

22 Nov, 2019 1 commit

add TD3 (#175) · 6e7f862e

Committed by LI Yunxiang on Nov 22, 2019
```
* add TD3

* update

* yapf.....

* Update train.py
```
  6e7f862e
18 Nov, 2019 1 commit

update dqn readme (#174) · bb0bf579

Committed by LI Yunxiang on Nov 18, 2019
```
* update dqn readme

* update merge.png
```
  bb0bf579
16 Nov, 2019 1 commit

make job run task in a separate process (#170) · 64aebb6d

Committed by Hongsheng Zeng on Nov 16, 2019

* make job run task in a separate process

* fix typo

* add more debug info in xparl client

* refine control flow of different processes in xparl job

* refine control flow of different processes in xparl job

* remove tsinghua source

* remove tsinghua source

* remove unnecessary logic

* fix typo

* refine comments and some logic

* fix bug, `decay=0` means totally synchronize weights of source model to target model

64aebb6d

11 Nov, 2019 1 commit

add save_params in docs and quickStart (#172) · 4c98e3fd

Committed by LI Yunxiang on Nov 11, 2019
```
* add save_param in docs and quickstart

* Update train.py
```
  4c98e3fd
04 Nov, 2019 1 commit

Final submitted models of NeurIPS2019 challenge (#168) · 7c406386

Committed by Hongsheng Zeng on Nov 04, 2019
```
* final submit models of NeurIPS2019 challenge

* update readme

* fix yapf

* refine comment
```
  7c406386
29 Oct, 2019 1 commit

update dqn lr_scheduler (#164) · d5a8d268

Committed by LI Yunxiang on Oct 29, 2019
  
  d5a8d268
24 Oct, 2019 1 commit

add Double & Dueling DQN (#163) · bb9b78b4

Committed by LI Yunxiang on Oct 24, 2019
```
* add Double & Dueling DQN

* yapf......................

* update

* Update train.py
```
  bb9b78b4
25 Sep, 2019 1 commit

add dygraph pg (#155) · 49b0e706

Committed by LI Yunxiang on Sep 25, 2019
```
* add dygraph pg

* update acc. comments

* update comments
```
  49b0e706
17 Sep, 2019 1 commit

Limit impala to single GPU training (#152) · 89c3366b

Committed by Hongsheng Zeng on Sep 17, 2019
```
* Limit impala to single GPU training

* refine comment of scheduler

* refine comment
```
  89c3366b