提交 · 432d75b777a67d10e02d4d50159fdc501ef36b99 · PaddlePaddle / PARL

18 4月, 2019 1 次提交

Add a Chinese documentation (#65) · 432d75b7

由 Bo Zhou 提交于 4月 18, 2019

* Update README.md

* Create README.cn.md

* Update README.cn.md

* Update README.cn.md

* Update README.cn.md

* Update README.cn.md

* Update README.cn.md

* Update README.cn.md

* Update README.cn.md

* Update README.cn.md

* Update README.cn.md

* Update README.cn.md

* Update README.md

* Update README.md

* Update README.md

* Update README.cn.md

432d75b7

17 4月, 2019 1 次提交

GA3C example (#63) · 3c511e8f

由 Hongsheng Zeng 提交于 4月 17, 2019

* add IMPALA algorithm and some common utils

* update README.md

* refactor files structure of impala algorithm; seperate numpy utils from utils

* add hyper parameter scheduler module; add entropy and lr scheduler in impala

* clip reward in atari wrapper instead of learner side; fix codestyle

* add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers

* Update README.md

* add a3c algorithm, A2C example and rl_utils

* require training in single gpu/cpu

* only check cpu/gpu num in learner

* refine Readme

* update impala benchmark picture; update Readme

* add benchmark result of A2C

* move get_params/set_params in agent_base

* add GA3C example

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* refine Readme

* add benchmark

* add default safe eps in numpy logp calculation

* refine document; make unittest stable

3c511e8f

15 4月, 2019 1 次提交

A2C example (#62) · 39846831

由 Hongsheng Zeng 提交于 4月 15, 2019

* add IMPALA algorithm and some common utils

* update README.md

* refactor files structure of impala algorithm; seperate numpy utils from utils

* add hyper parameter scheduler module; add entropy and lr scheduler in impala

* clip reward in atari wrapper instead of learner side; fix codestyle

* add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers

* Update README.md

* add a3c algorithm, A2C example and rl_utils

* require training in single gpu/cpu

* only check cpu/gpu num in learner

* refine Readme

* update impala benchmark picture; update Readme

* add benchmark result of A2C

* move get_params/set_params in agent_base

* fix shell script cannot run in ubuntu

* refine comment and document

* Update README.md

* Update README.md

39846831

13 4月, 2019 1 次提交

add some introduction for our parallelization feature (#61) · 452050a0

由 Bo Zhou 提交于 4月 13, 2019

* Update remote_decorator.py

* Update README.md

* add an figure for the demonstration about parallelization

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* add a link to IMPALA

452050a0

08 4月, 2019 1 次提交

implement of IMPALA with the newest parallel design (#60) · b28289ac

由 Hongsheng Zeng 提交于 4月 08, 2019

* add IMPALA algorithm and some common utils

* update README.md

* refactor files structure of impala algorithm; seperate numpy utils from utils

* add hyper parameter scheduler module; add entropy and lr scheduler in impala

* clip reward in atari wrapper instead of learner side; fix codestyle

* add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers

* Update README.md

b28289ac

26 3月, 2019 1 次提交

add api set_params/get_params in Model (#56) · 7346a23d

由 Hongsheng Zeng 提交于 3月 26, 2019

* add api set_params/get_params in Model; add Interface of Network and LayerFunc to solve circular imports; refine parameter_names api of Model

* remove licence in third party code; remove interface of Network and LayerFunc; move get_parameter_pairs and get_parameter_names api to Network

* refine comment

* refine commment

7346a23d

11 3月, 2019 1 次提交

update documents (#58) · d8449b74

由 Bo Zhou 提交于 3月 11, 2019

* Update README.md

* Update train.py

* Update README.md

* Update agent_base.py

* Update train.py

* Update train.py

* Update train.py

d8449b74

07 3月, 2019 1 次提交

new feature: parl.remote (#54) · 348db1fb

由 Hongsheng Zeng 提交于 3月 07, 2019

* refine remote module, add heartbeat machanism and unittest

* yapf

* yapf

* support get ip address in CentOS, add dependence

* yapf

* add dependence in Dockerfile

* refine message_tag, Compatible with Python2 and python3

* refine unittest and comments

* remove ParlError, use to_pybytes api to compatible with Python 2 and python 3

* Not need to use to_pybytes

* use parl-test docker image for unittest, which has python2 and python3 env

* test different release order of sockets

* test for different closing way fo context and socket

* tmp commit for debug in teamcity

* tmp commit for debug in teamcity

* tmp commit for debug in teamcity

* use zmq.context destroy to close multi-thread socket, refine RemoteError

* set linger=0 for command socket in RemoteObject

* remove close context unittest

* fix codestyle

* fix codestyle

* rename parl.remote to parl.remote_class; will not exit client when having errors in function call; use sepereate server port in unittest to avoiding closing server manually

* rename parl.remote to parl.remote_class; will not exit client when having errors in function call; use sepereate server port in unittest to avoiding closing server manually

* fix typo

* remove unnecessary try/except in reply loop of client

* import RemoteManager to parl; refine comment

348db1fb

05 3月, 2019 1 次提交

run unittest in python2 and python3 (#55) · e80604f8

由 Hongsheng Zeng 提交于 3月 05, 2019

* run unittest in python2 and python3

* refine structure of repo

* refine structure of repo

* add --fix-misssing

* fix teamcity

* add --fix-misssing

* update paddle version in python2

e80604f8

01 3月, 2019 1 次提交
- B
  Update some docs. (#51) · 46188cd4
  由 Bo Zhou 提交于 3月 01, 2019
```
* Update model_base.py

* Update README.md

* Update README.md
```
  46188cd4
27 2月, 2019 1 次提交

first version of network communication (#49) · bbde58fb

由 Bo Zhou 提交于 2月 27, 2019

* first version of network communication

* fix code styple problems

* add a script to get machine's information

* code styple problems#2

* fix unit test problems

* update dockfile to fix the installation issue of cmake

* thread-saftey ensurance & copright

* resolve comments

bbde58fb

14 2月, 2019 1 次提交

fix PPO bug; add more benchmark result (#47) · 65ad2a4e

由 Hongsheng Zeng 提交于 2月 14, 2019

* fix PPO bug; add more benchmark result

* refine code

* update benchmark of PPO, after fix bug

* refine code

65ad2a4e

24 1月, 2019 1 次提交

Add more dqn benchmark result and unify train scripts (#46) · 6fdf4448

由 Hongsheng Zeng 提交于 1月 24, 2019

* add more dqn benchmark result; unify train scripts

* resize benchmark picture

* resize benchmark picture, refine comments of args

* change dependence, mujoco only support python3 now

6fdf4448

18 1月, 2019 1 次提交

Refine documents of PARL (#43) · 7a7583ab

由 Hongsheng Zeng 提交于 1月 18, 2019

* remove not used files, add benchmark for DQN and DDPG, add Parameters management Readme

* Update README.md

* Update README.md

* add parl dependence in examples, use np shuffle instead of sklean

* fix codestyle

* refine readme of nips example

* fix bug

* fix code style

* Update README.md

* Update README.md

* Update README.md

* refine document and remove outdated design doc

* Update README.md

* Update README.md

* refine comment

* release version 1.0

* gif of examples

* Update README.md

* update Readme

7a7583ab

15 1月, 2019 2 次提交

B
update readme for competition folder (#42) · 4163d732
由 Bo Zhou 提交于 1月 15, 2019
```
* Update README.md

* add experimental results
```
4163d732

NeurIPS2018-AI-for-Prosthetics-Challenge training code (#40) · cdb50056

由 Hongsheng Zeng 提交于 1月 15, 2019

* NeurIPS2018-AI-for-Prosthetics-Challenge training code

* remove model_zoo, provide download link

* remove model_zoo, provide download link

* add restore_from_one_head api, refine README, fix logger bug

* fix test bug

* fix rpm bug, refine ddpg train script

* fix rpm bug, refine Readme

cdb50056

04 1月, 2019 1 次提交

add PPO example (#39) · f8de849b

由 Hongsheng Zeng 提交于 1月 04, 2019

* add PPO example

* Update Readme

* Update Readme

* fix codestyle

* Update Readme

* refine action mapping

* add more unitest case

* remove unnecessary params initialize, add more comments, add benchmark result

* rename

* remove PARL dependence in readme of examples

f8de849b

28 12月, 2018 1 次提交

Provide synchronizable create_parameter in PARL (#38) · bd37f473

由 Hongsheng Zeng 提交于 12月 28, 2018

* Provide synchronizable create_parameter in PARL

* use AttrHold to make LayerFunc support more than two parameters

* refine code

* refine code

* fix #25

bd37f473

15 12月, 2018 1 次提交

Add DDPG example (#36) · 53c94787

由 Hongsheng Zeng 提交于 12月 15, 2018

* add DDPG example, fix some tiny bug

* add license

* unify code structure

* unify code structure

* refine gputils, fix seed in QuickStart

* use white noise in DDPG

* fix codestyle

53c94787

12 12月, 2018 1 次提交
- D
  add a episode in quick start to show the final test reward (#37) · 58e8fe28
  由 Davanoffi Liang 提交于 12月 12, 2018
```
* add a episode to show the final test reward

* make code more clear
```
  58e8fe28
07 12月, 2018 1 次提交

Add QuickStart example (#35) · cdd4622a

由 Hongsheng Zeng 提交于 12月 06, 2018

* add QuickStart example, refine DQN example

* add examples link

* refine the naming, and add quick start training result

cdd4622a

04 12月, 2018 2 次提交
- H
  DQN example (#33) · 4a4366a5
  由 Hongsheng Zeng 提交于 12月 04, 2018
```
* add DQN example, add Agent unittest

* refine readme

* refine  code

* simplify code
```
  4a4366a5
- B
  Update README.md (#34) · 5be4ca00
  由 Bo Zhou 提交于 12月 04, 2018
```
a more detailed example for DQN model.
```
  5be4ca00
30 11月, 2018 1 次提交

add testing module of NeurIPS2018-AI-for-Prosthetics-Challenge (#32) · b249dee3

由 Hongsheng Zeng 提交于 11月 30, 2018

* add testing module of NeurIPS2018-AI-for-Prosthetics-Challenge, add dependencies of setup

* add copyright

* add google drive link

* fix depedencie

* refine setup

b249dee3

29 11月, 2018 1 次提交

add introduction about abstractions and features in README and logo (#31) · ec005b50

由 Bo Zhou 提交于 11月 29, 2018

* Update README.md

* Update README.md

* add diagram/logo

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

ec005b50

27 11月, 2018 2 次提交
- B
  add setup.py for installation (#30) · c6f50c33
  由 Bo Zhou 提交于 11月 27, 2018
```
* add setup.py for installation

* rename agent.py to make it consistent with other framework base

* namespace bug
```
  c6f50c33
- H
  
  fix bug of logger (#29) · 3c46b7c4
  由 Hongsheng Zeng 提交于 11月 27, 2018
  
  3c46b7c4
26 11月, 2018 1 次提交

sync paras in program, fix deepcopy bug, python3 compatibility (#28) · e11b40c5

由 Hongsheng Zeng 提交于 11月 26, 2018

* sync paras in program, fix deepcopy bug, python3 compatibility

* refactor code, add plutil directory, clean import order

* remove old comment

* refine comment

* fix codestyle

* cache sync program, add gputils module, refine model_base unittest

* fix codestyle

* refine sync params cache

* add fetch_value module

e11b40c5

22 11月, 2018 1 次提交

add logger module (#27) · 942c3c5c

由 Hongsheng Zeng 提交于 11月 22, 2018

* add logger module

* refine comment

* add license

* refine set_level api

* refine unittest

* fix codestyle with yapf

* add termcolor dependency

942c3c5c

20 11月, 2018 1 次提交

redesign basic class in PARL (#26) · 1a1e1f03

由 Bo Zhou 提交于 11月 20, 2018

* redesign basic class in PARL

* code style fixed

* update yaml's version

* update yaml's version & update code to fix style problem

* add debug message for  function

* delete test code

* rename function: has_fun -> has_func

1a1e1f03

13 11月, 2018 1 次提交
- T
  
  support for Paddle-v1.1 (#24) · 2fc4e8c3
  由 TomorrowIsAnOtherDay 提交于 11月 13, 2018
  
  2fc4e8c3
30 9月, 2018 1 次提交
- T
  add IARL directory for camera-ready paper of CoRL2018 · b32f8940
  由 TomorrowIsAnOtherDay 提交于 9月 30, 2018
```
code will be released before 31.October (#20)
```
  b32f8940
11 9月, 2018 1 次提交

fix wrapper of dynamic_lstm cannot support h_0 and c_0 parameter (#17) · 8001db66

由 Hongsheng Zeng 提交于 9月 11, 2018

* fix wrapper of dynamic_lstm cannot support h_0 and c_0 initialization, fix bug of wrapper of dynamic_gru

* use sampling_id of fluid to sampling ids

* remove test simple games unittest, avoid timeout

* change pip source

8001db66

12 6月, 2018 1 次提交
- H
  added test_simple_games (#15) · 21a9efed
  由 Haonan 提交于 6月 11, 2018
```
added test_simple_games
```
  21a9efed
06 6月, 2018 1 次提交

preliminary implementations of the ComputationTask, Algorithm, and Model classes (#9) · 4b4b5824

由 Haonan 提交于 6月 05, 2018

* prelimary implementations of ComputationTask, Algorithm and Model classes

* remove "model_func" from the args of an algorithm

* a clean clone() function for Algorithm and Model

* add use_next_value as a input to learn()

* further re-structure

* added Feedforward and RLAlgorithm classes

* maxid -> argmax

* discrete_distribution -> category_distribution

* category -> categorical

* revisions

4b4b5824

01 6月, 2018 2 次提交
- X
  add design doc (#13) · 2ce57115
  由 Xiaochen Lian 提交于 5月 31, 2018
```
add design doc
```
  2ce57115
- L
  
  CI: Add dockerfile to install missing package. (#14) · 6fae20c4
  由 Lei Wang 提交于 5月 31, 2018
  
  6fae20c4
28 5月, 2018 1 次提交
- X
  A simple replay buffer (#5) · 406ffdbf
  由 Xiaochen Lian 提交于 5月 27, 2018
```
* simple replay buffer and its test

* add error handling

* add test for deep copy
```
  406ffdbf
22 5月, 2018 1 次提交
- L
  
  CI: enable ci task. (#10) · 7b0407b9
  由 Lei Wang 提交于 5月 21, 2018
  
  7b0407b9
18 5月, 2018 1 次提交
- H
  
  improve CMakeLists.txt (#8) · e60103f5
  由 Haonan 提交于 5月 18, 2018
  
  e60103f5