提交 · 3984683148a6bcac0501477478de622e043274d1 · PaddlePaddle / PARL

15 4月, 2019 1 次提交

由 Hongsheng Zeng 提交于 4月 15, 2019

* add IMPALA algorithm and some common utils

* update README.md

* refactor files structure of impala algorithm; seperate numpy utils from utils

* add hyper parameter scheduler module; add entropy and lr scheduler in impala

* clip reward in atari wrapper instead of learner side; fix codestyle

* add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers

* Update README.md

* add a3c algorithm, A2C example and rl_utils

* require training in single gpu/cpu

* only check cpu/gpu num in learner

* refine Readme

* update impala benchmark picture; update Readme

* add benchmark result of A2C

* move get_params/set_params in agent_base

* fix shell script cannot run in ubuntu

* refine comment and document

* Update README.md

* Update README.md

39846831

11 3月, 2019 1 次提交

update documents (#58) · d8449b74

由 Bo Zhou 提交于 3月 11, 2019

* Update README.md

* Update train.py

* Update README.md

* Update agent_base.py

* Update train.py

* Update train.py

* Update train.py

d8449b74

15 1月, 2019 1 次提交

NeurIPS2018-AI-for-Prosthetics-Challenge training code (#40) · cdb50056

由 Hongsheng Zeng 提交于 1月 15, 2019

* NeurIPS2018-AI-for-Prosthetics-Challenge training code

* remove model_zoo, provide download link

* remove model_zoo, provide download link

* add restore_from_one_head api, refine README, fix logger bug

* fix test bug

* fix rpm bug, refine ddpg train script

* fix rpm bug, refine Readme

cdb50056

04 12月, 2018 1 次提交

DQN example (#33) · 4a4366a5

由 Hongsheng Zeng 提交于 12月 04, 2018

* add DQN example, add Agent unittest

* refine readme

* refine  code

* simplify code

4a4366a5

27 11月, 2018 1 次提交

add setup.py for installation (#30) · c6f50c33

由 Bo Zhou 提交于 11月 27, 2018

* add setup.py for installation

* rename agent.py to make it consistent with other framework base

* namespace bug

c6f50c33

20 11月, 2018 1 次提交

redesign basic class in PARL (#26) · 1a1e1f03

由 Bo Zhou 提交于 11月 20, 2018

* redesign basic class in PARL

* code style fixed

* update yaml's version

* update yaml's version & update code to fix style problem

* add debug message for  function

* delete test code

* rename function: has_fun -> has_func

1a1e1f03

12 6月, 2018 1 次提交
- H
  added test_simple_games (#15) · 21a9efed
  由 Haonan 提交于 6月 11, 2018
```
added test_simple_games
```
  21a9efed
06 6月, 2018 1 次提交

preliminary implementations of the ComputationTask, Algorithm, and Model classes (#9) · 4b4b5824

由 Haonan 提交于 6月 05, 2018

* prelimary implementations of ComputationTask, Algorithm and Model classes

* remove "model_func" from the args of an algorithm

* a clean clone() function for Algorithm and Model

* add use_next_value as a input to learn()

* further re-structure

* added Feedforward and RLAlgorithm classes

* maxid -> argmax

* discrete_distribution -> category_distribution

* category -> categorical

* revisions

4b4b5824