Commits · 3c511e8fa3cfecea94ef6b2b54453b5fb9c0764e · PaddlePaddle / PARL

17 Apr, 2019 1 commit

Committed by Hongsheng Zeng on Apr 17, 2019

* add IMPALA algorithm and some common utils

* update README.md

* refactor file structure of impala algorithm; separate numpy utils from utils

* add hyper parameter scheduler module; add entropy and lr scheduler in impala

* clip reward in atari wrapper instead of learner side; fix codestyle

* add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers

* Update README.md

* add a3c algorithm, A2C example and rl_utils

* require training in single gpu/cpu

* only check cpu/gpu num in learner

* refine Readme

* update impala benchmark picture; update Readme

* add benchmark result of A2C

* move get_params/set_params in agent_base

* add GA3C example

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* refine Readme

* add benchmark

* add default safe eps in numpy logp calculation

* refine document; make unittest stable

3c511e8f

08 Apr, 2019 1 commit

implementation of IMPALA with the newest parallel design (#60) · b28289ac

Committed by Hongsheng Zeng on Apr 08, 2019

* add IMPALA algorithm and some common utils

* update README.md

* refactor file structure of impala algorithm; separate numpy utils from utils

* add hyper parameter scheduler module; add entropy and lr scheduler in impala

* clip reward in atari wrapper instead of learner side; fix codestyle

* add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers

* Update README.md

b28289ac

18 Jan, 2019 1 commit

Refine documents of PARL (#43) · 7a7583ab

Committed by Hongsheng Zeng on Jan 18, 2019

* remove not used files, add benchmark for DQN and DDPG, add Parameters management Readme

* Update README.md

* Update README.md

* add parl dependency in examples, use np shuffle instead of sklearn

* fix codestyle

* refine readme of nips example

* fix bug

* fix code style

* Update README.md

* Update README.md

* Update README.md

* refine document and remove outdated design doc

* Update README.md

* Update README.md

* refine comment

* release version 1.0

* gif of examples

* Update README.md

* update Readme

7a7583ab

26 Nov, 2018 1 commit

sync paras in program, fix deepcopy bug, python3 compatibility (#28) · e11b40c5

Committed by Hongsheng Zeng on Nov 26, 2018

* sync paras in program, fix deepcopy bug, python3 compatibility

* refactor code, add plutil directory, clean import order

* remove old comment

* refine comment

* fix codestyle

* cache sync program, add gputils module, refine model_base unittest

* fix codestyle

* refine sync params cache

* add fetch_value module

e11b40c5

20 Nov, 2018 1 commit

redesign basic class in PARL (#26) · 1a1e1f03

Committed by Bo Zhou on Nov 20, 2018

* redesign basic class in PARL

* code style fixed

* update yaml's version

* update yaml's version & update code to fix style problem

* add debug message for  function

* delete test code

* rename function: has_fun -> has_func

1a1e1f03

11 Sep, 2018 1 commit

fix wrapper of dynamic_lstm cannot support h_0 and c_0 parameter (#17) · 8001db66

Committed by Hongsheng Zeng on Sep 11, 2018

* fix wrapper of dynamic_lstm cannot support h_0 and c_0 initialization, fix bug of wrapper of dynamic_gru

* use sampling_id of fluid to sampling ids

* remove test simple games unittest, avoid timeout

* change pip source

8001db66

12 Jun, 2018 1 commit

added test_simple_games (#15) · 21a9efed

Committed by Haonan on Jun 11, 2018

* added test_simple_games

21a9efed

06 Jun, 2018 1 commit

preliminary implementations of the ComputationTask, Algorithm, and Model classes (#9) · 4b4b5824

Committed by Haonan on Jun 05, 2018

* preliminary implementations of ComputationTask, Algorithm and Model classes

* remove "model_func" from the args of an algorithm

* a clean clone() function for Algorithm and Model

* add use_next_value as an input to learn()

* further re-structure

* added Feedforward and RLAlgorithm classes

* maxid -> argmax

* discrete_distribution -> category_distribution

* category -> categorical

* revisions

4b4b5824