提交 · 40a0ab3f484575ed2fc8a2ffa0329828a78d8539 · PaddlePaddle / PARL

18 3月, 2020 1 次提交

由 Hongsheng Zeng 提交于 3月 18, 2020

* liftsim a2c baseline

* update readme

* compatible with different os

* empty

* refine comments

* remove unnecessary assertion; add tensorboard guide

* remove unnecessary assertion

* update parl dependence of A2C

6b70b81d

02 8月, 2019 1 次提交

first pr (#113) · b29a1ec1

由 fuyw 提交于 8月 02, 2019

* first pr

* start a worker when the master is started.

* First PR & Fix logger bugs.

* update docs for a2c, impala and ga3c

* update doc

* yapf modification

* update logger

* yapf correct

* yapf

* setup.py

* old setup.py

* worker 86

b29a1ec1

18 6月, 2019 1 次提交

refine A2C example (#80) · 255ef4f7

由 Hongsheng Zeng 提交于 6月 18, 2019

* refine A2C example

* fix unittest in python2; fix codestyle

* fix codestyle

* refine comment

255ef4f7

15 4月, 2019 1 次提交

A2C example (#62) · 39846831

由 Hongsheng Zeng 提交于 4月 15, 2019

* add IMPALA algorithm and some common utils

* update README.md

* refactor files structure of impala algorithm; seperate numpy utils from utils

* add hyper parameter scheduler module; add entropy and lr scheduler in impala

* clip reward in atari wrapper instead of learner side; fix codestyle

* add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers

* Update README.md

* add a3c algorithm, A2C example and rl_utils

* require training in single gpu/cpu

* only check cpu/gpu num in learner

* refine Readme

* update impala benchmark picture; update Readme

* add benchmark result of A2C

* move get_params/set_params in agent_base

* fix shell script cannot run in ubuntu

* refine comment and document

* Update README.md

* Update README.md

39846831

08 4月, 2019 1 次提交

implement of IMPALA with the newest parallel design (#60) · b28289ac

由 Hongsheng Zeng 提交于 4月 08, 2019

* add IMPALA algorithm and some common utils

* update README.md

* refactor files structure of impala algorithm; seperate numpy utils from utils

* add hyper parameter scheduler module; add entropy and lr scheduler in impala

* clip reward in atari wrapper instead of learner side; fix codestyle

* add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers

* Update README.md

b28289ac