examples/IMPALA/learner.py · b28289ac6d14e4609014dc2127d1ccb7b57fb7ac · PaddlePaddle / PARL

implement of IMPALA with the newest parallel design (#60) · b28289ac

由 Hongsheng Zeng 提交于 4月 08, 2019

* add IMPALA algorithm and some common utils

* update README.md

* refactor files structure of impala algorithm; seperate numpy utils from utils

* add hyper parameter scheduler module; add entropy and lr scheduler in impala

* clip reward in atari wrapper instead of learner side; fix codestyle

* add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers

* Update README.md

b28289ac

learner.py 9.4 KB

PaddlePaddle / PARL

Replace learner.py