• H
    implement of IMPALA with the newest parallel design (#60) · b28289ac
    Hongsheng Zeng 提交于
    * add IMPALA algorithm and some common utils
    
    * update README.md
    
    * refactor files structure of impala algorithm; seperate numpy utils from utils
    
    * add hyper parameter scheduler module; add entropy and lr scheduler in impala
    
    * clip reward in atari wrapper instead of learner side; fix codestyle
    
    * add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers
    
    * Update README.md
    b28289ac
policy_distribution.py 3.7 KB