1. 15 4月, 2019 1 次提交
    • H
      A2C example (#62) · 39846831
      Hongsheng Zeng 提交于
      * add IMPALA algorithm and some common utils
      
      * update README.md
      
      * refactor files structure of impala algorithm; seperate numpy utils from utils
      
      * add hyper parameter scheduler module; add entropy and lr scheduler in impala
      
      * clip reward in atari wrapper instead of learner side; fix codestyle
      
      * add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers
      
      * Update README.md
      
      * add a3c algorithm, A2C example and rl_utils
      
      * require training in single gpu/cpu
      
      * only check cpu/gpu num in learner
      
      * refine Readme
      
      * update impala benchmark picture; update Readme
      
      * add benchmark result of A2C
      
      * move get_params/set_params in agent_base
      
      * fix shell script cannot run in ubuntu
      
      * refine comment and document
      
      * Update README.md
      
      * Update README.md
      39846831
  2. 11 3月, 2019 1 次提交
    • B
      update documents (#58) · d8449b74
      Bo Zhou 提交于
      * Update README.md
      
      * Update train.py
      
      * Update README.md
      
      * Update agent_base.py
      
      * Update train.py
      
      * Update train.py
      
      * Update train.py
      d8449b74
  3. 15 1月, 2019 1 次提交
    • H
      NeurIPS2018-AI-for-Prosthetics-Challenge training code (#40) · cdb50056
      Hongsheng Zeng 提交于
      * NeurIPS2018-AI-for-Prosthetics-Challenge training code
      
      * remove model_zoo, provide download link
      
      * remove model_zoo, provide download link
      
      * add restore_from_one_head api, refine README, fix logger bug
      
      * fix test bug
      
      * fix rpm bug, refine ddpg train script
      
      * fix rpm bug, refine Readme
      cdb50056
  4. 04 12月, 2018 1 次提交
    • H
      DQN example (#33) · 4a4366a5
      Hongsheng Zeng 提交于
      * add DQN example, add Agent unittest
      
      * refine readme
      
      * refine  code
      
      * simplify code
      4a4366a5
  5. 27 11月, 2018 1 次提交
  6. 20 11月, 2018 1 次提交
    • B
      redesign basic class in PARL (#26) · 1a1e1f03
      Bo Zhou 提交于
      * redesign basic class in PARL
      
      * code style fixed
      
      * update yaml's version
      
      * update yaml's version & update code to fix style problem
      
      * add debug message for  function
      
      * delete test code
      
      * rename function: has_fun -> has_func
      1a1e1f03
  7. 12 6月, 2018 1 次提交
  8. 06 6月, 2018 1 次提交
    • H
      preliminary implementations of the ComputationTask, Algorithm, and Model classes (#9) · 4b4b5824
      Haonan 提交于
      * prelimary implementations of ComputationTask, Algorithm and Model classes
      
      * remove "model_func" from the args of an algorithm
      
      * a clean clone() function for Algorithm and Model
      
      * add use_next_value as a input to learn()
      
      * further re-structure
      
      * added Feedforward and RLAlgorithm classes
      
      * maxid -> argmax
      
      * discrete_distribution -> category_distribution
      
      * category -> categorical
      
      * revisions
      4b4b5824