1. 09 3月, 2020 1 次提交
    • R
      update parl.maddpg without import gym (#208) · 7f2abd56
      rical730 提交于
      * add maddpg example
      
      * format with yapf
      
      * fix coding style
      
      * fix coding style
      
      * unittest without import multiagent env
      
      * update maddpg code
      
      * update maddpg readme
      
      * add copyright comments
      
      * update parl.maddpg without import gym
      
      * update NeurlIPS2018.gif to NeurlIPS2019.gif
      
      * update readme and comments
      7f2abd56
  2. 03 3月, 2020 1 次提交
  3. 08 2月, 2020 1 次提交
    • R
      add maddpg example (#200) · 9216d941
      rical730 提交于
      * add maddpg example
      
      * format with yapf
      
      * fix coding style
      
      * fix coding style
      
      * unittest without import multiagent env
      
      * update maddpg code
      
      * update maddpg readme
      
      * add copyright comments
      9216d941
  4. 30 12月, 2019 1 次提交
  5. 11 12月, 2019 1 次提交
  6. 27 11月, 2019 1 次提交
  7. 22 11月, 2019 1 次提交
  8. 06 11月, 2019 1 次提交
    • L
      add pytorch a2c (#167) · 4abc0534
      LI Yunxiang 提交于
      * add pytorch a2c
      
      * add set/get_weights test & copyright
      
      * yapf....
      
      * Update model_base_test_torch.py
      
      * update
      
      * Delete banma.py
      
      * Update model_base_test_torch.py
      
      * update
      
      * Update model.py
      
      * update torch tests
      
      * Update model_base_test_torch.py
      4abc0534
  9. 24 10月, 2019 1 次提交
  10. 25 9月, 2019 1 次提交
    • F
      torchdqn (#150) · 757cc391
      fuyw 提交于
      * git commit -m torchdqn
      
      * yapf
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * yapf
      
      * remove fstring format
      
      * torch_test yapf
      
      * yapf
      
      * Add torch in unittest.requirements
      
      * update torch_unittest
      
      * Torch and FLUID conflict problem in __init__.py
      
      * Unittest fail for torch when both torch and fluid exists.
      
      * cluster_test fail in the unittest, add timeout seconds.
      
      * Torch backend for PARL
      
      * add sleep time for unit test send_job_test.py
      
      * Unit test for send_job_test.py
      
      * use multiple try for unit test
      
      * Fix compatibility for python2.7.
      
      * fix send_job_test.py bugs
      
      * check file exist before send_job_test.py
      
      * Modify send_job_test.py
      757cc391
  11. 12 8月, 2019 1 次提交
  12. 24 7月, 2019 2 次提交
  13. 15 4月, 2019 1 次提交
    • H
      A2C example (#62) · 39846831
      Hongsheng Zeng 提交于
      * add IMPALA algorithm and some common utils
      
      * update README.md
      
      * refactor files structure of impala algorithm; seperate numpy utils from utils
      
      * add hyper parameter scheduler module; add entropy and lr scheduler in impala
      
      * clip reward in atari wrapper instead of learner side; fix codestyle
      
      * add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers
      
      * Update README.md
      
      * add a3c algorithm, A2C example and rl_utils
      
      * require training in single gpu/cpu
      
      * only check cpu/gpu num in learner
      
      * refine Readme
      
      * update impala benchmark picture; update Readme
      
      * add benchmark result of A2C
      
      * move get_params/set_params in agent_base
      
      * fix shell script cannot run in ubuntu
      
      * refine comment and document
      
      * Update README.md
      
      * Update README.md
      39846831
  14. 08 4月, 2019 1 次提交
    • H
      implement of IMPALA with the newest parallel design (#60) · b28289ac
      Hongsheng Zeng 提交于
      * add IMPALA algorithm and some common utils
      
      * update README.md
      
      * refactor files structure of impala algorithm; seperate numpy utils from utils
      
      * add hyper parameter scheduler module; add entropy and lr scheduler in impala
      
      * clip reward in atari wrapper instead of learner side; fix codestyle
      
      * add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers
      
      * Update README.md
      b28289ac
  15. 04 1月, 2019 1 次提交
    • H
      add PPO example (#39) · f8de849b
      Hongsheng Zeng 提交于
      * add PPO example
      
      * Update Readme
      
      * Update Readme
      
      * fix codestyle
      
      * Update Readme
      
      * refine action mapping
      
      * add more unitest case
      
      * remove unnecessary params initialize, add more comments, add benchmark result
      
      * rename
      
      * remove PARL dependence in readme of examples
      f8de849b
  16. 15 12月, 2018 1 次提交
    • H
      Add DDPG example (#36) · 53c94787
      Hongsheng Zeng 提交于
      * add DDPG example, fix some tiny bug
      
      * add license
      
      * unify code structure
      
      * unify code structure
      
      * refine gputils, fix seed in QuickStart
      
      * use white noise in DDPG
      
      * fix codestyle
      53c94787
  17. 07 12月, 2018 1 次提交
    • H
      Add QuickStart example (#35) · cdd4622a
      Hongsheng Zeng 提交于
      * add QuickStart example, refine DQN example
      
      * add examples link
      
      * refine the naming, and add quick start training result
      cdd4622a
  18. 04 12月, 2018 1 次提交
    • H
      DQN example (#33) · 4a4366a5
      Hongsheng Zeng 提交于
      * add DQN example, add Agent unittest
      
      * refine readme
      
      * refine  code
      
      * simplify code
      4a4366a5