1. 21 12月, 2019 1 次提交
  2. 17 12月, 2019 1 次提交
  3. 11 12月, 2019 1 次提交
  4. 09 12月, 2019 1 次提交
  5. 04 12月, 2019 1 次提交
  6. 22 11月, 2019 1 次提交
  7. 18 11月, 2019 1 次提交
  8. 16 11月, 2019 1 次提交
    • H
      make job run task in a separate process (#170) · 64aebb6d
      Hongsheng Zeng 提交于
      * make job run task in a separate process
      
      * fix typo
      
      * add more debug info in xparl client
      
      * refine control flow of different processes in xparl job
      
      * refine control flow of different processes in xparl job
      
      * remove tsinghua source
      
      * remove tsinghua source
      
      * remove unnecessary logic
      
      * fix typo
      
      * refine comments and some logic
      
      * fix bug, `decay=0` means totally synchronize weights of source model to target model
      64aebb6d
  9. 11 11月, 2019 1 次提交
  10. 04 11月, 2019 1 次提交
  11. 29 10月, 2019 1 次提交
  12. 24 10月, 2019 1 次提交
  13. 25 9月, 2019 1 次提交
  14. 17 9月, 2019 1 次提交
  15. 26 8月, 2019 1 次提交
    • B
      fix minor problems in the docs (#138) · b6122aa2
      Bo Zhou 提交于
      * fix minor probmels in the docs
      
      * typo
      
      * remove pip source
      
      * fix monitor
      
      * add performance of A2C
      
      * Update README.md
      
      * modify logger for GPU detection
      b6122aa2
  16. 13 8月, 2019 1 次提交
    • B
      Zhoubo01 es (#127) · 5612ecde
      Bo Zhou 提交于
      * add learning curve for ES
      
      * add learning curve for ES
      
      * support new APIs of the cluster
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * rename learner.py
      
      * Update README.md
      
      * Update README.md
      
      * Update README.cn.md
      
      * Update README.md
      
      * Update README.cn.md
      
      * Update README.md
      5612ecde
  17. 12 8月, 2019 1 次提交
    • H
      ES example (#105) · 60d68135
      Hongsheng Zeng 提交于
      * ES example
      
      * refine settings
      
      * fix yapf
      
      * refine documentation; remove csv logger
      
      * fix bug
      
      * merge learner.py and train.py; add version requirements of gym and atari_py
      
      * unify actor num
      60d68135
  18. 06 8月, 2019 1 次提交
    • L
      add new_alg.rst (#123) · a7670972
      LI Yunxiang 提交于
      * add new_alg.rst
      
      * rename LiftSim_demo as LiftSim_baseline
      
      * Update new_alg.rst
      
      * Update new_alg.rst
      a7670972
  19. 05 8月, 2019 1 次提交
    • L
      add liftsim baseline (#120) · c1646351
      LI Yunxiang 提交于
      * add liftsim baseline
      
      * yapf
      
      * yapf...
      
      * modify acc. comments
      
      * yapf
      
      * yapf..........
      
      * yapf!
      
      why is yapf on paddle different from that on my mac!!!!!
      c1646351
  20. 02 8月, 2019 1 次提交
    • F
      first pr (#113) · b29a1ec1
      fuyw 提交于
      * first pr
      
      * start a worker when the master is started.
      
      * First PR & Fix logger bugs.
      
      * update docs for a2c, impala and ga3c
      
      * update doc
      
      * yapf modification
      
      * update logger
      
      * yapf correct
      
      * yapf
      
      * setup.py
      
      * old setup.py
      
      * worker 86
      b29a1ec1
  21. 01 8月, 2019 1 次提交
    • B
      Save params (#107) · 7dafee77
      Bo Zhou 提交于
      * new feature: save params
      
      * add unittest for save()/retore()
      
      * add an example demonstrating the usage
      
      * rename the variable
      
      * yapf
      
      * fix comment
      7dafee77
  22. 29 7月, 2019 1 次提交
  23. 26 7月, 2019 1 次提交
    • B
      replace PE with compiler(new feature in paddle151). (#99) · d33f3002
      Bo Zhou 提交于
      * fix the compatibility issue
      
      * fix the comment issue
      
      * support paddle 1.5.1 and replace PE with compiler
      
      * yapf&copyright
      
      * yapf
      
      * fix the teamcity problem
      
      * fix the teamcity problem
      
      * fix comment
      
      * only support paddle 1.5.1
      
      * Cmake
      
      * fix comment
      d33f3002
  24. 25 7月, 2019 1 次提交
  25. 24 7月, 2019 2 次提交
  26. 10 7月, 2019 1 次提交
    • B
      make the quickstart more compact (#88) · 9dc152f0
      Bo Zhou 提交于
      * make the quickstart more compact
      
      * remove args in the main function
      
      * yapf
      
      * add gif
      
      * remove render
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      9dc152f0
  27. 05 7月, 2019 1 次提交
    • B
      Documents cn (#85) · 96c58265
      Bo Zhou 提交于
      * Update README.cn.md
      
      * Update README.md
      
      * Update README.md
      
      * Update README.cn.md
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      96c58265
  28. 18 6月, 2019 1 次提交
  29. 23 4月, 2019 1 次提交
  30. 19 4月, 2019 1 次提交
  31. 18 4月, 2019 1 次提交
    • H
      Refine (#67) · 3556c786
      Hongsheng Zeng 提交于
      * fix typo
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * soft depend on fluid; add module to monitor client status
      
      * improve performance of IMPALA example
      
      * fix bug of some client cannot exit normally
      
      * refine comment
      
      * .
      3556c786
  32. 17 4月, 2019 1 次提交
    • H
      GA3C example (#63) · 3c511e8f
      Hongsheng Zeng 提交于
      * add IMPALA algorithm and some common utils
      
      * update README.md
      
      * refactor files structure of impala algorithm; seperate numpy utils from utils
      
      * add hyper parameter scheduler module; add entropy and lr scheduler in impala
      
      * clip reward in atari wrapper instead of learner side; fix codestyle
      
      * add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers
      
      * Update README.md
      
      * add a3c algorithm, A2C example and rl_utils
      
      * require training in single gpu/cpu
      
      * only check cpu/gpu num in learner
      
      * refine Readme
      
      * update impala benchmark picture; update Readme
      
      * add benchmark result of A2C
      
      * move get_params/set_params in agent_base
      
      * add GA3C example
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * refine Readme
      
      * add benchmark
      
      * add default safe eps in numpy logp calculation
      
      * refine document; make unittest stable
      3c511e8f
  33. 15 4月, 2019 1 次提交
    • H
      A2C example (#62) · 39846831
      Hongsheng Zeng 提交于
      * add IMPALA algorithm and some common utils
      
      * update README.md
      
      * refactor files structure of impala algorithm; seperate numpy utils from utils
      
      * add hyper parameter scheduler module; add entropy and lr scheduler in impala
      
      * clip reward in atari wrapper instead of learner side; fix codestyle
      
      * add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers
      
      * Update README.md
      
      * add a3c algorithm, A2C example and rl_utils
      
      * require training in single gpu/cpu
      
      * only check cpu/gpu num in learner
      
      * refine Readme
      
      * update impala benchmark picture; update Readme
      
      * add benchmark result of A2C
      
      * move get_params/set_params in agent_base
      
      * fix shell script cannot run in ubuntu
      
      * refine comment and document
      
      * Update README.md
      
      * Update README.md
      39846831
  34. 08 4月, 2019 1 次提交
    • H
      implement of IMPALA with the newest parallel design (#60) · b28289ac
      Hongsheng Zeng 提交于
      * add IMPALA algorithm and some common utils
      
      * update README.md
      
      * refactor files structure of impala algorithm; seperate numpy utils from utils
      
      * add hyper parameter scheduler module; add entropy and lr scheduler in impala
      
      * clip reward in atari wrapper instead of learner side; fix codestyle
      
      * add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers
      
      * Update README.md
      b28289ac
  35. 26 3月, 2019 1 次提交
    • H
      add api set_params/get_params in Model (#56) · 7346a23d
      Hongsheng Zeng 提交于
      * add api set_params/get_params in Model; add Interface of Network and LayerFunc to solve circular imports; refine parameter_names api of Model
      
      * remove licence in third party code; remove interface of Network and LayerFunc; move get_parameter_pairs and get_parameter_names api to Network
      
      * refine comment
      
      * refine commment
      7346a23d
  36. 11 3月, 2019 1 次提交
    • B
      update documents (#58) · d8449b74
      Bo Zhou 提交于
      * Update README.md
      
      * Update train.py
      
      * Update README.md
      
      * Update agent_base.py
      
      * Update train.py
      
      * Update train.py
      
      * Update train.py
      d8449b74
  37. 01 3月, 2019 1 次提交
  38. 14 2月, 2019 1 次提交
  39. 24 1月, 2019 1 次提交