1. 19 4月, 2019 3 次提交
  2. 18 4月, 2019 3 次提交
    • Q
      compatible queue definition (#66) · ed4f4322
      qinduohao 提交于
      * add compatible queue definition
      ed4f4322
    • H
      Refine (#67) · 3556c786
      Hongsheng Zeng 提交于
      * fix typo
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * soft depend on fluid; add module to monitor client status
      
      * improve performance of IMPALA example
      
      * fix bug of some client cannot exit normally
      
      * refine comment
      
      * .
      3556c786
    • B
      Add a Chinese documentation (#65) · 432d75b7
      Bo Zhou 提交于
      * Update README.md
      
      * Create README.cn.md
      
      * Update README.cn.md
      
      * Update README.cn.md
      
      * Update README.cn.md
      
      * Update README.cn.md
      
      * Update README.cn.md
      
      * Update README.cn.md
      
      * Update README.cn.md
      
      * Update README.cn.md
      
      * Update README.cn.md
      
      * Update README.cn.md
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update README.cn.md
      432d75b7
  3. 17 4月, 2019 1 次提交
    • H
      GA3C example (#63) · 3c511e8f
      Hongsheng Zeng 提交于
      * add IMPALA algorithm and some common utils
      
      * update README.md
      
      * refactor files structure of impala algorithm; seperate numpy utils from utils
      
      * add hyper parameter scheduler module; add entropy and lr scheduler in impala
      
      * clip reward in atari wrapper instead of learner side; fix codestyle
      
      * add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers
      
      * Update README.md
      
      * add a3c algorithm, A2C example and rl_utils
      
      * require training in single gpu/cpu
      
      * only check cpu/gpu num in learner
      
      * refine Readme
      
      * update impala benchmark picture; update Readme
      
      * add benchmark result of A2C
      
      * move get_params/set_params in agent_base
      
      * add GA3C example
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * refine Readme
      
      * add benchmark
      
      * add default safe eps in numpy logp calculation
      
      * refine document; make unittest stable
      3c511e8f
  4. 15 4月, 2019 1 次提交
    • H
      A2C example (#62) · 39846831
      Hongsheng Zeng 提交于
      * add IMPALA algorithm and some common utils
      
      * update README.md
      
      * refactor files structure of impala algorithm; seperate numpy utils from utils
      
      * add hyper parameter scheduler module; add entropy and lr scheduler in impala
      
      * clip reward in atari wrapper instead of learner side; fix codestyle
      
      * add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers
      
      * Update README.md
      
      * add a3c algorithm, A2C example and rl_utils
      
      * require training in single gpu/cpu
      
      * only check cpu/gpu num in learner
      
      * refine Readme
      
      * update impala benchmark picture; update Readme
      
      * add benchmark result of A2C
      
      * move get_params/set_params in agent_base
      
      * fix shell script cannot run in ubuntu
      
      * refine comment and document
      
      * Update README.md
      
      * Update README.md
      39846831
  5. 13 4月, 2019 1 次提交
    • B
      add some introduction for our parallelization feature (#61) · 452050a0
      Bo Zhou 提交于
      * Update remote_decorator.py
      
      * Update README.md
      
      * add an figure for the demonstration about parallelization
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * add a link to IMPALA
      452050a0
  6. 08 4月, 2019 1 次提交
    • H
      implement of IMPALA with the newest parallel design (#60) · b28289ac
      Hongsheng Zeng 提交于
      * add IMPALA algorithm and some common utils
      
      * update README.md
      
      * refactor files structure of impala algorithm; seperate numpy utils from utils
      
      * add hyper parameter scheduler module; add entropy and lr scheduler in impala
      
      * clip reward in atari wrapper instead of learner side; fix codestyle
      
      * add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers
      
      * Update README.md
      b28289ac
  7. 26 3月, 2019 1 次提交
    • H
      add api set_params/get_params in Model (#56) · 7346a23d
      Hongsheng Zeng 提交于
      * add api set_params/get_params in Model; add Interface of Network and LayerFunc to solve circular imports; refine parameter_names api of Model
      
      * remove licence in third party code; remove interface of Network and LayerFunc; move get_parameter_pairs and get_parameter_names api to Network
      
      * refine comment
      
      * refine commment
      7346a23d
  8. 11 3月, 2019 1 次提交
    • B
      update documents (#58) · d8449b74
      Bo Zhou 提交于
      * Update README.md
      
      * Update train.py
      
      * Update README.md
      
      * Update agent_base.py
      
      * Update train.py
      
      * Update train.py
      
      * Update train.py
      d8449b74
  9. 07 3月, 2019 1 次提交
    • H
      new feature: parl.remote (#54) · 348db1fb
      Hongsheng Zeng 提交于
      * refine remote module, add heartbeat machanism and unittest
      
      * yapf
      
      * yapf
      
      * support get ip address in CentOS, add dependence
      
      * yapf
      
      * add dependence in Dockerfile
      
      * refine message_tag, Compatible with Python2 and python3
      
      * refine unittest and comments
      
      * remove ParlError, use to_pybytes api to compatible with Python 2 and python 3
      
      * Not need to use to_pybytes
      
      * use parl-test docker image for unittest, which has python2 and python3 env
      
      * test different release order of sockets
      
      * test for different closing way fo context and socket
      
      * tmp commit for debug in teamcity
      
      * tmp commit for debug in teamcity
      
      * tmp commit for debug in teamcity
      
      * use zmq.context destroy to close multi-thread socket, refine RemoteError
      
      * set linger=0 for command socket in RemoteObject
      
      * remove close context unittest
      
      * fix codestyle
      
      * fix codestyle
      
      * rename parl.remote to parl.remote_class; will not exit client when having errors in function call; use sepereate server port in unittest to avoiding closing server manually
      
      * rename parl.remote to parl.remote_class; will not exit client when having errors in function call; use sepereate server port in unittest to avoiding closing server manually
      
      * fix typo
      
      * remove unnecessary try/except in reply loop of client
      
      * import RemoteManager to parl; refine comment
      348db1fb
  10. 05 3月, 2019 1 次提交
    • H
      run unittest in python2 and python3 (#55) · e80604f8
      Hongsheng Zeng 提交于
      * run unittest in python2 and python3
      
      * refine structure of repo
      
      * refine structure of repo
      
      * add --fix-misssing
      
      * fix teamcity
      
      * add --fix-misssing
      
      * update paddle version in python2
      e80604f8
  11. 01 3月, 2019 1 次提交
  12. 27 2月, 2019 1 次提交
    • B
      first version of network communication (#49) · bbde58fb
      Bo Zhou 提交于
      * first version of network communication
      
      * fix code styple problems
      
      * add a script to get machine's information
      
      * code styple problems#2
      
      * fix unit test problems
      
      * update dockfile to fix the installation issue of cmake
      
      * thread-saftey ensurance & copright
      
      * resolve comments
      bbde58fb
  13. 14 2月, 2019 1 次提交
  14. 24 1月, 2019 1 次提交
  15. 18 1月, 2019 1 次提交
    • H
      Refine documents of PARL (#43) · 7a7583ab
      Hongsheng Zeng 提交于
      * remove not used files, add benchmark for DQN and DDPG, add Parameters management Readme
      
      * Update README.md
      
      * Update README.md
      
      * add parl dependence in examples, use np shuffle instead of sklean
      
      * fix codestyle
      
      * refine readme of nips example
      
      * fix bug
      
      * fix code style
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * refine document and remove outdated design doc
      
      * Update README.md
      
      * Update README.md
      
      * refine comment
      
      * release version 1.0
      
      * gif of examples
      
      * Update README.md
      
      * update Readme
      7a7583ab
  16. 15 1月, 2019 2 次提交
  17. 04 1月, 2019 1 次提交
    • H
      add PPO example (#39) · f8de849b
      Hongsheng Zeng 提交于
      * add PPO example
      
      * Update Readme
      
      * Update Readme
      
      * fix codestyle
      
      * Update Readme
      
      * refine action mapping
      
      * add more unitest case
      
      * remove unnecessary params initialize, add more comments, add benchmark result
      
      * rename
      
      * remove PARL dependence in readme of examples
      f8de849b
  18. 28 12月, 2018 1 次提交
  19. 15 12月, 2018 1 次提交
    • H
      Add DDPG example (#36) · 53c94787
      Hongsheng Zeng 提交于
      * add DDPG example, fix some tiny bug
      
      * add license
      
      * unify code structure
      
      * unify code structure
      
      * refine gputils, fix seed in QuickStart
      
      * use white noise in DDPG
      
      * fix codestyle
      53c94787
  20. 12 12月, 2018 1 次提交
  21. 07 12月, 2018 1 次提交
    • H
      Add QuickStart example (#35) · cdd4622a
      Hongsheng Zeng 提交于
      * add QuickStart example, refine DQN example
      
      * add examples link
      
      * refine the naming, and add quick start training result
      cdd4622a
  22. 04 12月, 2018 2 次提交
  23. 30 11月, 2018 1 次提交
  24. 29 11月, 2018 1 次提交
  25. 27 11月, 2018 2 次提交
  26. 26 11月, 2018 1 次提交
    • H
      sync paras in program, fix deepcopy bug, python3 compatibility (#28) · e11b40c5
      Hongsheng Zeng 提交于
      * sync paras in program, fix deepcopy bug, python3 compatibility
      
      * refactor code, add plutil directory, clean import order
      
      * remove old comment
      
      * refine comment
      
      * fix codestyle
      
      * cache sync program, add gputils module, refine model_base unittest
      
      * fix codestyle
      
      * refine sync params cache
      
      * add fetch_value module
      e11b40c5
  27. 22 11月, 2018 1 次提交
    • H
      add logger module (#27) · 942c3c5c
      Hongsheng Zeng 提交于
      * add logger module
      
      * refine comment
      
      * add license
      
      * refine set_level api
      
      * refine unittest
      
      * fix codestyle with yapf
      
      * add termcolor dependency
      942c3c5c
  28. 20 11月, 2018 1 次提交
    • B
      redesign basic class in PARL (#26) · 1a1e1f03
      Bo Zhou 提交于
      * redesign basic class in PARL
      
      * code style fixed
      
      * update yaml's version
      
      * update yaml's version & update code to fix style problem
      
      * add debug message for  function
      
      * delete test code
      
      * rename function: has_fun -> has_func
      1a1e1f03
  29. 13 11月, 2018 1 次提交
  30. 30 9月, 2018 1 次提交
  31. 11 9月, 2018 1 次提交
  32. 12 6月, 2018 1 次提交
  33. 06 6月, 2018 1 次提交
    • H
      preliminary implementations of the ComputationTask, Algorithm, and Model classes (#9) · 4b4b5824
      Haonan 提交于
      * prelimary implementations of ComputationTask, Algorithm and Model classes
      
      * remove "model_func" from the args of an algorithm
      
      * a clean clone() function for Algorithm and Model
      
      * add use_next_value as a input to learn()
      
      * further re-structure
      
      * added Feedforward and RLAlgorithm classes
      
      * maxid -> argmax
      
      * discrete_distribution -> category_distribution
      
      * category -> categorical
      
      * revisions
      4b4b5824