提交 · 1cbcfb159809ba1775f5b0a8fc4ab7533fd6f25e · PaddlePaddle / PARL

21 8月, 2020 1 次提交

由 rical730 提交于 8月 21, 2020

* add torch coma

* add Apache License comment

* update readme

* update readme for installing sc2 on windows

* update readme

* add new line at the end of shell file

* update readme

* update readme of coma

* fix model_path

* self.algorithm to self.alg
Co-authored-by: NBo Zhou <2466956298@qq.com>

1cbcfb15

16 3月, 2020 1 次提交

update comments for ES (#211) · fa420300

由 Bo Zhou 提交于 3月 16, 2020

* update comments for ES

* check dependence on paddle or torch

* update readme

* update readme#2

* users can still use parl.remote when no DL framework was found

* yapf

fa420300

25 9月, 2019 1 次提交

torchdqn (#150) · 757cc391

由 fuyw 提交于 9月 25, 2019

* git commit -m torchdqn

* yapf

* fix bugs

* fix bugs

* fix bugs

* yapf

* remove fstring format

* torch_test yapf

* yapf

* Add torch in unittest.requirements

* update torch_unittest

* Torch and FLUID conflict problem in __init__.py

* Unittest fail for torch when both torch and fluid exists.

* cluster_test fail in the unittest, add timeout seconds.

* Torch backend for PARL

* add sleep time for unit test send_job_test.py

* Unit test for send_job_test.py

* use multiple try for unit test

* Fix compatibility for python2.7.

* fix send_job_test.py bugs

* check file exist before send_job_test.py

* Modify send_job_test.py

757cc391

24 7月, 2019 1 次提交

breaking changes#1 (#95) · 6efa7871

由 Bo Zhou 提交于 7月 24, 2019

* intra-version: move parl.framework into parl.core.fluid

* add folder: parl.core

* remove former test folders

* yapf

* yapf0.24

6efa7871

15 4月, 2019 1 次提交

A2C example (#62) · 39846831

由 Hongsheng Zeng 提交于 4月 15, 2019

* add IMPALA algorithm and some common utils

* update README.md

* refactor files structure of impala algorithm; seperate numpy utils from utils

* add hyper parameter scheduler module; add entropy and lr scheduler in impala

* clip reward in atari wrapper instead of learner side; fix codestyle

* add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers

* Update README.md

* add a3c algorithm, A2C example and rl_utils

* require training in single gpu/cpu

* only check cpu/gpu num in learner

* refine Readme

* update impala benchmark picture; update Readme

* add benchmark result of A2C

* move get_params/set_params in agent_base

* fix shell script cannot run in ubuntu

* refine comment and document

* Update README.md

* Update README.md

39846831

08 4月, 2019 1 次提交

implement of IMPALA with the newest parallel design (#60) · b28289ac

由 Hongsheng Zeng 提交于 4月 08, 2019

* add IMPALA algorithm and some common utils

* update README.md

* refactor files structure of impala algorithm; seperate numpy utils from utils

* add hyper parameter scheduler module; add entropy and lr scheduler in impala

* clip reward in atari wrapper instead of learner side; fix codestyle

* add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers

* Update README.md

b28289ac

04 1月, 2019 1 次提交

add PPO example (#39) · f8de849b

由 Hongsheng Zeng 提交于 1月 04, 2019

* add PPO example

* Update Readme

* Update Readme

* fix codestyle

* Update Readme

* refine action mapping

* add more unitest case

* remove unnecessary params initialize, add more comments, add benchmark result

* rename

* remove PARL dependence in readme of examples

f8de849b

15 12月, 2018 1 次提交

Add DDPG example (#36) · 53c94787

由 Hongsheng Zeng 提交于 12月 15, 2018

* add DDPG example, fix some tiny bug

* add license

* unify code structure

* unify code structure

* refine gputils, fix seed in QuickStart

* use white noise in DDPG

* fix codestyle

53c94787

07 12月, 2018 1 次提交

Add QuickStart example (#35) · cdd4622a

由 Hongsheng Zeng 提交于 12月 06, 2018

* add QuickStart example, refine DQN example

* add examples link

* refine the naming, and add quick start training result

cdd4622a

04 12月, 2018 1 次提交

DQN example (#33) · 4a4366a5

由 Hongsheng Zeng 提交于 12月 04, 2018

* add DQN example, add Agent unittest

* refine readme

* refine  code

* simplify code

4a4366a5

06 6月, 2018 1 次提交

preliminary implementations of the ComputationTask, Algorithm, and Model classes (#9) · 4b4b5824

由 Haonan 提交于 6月 05, 2018

* prelimary implementations of ComputationTask, Algorithm and Model classes

* remove "model_func" from the args of an algorithm

* a clean clone() function for Algorithm and Model

* add use_next_value as a input to learn()

* further re-structure

* added Feedforward and RLAlgorithm classes

* maxid -> argmax

* discrete_distribution -> category_distribution

* category -> categorical

* revisions

4b4b5824

17 5月, 2018 2 次提交
- H
  
  revisions · ad049bca
  由 haonanyu 提交于 5月 16, 2018
  
  ad049bca
- H
  
  parameter sharing in fluid with simple test cases · 1e32a717
  由 haonanyu 提交于 5月 14, 2018
  
  1e32a717