Commits · 9216d9413e5568f0e6fcf78aae7e07cba54a8aee · PaddlePaddle / PARL

08 Feb, 2020 1 commit

Committed by rical730 on Feb 08, 2020

* add maddpg example

* format with yapf

* fix coding style

* fix coding style

* unittest without import multiagent env

* update maddpg code

* update maddpg readme

* add copyright comments

9216d941

30 Dec, 2019 1 commit

add sac (#188) · c070db83

Committed by LI Yunxiang on Dec 30, 2019

* add sac

c070db83

22 Nov, 2019 1 commit

add TD3 (#175) · 6e7f862e

Committed by LI Yunxiang on Nov 22, 2019

* add TD3

* update

* yapf.....

* Update train.py

6e7f862e

24 Oct, 2019 1 commit

add Double & Dueling DQN (#163) · bb9b78b4

Committed by LI Yunxiang on Oct 24, 2019

* add Double & Dueling DQN

* yapf......................

* update

* Update train.py

bb9b78b4

24 Jul, 2019 1 commit

breaking changes#1 (#95) · 6efa7871

Committed by Bo Zhou on Jul 24, 2019

* intra-version: move parl.framework into parl.core.fluid

* add folder: parl.core

* remove former test folders

* yapf

* yapf0.24

6efa7871

08 Apr, 2019 1 commit

implement of IMPALA with the newest parallel design (#60) · b28289ac

Committed by Hongsheng Zeng on Apr 08, 2019

* add IMPALA algorithm and some common utils

* update README.md

* refactor files structure of impala algorithm; separate numpy utils from utils

* add hyper parameter scheduler module; add entropy and lr scheduler in impala

* clip reward in atari wrapper instead of learner side; fix codestyle

* add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers

* Update README.md

b28289ac

26 Nov, 2018 1 commit

sync paras in program, fix deepcopy bug, python3 compatibility (#28) · e11b40c5

Committed by Hongsheng Zeng on Nov 26, 2018

* sync paras in program, fix deepcopy bug, python3 compatibility

* refactor code, add plutil directory, clean import order

* remove old comment

* refine comment

* fix codestyle

* cache sync program, add gputils module, refine model_base unittest

* fix codestyle

* refine sync params cache

* add fetch_value module

e11b40c5

20 Nov, 2018 1 commit

redesign basic class in PARL (#26) · 1a1e1f03

Committed by Bo Zhou on Nov 20, 2018

* redesign basic class in PARL

* code style fixed

* update yaml's version

* update yaml's version & update code to fix style problem

* add debug message for  function

* delete test code

* rename function: has_fun -> has_func

1a1e1f03

06 Jun, 2018 1 commit

preliminary implementations of the ComputationTask, Algorithm, and Model classes (#9) · 4b4b5824

Committed by Haonan on Jun 05, 2018

* preliminary implementations of ComputationTask, Algorithm and Model classes

* remove "model_func" from the args of an algorithm

* a clean clone() function for Algorithm and Model

* add use_next_value as an input to learn()

* further re-structure

* added Feedforward and RLAlgorithm classes

* maxid -> argmax

* discrete_distribution -> category_distribution

* category -> categorical

* revisions

4b4b5824

17 May, 2018 2 commits

revisions · ad049bca

Committed by haonanyu on May 16, 2018

ad049bca

parameter sharing in fluid with simple test cases · 1e32a717

Committed by haonanyu on May 14, 2018

1e32a717