提交 · f8de849bc8206ecdc919d1a140e306384a5e568f · PaddlePaddle / PARL

04 1月, 2019 1 次提交

由 Hongsheng Zeng 提交于 1月 04, 2019

* add PPO example

* Update Readme

* Update Readme

* fix codestyle

* Update Readme

* refine action mapping

* add more unitest case

* remove unnecessary params initialize, add more comments, add benchmark result

* rename

* remove PARL dependence in readme of examples

f8de849b

15 12月, 2018 1 次提交

Add DDPG example (#36) · 53c94787

由 Hongsheng Zeng 提交于 12月 15, 2018

* add DDPG example, fix some tiny bug

* add license

* unify code structure

* unify code structure

* refine gputils, fix seed in QuickStart

* use white noise in DDPG

* fix codestyle

53c94787

07 12月, 2018 1 次提交

Add QuickStart example (#35) · cdd4622a

由 Hongsheng Zeng 提交于 12月 06, 2018

* add QuickStart example, refine DQN example

* add examples link

* refine the naming, and add quick start training result

cdd4622a

04 12月, 2018 1 次提交

DQN example (#33) · 4a4366a5

由 Hongsheng Zeng 提交于 12月 04, 2018

* add DQN example, add Agent unittest

* refine readme

* refine  code

* simplify code

4a4366a5

06 6月, 2018 1 次提交

preliminary implementations of the ComputationTask, Algorithm, and Model classes (#9) · 4b4b5824

由 Haonan 提交于 6月 05, 2018

* prelimary implementations of ComputationTask, Algorithm and Model classes

* remove "model_func" from the args of an algorithm

* a clean clone() function for Algorithm and Model

* add use_next_value as a input to learn()

* further re-structure

* added Feedforward and RLAlgorithm classes

* maxid -> argmax

* discrete_distribution -> category_distribution

* category -> categorical

* revisions

4b4b5824

17 5月, 2018 2 次提交
- H
  
  revisions · ad049bca
  由 haonanyu 提交于 5月 16, 2018
  
  ad049bca
- H
  
  parameter sharing in fluid with simple test cases · 1e32a717
  由 haonanyu 提交于 5月 14, 2018
  
  1e32a717