- 08 2月, 2020 1 次提交
-
-
由 rical730 提交于
* add maddpg example * format with yapf * fix coding style * fix coding style * unittest without import multiagent env * update maddpg code * update maddpg readme * add copyright comments
-
- 30 12月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
* add sac
-
- 22 11月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
* add TD3 * update * yapf..... * Update train.py
-
- 24 10月, 2019 1 次提交
-
-
由 LI Yunxiang 提交于
* add Double & Dueling DQN * yapf...................... * update * Update train.py
-
- 24 7月, 2019 1 次提交
-
-
由 Bo Zhou 提交于
* intra-version: move parl.framework into parl.core.fluid * add folder: parl.core * remove former test folders * yapf * yapf0.24
-
- 08 4月, 2019 1 次提交
-
-
由 Hongsheng Zeng 提交于
* add IMPALA algorithm and some common utils * update README.md * refactor files structure of impala algorithm; seperate numpy utils from utils * add hyper parameter scheduler module; add entropy and lr scheduler in impala * clip reward in atari wrapper instead of learner side; fix codestyle * add benchmark result of impala; refine code of impala example; add obs_format in atari_wrappers * Update README.md
-
- 26 11月, 2018 1 次提交
-
-
由 Hongsheng Zeng 提交于
* sync paras in program, fix deepcopy bug, python3 compatibility * refactor code, add plutil directory, clean import order * remove old comment * refine comment * fix codestyle * cache sync program, add gputils module, refine model_base unittest * fix codestyle * refine sync params cache * add fetch_value module
-
- 20 11月, 2018 1 次提交
-
-
由 Bo Zhou 提交于
* redesign basic class in PARL * code style fixed * update yaml's version * update yaml's version & update code to fix style problem * add debug message for function * delete test code * rename function: has_fun -> has_func
-
- 06 6月, 2018 1 次提交
-
-
由 Haonan 提交于
* prelimary implementations of ComputationTask, Algorithm and Model classes * remove "model_func" from the args of an algorithm * a clean clone() function for Algorithm and Model * add use_next_value as a input to learn() * further re-structure * added Feedforward and RLAlgorithm classes * maxid -> argmax * discrete_distribution -> category_distribution * category -> categorical * revisions
-
- 17 5月, 2018 2 次提交