OpenDILab开源决策智能平台 / DI-engine
上一次同步 2 年多

56

0

代码
- 文件
- 提交
- 分支
- Tags
- 贡献者
- 分支图
- Diff
Issue 0
- 列表
- 看板
- 标记
- 里程碑
合并请求 0
DevOps
Wiki 1
- Wiki
分析
- 仓库
- DevOps
项目成员
Pages

前往新版Gitcode，体验更适合开发者的 AI 搜索 >>

查找文件 Blame 历史永久链接 Permalink

D

feature(davide): Implementation of D4PG (#76) · 16a89c35

由 Davide Liu 提交于 9月 30, 2021

* added experience replay and n-step

* implementing distributional q value

* added distributional q-value

* added overview in qac_dist and d4pg

* derived D4PG from DDPG

* fixed a bug when action shape >1

* benchmark D4PG mujoco + minor fixs

-entry for DDPG mujoco
-entry for D4PG mujoco
-config for D4PG mujoco
-fixed style D4PG code
-unittests for QAC distributional

* formatted code

* minor updates (read description)

-added d4pg seria_entry test
-updated comments in QACDIST
-added d4pg in commander register
-added q_value in d4pg return dict
-added priority update in d4pg entry
-added assertion in QACDIST

16a89c35

To learn more about this project, read the wiki.

README.md 22.7 KB