OpenDILab (open-source decision intelligence platform) / DI-engine
Commit 8ad508dd
Authored Oct 14, 2021 by PaParaZz1
Deploying to gh-pages from @ 13222d5a47f6ee74c4ab8e98382d0c5528bcea9a 🚀
Parent: fc57bd02
Showing 1 changed file with 4 additions and 0 deletions
_modules/ding/policy/dqn.html (+4 -0)
...
...
@@ -355,11 +355,15 @@
        <span class="k">return</span> <span class="p">{</span>
            <span class="s1">'cur_lr'</span><span class="p">:</span> <span class="bp">self</span><span class="o">.</span><span class="n">_optimizer</span><span class="o">.</span><span class="n">defaults</span><span class="p">[</span><span class="s1">'lr'</span><span class="p">],</span>
            <span class="s1">'total_loss'</span><span class="p">:</span> <span class="n">loss</span><span class="o">.</span><span class="n">item</span><span class="p">(),</span>
            <span class="s1">'q_value'</span><span class="p">:</span> <span class="n">q_value</span><span class="o">.</span><span class="n">mean</span><span class="p">()</span><span class="o">.</span><span class="n">item</span><span class="p">(),</span>
            <span class="s1">'priority'</span><span class="p">:</span> <span class="n">td_error_per_sample</span><span class="o">.</span><span class="n">abs</span><span class="p">()</span><span class="o">.</span><span class="n">tolist</span><span class="p">(),</span>
            <span class="c1"># Only discrete action satisfying len(data['action'])==1 can return this and draw histogram on tensorboard.</span>
            <span class="c1"># '[histogram]action_distribution': data['action'],</span>
        <span class="p">}</span></div>

    <span class="k">def</span> <span class="nf">_monitor_vars_learn</span><span class="p">(</span><span class="bp">self</span><span class="p">)</span> <span class="o">-&gt;</span> <span class="n">List</span><span class="p">[</span><span class="nb">str</span><span class="p">]:</span>
        <span class="k">return</span> <span class="p">[</span><span class="s1">'cur_lr'</span><span class="p">,</span> <span class="s1">'total_loss'</span><span class="p">,</span> <span class="s1">'q_value'</span><span class="p">]</span>

<div class="viewcode-block" id="DQNPolicy._state_dict_learn"><a class="viewcode-back" href="../../../api_doc/policy/dqn.html#ding.policy.dqn.DQNPolicy._state_dict_learn">[docs]</a>    <span class="k">def</span> <span class="nf">_state_dict_learn</span><span class="p">(</span><span class="bp">self</span><span class="p">)</span> <span class="o">-&gt;</span> <span class="n">Dict</span><span class="p">[</span><span class="nb">str</span><span class="p">,</span> <span class="n">Any</span><span class="p">]:</span>
        <span class="sd">"""</span>
        <span class="sd">Overview:</span>
...
...
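The hunk shows a learn step that returns a dict of values (`cur_lr`, `total_loss`, `q_value`, `priority`) and a `_monitor_vars_learn` method that lists three of those keys. One plausible reading is that the list names the scalar entries to be logged, while `priority` (a per-sample list) is left out. The sketch below illustrates that reading only. `SketchPolicy` and `select_monitored` are hypothetical names invented for this example, the numeric values are made up, and none of this is taken from DI-engine's actual logging code.

```python
from typing import Any, Dict, List


class SketchPolicy:
    """Hypothetical stand-in for illustration; not DI-engine's DQNPolicy."""

    def _monitor_vars_learn(self) -> List[str]:
        # Same names as the list returned in the hunk above.
        return ['cur_lr', 'total_loss', 'q_value']


def select_monitored(policy: SketchPolicy, learn_output: Dict[str, Any]) -> Dict[str, Any]:
    """Keep only the entries whose keys the policy lists (assumed usage)."""
    names = policy._monitor_vars_learn()
    return {name: learn_output[name] for name in names if name in learn_output}


# Made-up values shaped like the dict returned in the hunk above.
learn_output = {
    'cur_lr': 0.001,
    'total_loss': 0.5,
    'q_value': 1.25,
    'priority': [0.1, 0.2],  # per-sample list, not in the monitored names
}
monitored = select_monitored(SketchPolicy(), learn_output)
```

Under this assumption, `monitored` holds only the three scalar entries, and `priority` stays available in the full dict for other uses.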