Forked from PaddlePaddle / Paddle
Commit 2ad34dc8
Authored Jan 08, 2018 by Travis CI
Deploy to GitHub Pages: 691b5cac
Parent: 12c0344e
Showing 4 changed files with 16 additions and 10 deletions (+16 -10)
develop/doc/api/v2/fluid/layers.html (+7 -4)
develop/doc/searchindex.js (+1 -1)
develop/doc_cn/api/v2/fluid/layers.html (+7 -4)
develop/doc_cn/searchindex.js (+1 -1)
develop/doc/api/v2/fluid/layers.html
...
...
@@ -2020,15 +2020,18 @@ explain how sequence_expand works:</p>
 <dd><p>GRU unit layer. The equation of a gru step is:</p>
 <blockquote>
 <div><div class="math">
-\[ \begin{align}\begin{aligned}u_t &= actGate(xu_{t} + W_u h_{t-1} + b_u)\\r_t &= actGate(xr_{t} + W_r h_{t-1} + b_r)\\ch_t &= actNode(xc_t + W_c dot(r_t, h_{t-1}) + b_c)\\h_t &= dot((1-u_t), ch_{t-1}) + dot(u_t, h_t)\end{aligned}\end{align} \]</div>
+\[ \begin{align}\begin{aligned}u_t &= actGate(xu_{t} + W_u h_{t-1} + b_u)\\r_t &= actGate(xr_{t} + W_r h_{t-1} + b_r)\\m_t &= actNode(xm_t + W_c dot(r_t, h_{t-1}) + b_m)\\h_t &= dot((1-u_t), m_t) + dot(u_t, h_{t-1})\end{aligned}\end{align} \]</div>
 </div></blockquote>
 <p>The inputs of gru unit includes <span class="math">\(z_t\)</span>, <span class="math">\(h_{t-1}\)</span>. In terms
 of the equation above, the <span class="math">\(z_t\)</span> is split into 3 parts -
-<span class="math">\(xu_t\)</span>, <span class="math">\(xr_t\)</span> and <span class="math">\(xc_t\)</span>. This means that in order to
+<span class="math">\(xu_t\)</span>, <span class="math">\(xr_t\)</span> and <span class="math">\(xm_t\)</span>. This means that in order to
 implement a full GRU unit operator for an input, a fully
 connected layer has to be applied, such that <span class="math">\(z_t = W_{fc}x_t\)</span>.</p>
-<p>This layer has three outputs <span class="math">\(h_t\)</span>, <span class="math">\(dot(r_t, h_{t - 1})\)</span>
-and concatenation of <span class="math">\(u_t\)</span>, <span class="math">\(r_t\)</span> and <span class="math">\(ch_t\)</span>.</p>
+<p>The terms <span class="math">\(u_t\)</span> and <span class="math">\(r_t\)</span> represent the update and reset gates
+of the GRU cell. Unlike LSTM, GRU has one lesser gate. However, there is
+an intermediate candidate hidden output, which is denoted by <span class="math">\(m_t\)</span>.
+This layer has three outputs <span class="math">\(h_t\)</span>, <span class="math">\(dot(r_t, h_{t-1})\)</span>
+and concatenation of <span class="math">\(u_t\)</span>, <span class="math">\(r_t\)</span> and <span class="math">\(m_t\)</span>.</p>
 <table class="docutils field-list" frame="void" rules="none">
 <col class="field-name" />
 <col class="field-body" />
...
...
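Reviewer note: the hunk above shows two versions of the GRU step. The version that writes the candidate state as m_t can be sketched in NumPy as below, to make the data flow concrete. This is only an illustration under stated assumptions, not the PaddlePaddle implementation: `gru_unit` and `sigmoid` are hypothetical helper names, `actGate` is assumed to be the sigmoid, `actNode` is assumed to be tanh, `dot` is read as an element-wise product, and a row-vector convention (`h_prev @ W`) is used, so the weight matrices are the transposes of the ones written in the equations.

```python
import numpy as np


def sigmoid(x):
    """Logistic function, assumed here as the gate activation actGate."""
    return 1.0 / (1.0 + np.exp(-x))


def gru_unit(z_t, h_prev, W_u, W_r, W_c, b_u, b_r, b_m):
    """One GRU step (hypothetical helper, not the Paddle API).

    z_t:    (batch, 3 * hidden) projected input, z_t = x_t @ W_fc
    h_prev: (batch, hidden) previous hidden state h_{t-1}
    """
    # Split the projected input into its three parts xu_t, xr_t, xm_t.
    xu_t, xr_t, xm_t = np.split(z_t, 3, axis=-1)

    u_t = sigmoid(xu_t + h_prev @ W_u + b_u)    # update gate
    r_t = sigmoid(xr_t + h_prev @ W_r + b_r)    # reset gate
    reset_h = r_t * h_prev                      # dot(r_t, h_{t-1}), element-wise
    m_t = np.tanh(xm_t + reset_h @ W_c + b_m)   # candidate hidden output
    h_t = (1.0 - u_t) * m_t + u_t * h_prev      # new hidden state

    # The three outputs named in the docstring: h_t, dot(r_t, h_{t-1}),
    # and the concatenation of u_t, r_t and m_t.
    gate = np.concatenate([u_t, r_t, m_t], axis=-1)
    return h_t, reset_h, gate


# Usage with made-up sizes: the fully connected projection z_t = W_fc x_t
# is applied before the GRU unit, as the docstring requires.
rng = np.random.default_rng(0)
batch, in_dim, hidden = 2, 5, 4
x_t = rng.standard_normal((batch, in_dim))
W_fc = rng.standard_normal((in_dim, 3 * hidden))
z_t = x_t @ W_fc
h_prev = np.zeros((batch, hidden))
W_u, W_r, W_c = (rng.standard_normal((hidden, hidden)) for _ in range(3))
b_u, b_r, b_m = (np.zeros(hidden) for _ in range(3))

h_t, reset_h, gate = gru_unit(z_t, h_prev, W_u, W_r, W_c, b_u, b_r, b_m)
print(h_t.shape, reset_h.shape, gate.shape)  # (2, 4) (2, 4) (2, 12)
```

With u_t close to 1 the unit keeps h_{t-1}, and with u_t close to 0 it takes the candidate m_t.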
develop/doc/searchindex.js
The source diff is too large to display. You can view the blob instead.
develop/doc_cn/api/v2/fluid/layers.html
...
...
@@ -2033,15 +2033,18 @@ explain how sequence_expand works:</p>
 <dd><p>GRU unit layer. The equation of a gru step is:</p>
 <blockquote>
 <div><div class="math">
-\[ \begin{align}\begin{aligned}u_t &= actGate(xu_{t} + W_u h_{t-1} + b_u)\\r_t &= actGate(xr_{t} + W_r h_{t-1} + b_r)\\ch_t &= actNode(xc_t + W_c dot(r_t, h_{t-1}) + b_c)\\h_t &= dot((1-u_t), ch_{t-1}) + dot(u_t, h_t)\end{aligned}\end{align} \]</div>
+\[ \begin{align}\begin{aligned}u_t &= actGate(xu_{t} + W_u h_{t-1} + b_u)\\r_t &= actGate(xr_{t} + W_r h_{t-1} + b_r)\\m_t &= actNode(xm_t + W_c dot(r_t, h_{t-1}) + b_m)\\h_t &= dot((1-u_t), m_t) + dot(u_t, h_{t-1})\end{aligned}\end{align} \]</div>
 </div></blockquote>
 <p>The inputs of gru unit includes <span class="math">\(z_t\)</span>, <span class="math">\(h_{t-1}\)</span>. In terms
 of the equation above, the <span class="math">\(z_t\)</span> is split into 3 parts -
-<span class="math">\(xu_t\)</span>, <span class="math">\(xr_t\)</span> and <span class="math">\(xc_t\)</span>. This means that in order to
+<span class="math">\(xu_t\)</span>, <span class="math">\(xr_t\)</span> and <span class="math">\(xm_t\)</span>. This means that in order to
 implement a full GRU unit operator for an input, a fully
 connected layer has to be applied, such that <span class="math">\(z_t = W_{fc}x_t\)</span>.</p>
-<p>This layer has three outputs <span class="math">\(h_t\)</span>, <span class="math">\(dot(r_t, h_{t - 1})\)</span>
-and concatenation of <span class="math">\(u_t\)</span>, <span class="math">\(r_t\)</span> and <span class="math">\(ch_t\)</span>.</p>
+<p>The terms <span class="math">\(u_t\)</span> and <span class="math">\(r_t\)</span> represent the update and reset gates
+of the GRU cell. Unlike LSTM, GRU has one lesser gate. However, there is
+an intermediate candidate hidden output, which is denoted by <span class="math">\(m_t\)</span>.
+This layer has three outputs <span class="math">\(h_t\)</span>, <span class="math">\(dot(r_t, h_{t-1})\)</span>
+and concatenation of <span class="math">\(u_t\)</span>, <span class="math">\(r_t\)</span> and <span class="math">\(m_t\)</span>.</p>
 <table class="docutils field-list" frame="void" rules="none">
 <col class="field-name" />
 <col class="field-body" />
...
...
develop/doc_cn/searchindex.js
This diff is collapsed.