PaddlePaddle
PaddleClas
Commit b01a79ab (unverified)
Authored Jun 09, 2022 by littletomatodonkey
Committed by GitHub, Jun 09, 2022
fix swin (#2004)
Parent: 794af8c0
Showing 1 changed file with 31 additions and 10 deletions
ppcls/arch/backbone/legendary_models/swin_transformer.py
@@ -157,6 +157,7 @@ class WindowAttention(nn.Layer):
         relative_coords[:, :, 1] += self.window_size[1] - 1
         relative_coords[:, :, 0] *= 2 * self.window_size[1] - 1
         relative_position_index = relative_coords.sum(-1)  # Wh*Ww, Wh*Ww
+
         self.register_buffer("relative_position_index",
                              relative_position_index)
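For readers without the full file at hand, the index arithmetic in this hunk can be sketched in plain Python. This is an illustration only, not the PaddleClas implementation: the function name `relative_position_index` is hypothetical, and the shift of the row offset by `window_size[0] - 1` happens above the hunk and is assumed here from the standard Swin Transformer formulation.

```python
def relative_position_index(window_size):
    """Return an N x N table (N = Wh*Ww) of rows into the bias table.

    Hypothetical helper mirroring the arithmetic in the hunk above.
    """
    wh, ww = window_size
    # Flattened (row, col) coordinates of every token in the window.
    coords = [(h, w) for h in range(wh) for w in range(ww)]
    index = []
    for h_i, w_i in coords:
        row = []
        for h_j, w_j in coords:
            dh = h_i - h_j + (wh - 1)  # assumed: shifted to start from 0
            dw = w_i - w_j + (ww - 1)  # relative_coords[:, :, 1] += Ww - 1
            # relative_coords[:, :, 0] *= 2 * Ww - 1, then sum over last axis
            row.append(dh * (2 * ww - 1) + dw)
        index.append(row)
    return index


idx = relative_position_index((2, 2))
```

For a 2x2 window the offsets span a 3x3 grid, so every entry of `idx` falls in the range 0 to 8, and every diagonal entry (a token relative to itself) maps to the same row of the bias table.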
@@ -168,6 +169,23 @@ class WindowAttention(nn.Layer):
         trunc_normal_(self.relative_position_bias_table)
         self.softmax = nn.Softmax(axis=-1)

+    def eval(self, ):
+        # this is used to re-param swin for model export
+        relative_position_bias_table = self.relative_position_bias_table
+        window_size = self.window_size
+        index = self.relative_position_index.reshape([-1])
+
+        relative_position_bias = paddle.index_select(
+            relative_position_bias_table, index)
+        relative_position_bias = relative_position_bias.reshape([
+            window_size[0] * window_size[1], window_size[0] * window_size[1],
+            -1
+        ])  # Wh*Ww,Wh*Ww,nH
+        relative_position_bias = relative_position_bias.transpose(
+            [2, 0, 1])  # nH, Wh*Ww, Wh*Ww
+        relative_position_bias = relative_position_bias.unsqueeze(0)
+        self.register_buffer("relative_position_bias", relative_position_bias)
+
     def forward(self, x, mask=None):
         """
         Args:
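The gather, reshape, and transpose sequence in the new `eval` method may be easier to follow on a tiny example. The sketch below uses plain Python lists instead of Paddle tensors; `gather_relative_position_bias` and the toy values are hypothetical and only mirror the shapes named in the diff comments.

```python
def gather_relative_position_bias(bias_table, index, num_heads):
    """Build a [1, nH, N, N] nested list from a [rows, nH] bias table.

    `index` is an N x N table of row numbers into `bias_table`.
    Hypothetical helper; it stands in for index_select + reshape +
    transpose([2, 0, 1]) + unsqueeze(0) in the diff.
    """
    n = len(index)
    per_head = [
        [[bias_table[index[i][j]][h] for j in range(n)] for i in range(n)]
        for h in range(num_heads)
    ]
    return [per_head]  # leading axis of size 1 broadcasts over the batch


# Toy setup: a 1x2 window has N = 2 tokens and (2*1-1)*(2*2-1) = 3 offsets.
toy_table = [[10, 20], [11, 21], [12, 22]]  # 3 rows, 2 heads
toy_index = [[1, 0], [2, 1]]                # index table for a 1x2 window
bias = gather_relative_position_bias(toy_table, toy_index, num_heads=2)
```

Because the result depends only on the bias table and the fixed index buffer, it can be computed once ahead of inference rather than on every forward pass, which appears to be the point of the "re-param" comment in the diff.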
@@ -183,6 +201,7 @@ class WindowAttention(nn.Layer):
         q = q * self.scale
         attn = paddle.mm(q, k.transpose([0, 1, 3, 2]))

+        if self.training or not hasattr(self, "relative_position_bias"):
             index = self.relative_position_index.reshape([-1])

             relative_position_bias = paddle.index_select(
@@ -195,6 +214,8 @@ class WindowAttention(nn.Layer):
             relative_position_bias = relative_position_bias.transpose(
                 [2, 0, 1])  # nH, Wh*Ww, Wh*Ww
             attn = attn + relative_position_bias.unsqueeze(0)
+        else:
+            attn = attn + self.relative_position_bias

         if mask is not None:
             nW = mask.shape[0]
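The branch added to `forward` follows a compute-or-reuse pattern: recompute while training or while no cached value exists, otherwise reuse what `eval` stored. A minimal framework-free sketch of that pattern follows. The class `CachedBias`, its attribute names, and the doubling "computation" are all hypothetical, and the toy `eval` also flips `training`, which the diff's `eval` does not show.

```python
class CachedBias:
    """Toy stand-in for the compute-or-reuse branch (not the real layer)."""

    def __init__(self, table):
        self.table = table
        self.training = True
        self.compute_calls = 0

    def _compute(self):
        # Placeholder for the index_select/reshape/transpose gather.
        self.compute_calls += 1
        return [2 * v for v in self.table]

    def eval(self):
        # Assumption of this sketch: switch mode and cache in one step.
        self.training = False
        self.cached = self._compute()
        return self

    def forward(self):
        if self.training or not hasattr(self, "cached"):
            return self._compute()  # fresh value tracks table updates
        return self.cached          # reused across inference calls


m = CachedBias([1, 2])
first = m.forward()          # training mode: computed on the fly
m.eval()
a, b = m.forward(), m.forward()
calls_after_eval = m.compute_calls
m.table = [5, 5]             # table changed after caching
stale = m.forward()
refreshed = m.eval().forward()
```

In this sketch the cached value goes stale if the table changes after `eval()`, so `eval()` has to be called again to refresh it. Whether the same ordering concern applies to the real layer (for example when weights are loaded after switching to eval mode) is worth checking against the full source.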