Skip to content
体验新版
项目
组织
正在加载...
登录
切换导航
打开侧边栏
MegEngine 天元
MegEngine
提交
4f28e146
MegEngine
项目概览
MegEngine 天元
/
MegEngine
1 年多 前同步成功
通知
404
Star
4705
Fork
582
代码
文件
提交
分支
Tags
贡献者
分支图
Diff
Issue
0
列表
看板
标记
里程碑
合并请求
0
DevOps
流水线
流水线任务
计划
Wiki
0
Wiki
分析
仓库
DevOps
项目成员
Pages
MegEngine
项目概览
项目概览
详情
发布
仓库
仓库
文件
提交
分支
标签
贡献者
分支图
比较
Issue
0
Issue
0
列表
看板
标记
里程碑
合并请求
0
合并请求
0
Pages
DevOps
DevOps
流水线
流水线任务
计划
分析
分析
仓库分析
DevOps
Wiki
0
Wiki
成员
成员
收起侧边栏
关闭侧边栏
动态
分支图
创建新Issue
流水线任务
提交
Issue看板
提交
4f28e146
编写于
9月 18, 2021
作者:
M
Megvii Engine Team
浏览文件
操作
浏览文件
下载
电子邮件补丁
差异文件
fix(dnn): fix compatibility broken of convolution format
GitOrigin-RevId: c17f02d03daa8c1ed42430fabcc375690ff65b03
上级
f642b05e
变更
1
隐藏空白更改
内联
并排
Showing
1 changed file
with
13 addition
and
13 deletion
+13
-13
dnn/scripts/opr_param_defs.py
dnn/scripts/opr_param_defs.py
+13
-13
未找到文件。
dnn/scripts/opr_param_defs.py
浏览文件 @
4f28e146
...
...
@@ -43,15 +43,15 @@ pdef('Axis').add_fields('int32', 'axis', 0)
Doc
(
'NCHW4_NCHW32 = 12'
,
'NCHW4_NCHW32 means input tensors are nchw4 layout, output tensor is nchw32 layout'
),
Doc
(
'NCHW32_NCHW4 = 13'
,
'NCHW32_NCHW4 means input tensors are nchw32 layout, output tensor is nchw4 layout'
),
Doc
(
'NCHW4_NCHW = 14'
,
'NCHW4_NCHW means input tensors are nchw4 layout, output tensor is nchw layout'
),
Doc
(
'NCHW4_NHWC = 15'
,
'NCHW4_NHWC means input tensors are nchw4 layout, output tensor is nhwc layout'
),
Doc
(
'NHWC_NCHW = 16'
,
'NHWC_NCHW means input tensors are nhwc layout, '
Doc
(
'NHWC_NCHW = 15'
,
'NHWC_NCHW means input tensors are nhwc layout, '
'output tensor is nchw layout'
),
Doc
(
'NHWC_NCHW4_IC_SMALL = 1
7
'
,
'NHWC_NCHW4_IC_SMALL means input tensors are nhwc(c < 4) layout, '
Doc
(
'NHWC_NCHW4_IC_SMALL = 1
6
'
,
'NHWC_NCHW4_IC_SMALL means input tensors are nhwc(c < 4) layout, '
'output tensor is nchw4 layout, padding c=4'
),
Doc
(
'NCHW_NCHW4_IC_SMALL = 1
8
'
,
'NCHW_NCHW4_IC_SMALL means input tensors are nchw(c < 4) layout, '
Doc
(
'NCHW_NCHW4_IC_SMALL = 1
7
'
,
'NCHW_NCHW4_IC_SMALL means input tensors are nchw(c < 4) layout, '
'output tensor is nchw4 layout, padding c=4'
),
Doc
(
'CHWN4 = 19'
,
'CHWN4 is currently only used on Nvidia platform for fast implementation '
'of convolution using CUDA/SASS. The channels are splitted to groups of 4 channels.'
))
Doc
(
'CHWN4 = 18'
,
'CHWN4 is currently only used on Nvidia platform for fast implementation '
'of convolution using CUDA/SASS. The channels are splitted to groups of 4 channels.'
),
Doc
(
'NCHW4_NHWC = 19'
,
'NCHW4_NHWC means input tensors are nchw4 layout, output tensor is nhwc layout'
))
)
(
pdef
(
'Convolution'
,
version
=
1
,
is_legacy
=
True
).
...
...
@@ -100,17 +100,17 @@ pdef('Axis').add_fields('int32', 'axis', 0)
Doc
(
'NCHW4_NCHW32 = 9'
,
'NCHW4_NCHW32 means input tensors are nchw4 layout, output tensor is nchw32 layout'
),
Doc
(
'NCHW32_NCHW4 = 10'
,
'NCHW32_NCHW4 means input tensors are nchw32 layout, output tensor is nchw4 layout'
),
Doc
(
'NCHW4_NCHW = 11'
,
'NCHW4_NCHW means input tensors are nchw4 layout, output tensor is nchw layout'
),
Doc
(
'NCHW4_NHWC = 12'
,
'NCHW4_NHWC means input tensors are nchw4 layout, output tensor is nhwc layout'
),
Doc
(
'NHWC_NCHW = 13'
,
'NHWC_NCHW means input tensors are nhwc layout, '
Doc
(
'NHWC_NCHW = 12'
,
'NHWC_NCHW means input tensors are nhwc layout, '
'output tensor is nchw layout'
),
Doc
(
'NHWC_NCHW4_IC_SMALL = 1
4
'
,
'NHWC_NCHW4_IC_SMALL means input tensors are nhwc(c < 4) layout, '
Doc
(
'NHWC_NCHW4_IC_SMALL = 1
3
'
,
'NHWC_NCHW4_IC_SMALL means input tensors are nhwc(c < 4) layout, '
'output tensor is nchw4 layout, padding c=4'
),
Doc
(
'NCHW_NCHW4_IC_SMALL = 1
5
'
,
'NCHW_NCHW4_IC_SMALL means input tensors are nchw(c < 4) layout, '
Doc
(
'NCHW_NCHW4_IC_SMALL = 1
4
'
,
'NCHW_NCHW4_IC_SMALL means input tensors are nchw(c < 4) layout, '
'output tensor is nchw4 layout, padding c=4'
),
Doc
(
'CHWN4 = 1
6
'
,
'CHWN4 is currently only used on Nvidia platform for fast implementation '
Doc
(
'CHWN4 = 1
5
'
,
'CHWN4 is currently only used on Nvidia platform for fast implementation '
'of convolution using CUDA/SASS. The channels are splitted to groups of 4 channels.'
),
Doc
(
'NCHW64 = 17'
,
'NCHW64 is designed for convolution implementation to utilizing TensorCore '
'instructions for 4-bit integers on Nvidia platforms'
)).
Doc
(
'NCHW64 = 16'
,
'NCHW64 is designed for convolution implementation to utilizing TensorCore '
'instructions for 4-bit integers on Nvidia platforms'
),
Doc
(
'NCHW4_NHWC = 17'
,
'NCHW4_NHWC means input tensors are nchw4 layout, output tensor is nhwc layout'
)).
add_enum_alias
(
'ComputeMode'
,
'ConvolutionV1'
,
name_field
=
'compute_mode'
)
)
...
...
编辑
预览
Markdown
is supported
0%
请重试
或
添加新附件
.
添加附件
取消
You are about to add
0
people
to the discussion. Proceed with caution.
先完成此消息的编辑!
取消
想要评论请
注册
或
登录