speed up fp32 depthwise conv
* update conv_dw
* update
* add conv_depthwise_3x3s1.cc and conv_depthwise_3x3s2.cc
* add conv_depthwise_3x3s1_fp32 and conv_depthwise_3x3s2_fp32
* add new conv_dw
* only support conv_dw pad=0, 1
* add conv_dw_s1 conv_dw_s2 fp32
* //conv2_func _impl2{nullptr};
update conv_dw, add conv_3x3s1 and conv_3x3s2, pad=[0,1]
* fix format, test=develop
* fix format, test=develop