Add native depthwise_convolution op (forward pass).
The current depthwise_conv is very inefficient: for each input channel it calls slice() on the input and on the filters, runs a conv() on that channel, and finally concat()s the per-channel results.
Change: 115583330
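For illustration only, here is a minimal pure-Python sketch of the two strategies described above. It is not the kernel added by this change. The function names (`conv2d`, `depthwise_via_slice_conv_concat`, `depthwise_direct`) are hypothetical, and the sketch assumes a single image in [H][W][C] layout, filters in [KH][KW][C][M] layout, stride 1, and no padding.

```python
def conv2d(x, f):
    """Plain stride-1, unpadded convolution.

    x: [H][W][Cin] nested lists, f: [KH][KW][Cin][Cout] nested lists.
    """
    h, w, cin = len(x), len(x[0]), len(x[0][0])
    kh, kw, cout = len(f), len(f[0]), len(f[0][0][0])
    out = []
    for i in range(h - kh + 1):
        row = []
        for j in range(w - kw + 1):
            row.append([
                sum(x[i + di][j + dj][c] * f[di][dj][c][o]
                    for di in range(kh)
                    for dj in range(kw)
                    for c in range(cin))
                for o in range(cout)
            ])
        out.append(row)
    return out


def depthwise_via_slice_conv_concat(x, f):
    """Old strategy (sketch): slice each channel, convolve it, concat."""
    channels = len(x[0][0])
    per_channel = []
    for c in range(channels):
        # slice(): keep only channel c of the input -> [H][W][1]
        x_c = [[[px[c]] for px in row] for row in x]
        # slice(): keep only channel c of the filters -> [KH][KW][1][M]
        f_c = [[[tap[c]] for tap in frow] for frow in f]
        # conv(): one small convolution per input channel
        per_channel.append(conv2d(x_c, f_c))
    # concat(): join the per-channel outputs along the channel axis
    oh, ow = len(per_channel[0]), len(per_channel[0][0])
    return [[[v for pc in per_channel for v in pc[i][j]]
             for j in range(ow)]
            for i in range(oh)]


def depthwise_direct(x, f):
    """Single-pass strategy (sketch): write each output value directly."""
    h, w, channels = len(x), len(x[0]), len(x[0][0])
    kh, kw, mult = len(f), len(f[0]), len(f[0][0][0])
    oh, ow = h - kh + 1, w - kw + 1
    out = [[[0] * (channels * mult) for _ in range(ow)] for _ in range(oh)]
    for i in range(oh):
        for j in range(ow):
            for c in range(channels):
                for m in range(mult):
                    # Output channel c * mult + m matches the concat order.
                    out[i][j][c * mult + m] = sum(
                        x[i + di][j + dj][c] * f[di][dj][c][m]
                        for di in range(kh)
                        for dj in range(kw))
    return out
```

Both functions compute the same values. The first builds intermediate slices and per-channel outputs and then copies them again in the concat step, while the second fills the output in one pass with no temporaries, which is the motivation for a native op.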