“e5281b3c2d14fdd0cc515268307e29521eb40305”上不存在“paddle/phi/kernels/mode_kernel.h”
* refine reduce by cub * optimize KernelDepthwiseConvFilterGrad * optimize depthwise conv and reduce mean and reduce sum * fix bug: dilation * cuda arch and cuda 8 compatible