“a873fa84ceca411a5a776ff8ae303f8be24df95a”上不存在“paddle/fluid/operators/collective/c_reduce_sum_op.cu.cc”
  • Y
    improve dw conv performance · 4b9df8fb
    yiicy 提交于
    *  imporve prepack_input func speed in int8 3x3s1 dw conv
    
    * fix code style
    
    * fix code style
    
    * improve 3x3s1 dw fp32 conv speed a little
    
    * arm add 5x5s1 int8 dw conv, test=develop
    4b9df8fb
conv3x3s2_depthwise_int8.cc 21.7 KB