1. 09 Mar, 2020 1 commit
  2. 12 Feb, 2020 1 commit
    • Y
      [ARM] add int8 5x5s2 dw conv impl, test=develop (#2813) · 2af785b7
      Committed by yiicy
      * [ARM] add 5x5s2 depthwise conv armv8 impl, test=develop
      
      * [ARM] add int8 5x5s2 dw conv armv7 impl, test=develop
      
      * [ARM] add int8 5x5s2 dw conv impl, test=develop
      
      * [ARM] close int8 conv ut, test=develop
  3. 04 Feb, 2020 1 commit
    • Y
      [ARM] 5x5dw and sgemv support fuse activation, test=develop (#2797) · 42bbd157
      Committed by yiicy
      * refactor 5x5s1 dw conv armv8, test=develop
      
      * [ARM] refactor depthwise conv 5x5s1, and support relu6, leaky relu, test=develop
      
      * [ARM] sgemv support fuse relu6 and leaky relu, test=develop
      
      * [ARM] reduce some conv ut case, test=develop
      
      * [ARM] fix 5x5dw conv pick kernel bug, test=develop
      
      * fix code style, test=develop
      
      * [ARM] fix sgemv fuse relu6 bug, test=develop
      
      * [ARM] fix fp32 5x5s1 dw bug, test=develop
      
      * [ARM] fix fp32 5x5 dw conv pick kernel bug, test=develop
  4. 16 Jan, 2020 1 commit
  5. 08 Jan, 2020 1 commit
  6. 19 Dec, 2019 1 commit
  7. 17 Dec, 2019 1 commit
  8. 11 Dec, 2019 1 commit
  9. 03 Dec, 2019 1 commit
  10. 27 Nov, 2019 1 commit
    • T
      fix winograd reinitwhenneed (#2511) · d21a05dc
      Committed by TianXiaogang
      
      * add winograd c4 implement (#2494)
      test=develop
      fix: fix conv_block prepack_input_nxwc4 bug
      * fix: optimize sgemm_c4 in armv7
           change condition of choose winograd kernel
      * fix: change conv choose kernel condition
      test=develop
  11. 26 Nov, 2019 1 commit
    • T
      add winograd c4 implement (#2494) · 4476c4ec
      Committed by TianXiaogang
      fix: fix conv_block prepack_input_nxwc4 bug
      * fix: optimize sgemm_c4 in armv7
           change condition of choose winograd kernel
      * fix: change conv choose kernel condition
  12. 22 Nov, 2019 1 commit
    • H
      update conv 2-pad to 4-pad (#2404) · b3a5fc1a
      Committed by HappyAngel
      * fix conv 2-pad to 4-pad
      
      * fix compute conv shape
      
      * fix pad, test=develop
      
      * change conv_depthwise_3x3s1_fp.cc name to conv3x3s1p01_depthwise_fp32.cc to distinguish between conv3x3s1_depthwise_fp32.cc
      
      * delete printf note in conv3x3s1, test=develop
      
      * delete printf note, test=develop
      
      * delete gem_sdot.h, test=develop
      
      it is copied from __gemm_sdot_meta_.h
      
      * update compute padding, test=develop
      
      * fix padding size, must be 2 or 4. test=develop
      
      * fix format in operators/conv_op.cc, test=develop
      
      * change #if 0 to #if 1, test=develop
      
      * put 2-pad to 4-pad in AttachImpl, test=develop
      
      * fix clang-format error in tests/math/conv_compute_test, test=develop
      
      * fix x86 test result error, test=develop
      
      * add asymmetric padding test case in lite/tests/math/conv_compute.cc, test=develop
      
      * change paddings type to support dynamic modification, test=develop
      
      * fix x86 build error in conv_compute_test, test=develop
      
      * fix opencl build error, test=develop
      
      * fix opencl build error, test=develop
      
      * fix opencl/conv_compute build error, test=develop
      
      * fix opencl/conv_compute build error, test=develop
      
      * fix format in kernels/opencl/conv_compute_test, test=develop
      
      * fix build error, test=develop
      
      fix build error in kernels/x86/conv_compute.h
  13. 21 Oct, 2019 1 commit
  14. 09 Oct, 2019 1 commit
    • Y
      improve dw conv performance · 498a30cf
      Committed by yiicy
      * improve prepack_input func speed in int8 3x3s1 dw conv
      
      * fix code style
      
      * fix code style
      
      * improve 3x3s1 dw fp32 conv speed a little
      
      * arm add 5x5s1 int8 dw conv, test=develop
  15. 18 Sep, 2019 1 commit
    • X
      fix bias quantize error && fix clang build error (#2049) · 8d6f475e
      Committed by Xiaoyang LI
      * fix gemm_int8, gemv-int8 and conv-int8 math function, add float bias
      
      * change conv impl
      
      * neon int8 kernel support float bias
      
      * arm compute kernel support float bias
      
      * add math_test target
      
      * add tensor utils for testing, fix sgemm ut error
      
      * add gemm_int8 unit test, support float bias
      
      * fix build script
      
      * add conv compute unit test for arm
      
      * fix build script, test=develop
      
      * fix fp32 dw conv3x3s1, test=develop
      
      * add fp32 dw conv3x3s1, test=develop
      
      * add armv7 fp32 dw conv3x3s1, test=develop
      
      * add fp32 depthwise conv3x3s2, test=develop
      
      * fix fp32 conv3x3 depthwise build error, test=develop
      
      * fix gemm_like conv trans weights error, test=develop
      
      * fix int8 depthwise conv3x3 error, test=develop
      
      * turn on all test for arm fp32 conv, test=develop
      
      * fix int8 conv1x1 error
      
      * fix int8 direct conv3x3s1 error, test=develop
      
      * fix int8 direct conv3x3s2, test=develop
      
      * turn on all test for arm int8 conv, test=develop
      
      * fix int8 fc error, change mobilenetv1-int8 ground-truth result to fluid, test=develop
      
      * remove debug info, strip ut binary, test=develop
      
      * fix conv compute error, test=develop
      
      * change Init() to ReInitWhenNeeded(), test=develop
      
      * fix code style, test=develop
      
      * remove engine_test, test=develop
      
      * fix building server tests error, test=develop
      
      * fix sdot clang build error, test=develop
      
      * fix sgemm ut timeout error, test=develop
      
      * fix clang build error, test=develop
      
      * turn off math basic test due to ci time out, test=develop
      
      * fix conv_int8 ut error, test=develop
  16. 16 Aug, 2019 1 commit