1. 27 11月, 2019 1 次提交
  2. 25 11月, 2019 1 次提交
  3. 22 11月, 2019 2 次提交
    • H
      update conv 2-pad to 4-pad (#2404) · 820eb6d4
      HappyAngel 提交于
      * fix conv 2-pad to 4-pad
      
      * fix compute conv shape
      
      * fix pad, test=develop
      
      * change conv_depthwise_3x3s1_fp.cc name to conv3x3s1p01_depthwise_fp32.cc to distinguish between conv3x3s1_depthwise_fp32.cc
      
      * delete printf note in conv3x3s1, test=develop
      
      * delete printf note, test=develop
      
      * delete gem_sdot.h, test=develop
      
      it is coped from __gemm_sdot_meta_.h
      
      * update compute padding, test=develop
      
      * fix padding size, must be 2 or 4. test=develop
      
      * fix format in operators/conv_op.cc, test=develop
      
      * change #if 0 to #if 1, test=develop
      
      * put 2-pad to 4-pad in AttachImpl, test=develop
      
      * fix clang-format error inn tests/math/connv_compute_test, test=develop
      
      * fix x86 test result error, test=develop
      
      * add asymmetric padding test case in liite/tests/math/conv_compute.cc, test=develop
      
      * change paddings type to support dynamically modify, test=develop
      
      * fix x86 build error in connv_compute_test, test=develop
      
      * fix opencl build error, test=develop
      
      * fix oopencl build error, test=develop
      
      * fix  opencl/conv_compute build error, test=develop
      
      * fix  opencl/conv_compute build error, test=develop
      
      * fix format in kernels/opencl/conv_computte_ttest,test=develop
      
      * fix build error, test=develop
      
      fix build error in kernels/x86/conv_compute.h
      820eb6d4
    • H
      update pooling 2-padding to 4-padding (#2410) · a7f7d49b
      HappyAngel 提交于
      * fix pooling bug and speed
      
      * fix build error
      
      * delete VLOGin pool, test=develop
      
      * add openmp, test=develop
      
      * fix lite/kernels/arm/pool_compute_test basic_pooling compute error bug, test=develop
      
      * update pooling 2-pad to 4-pad, test=develop
      
      * fix 2-pad to 4-pad in operators/pool_op.h, AttachKernel will set param, so 2-pad to 4-pad funcs should put in AttachKernel. test=ddevellop
      
      * put 2-pad to 4-pad in AttachImpl, test=develop
      
      * according to reviews, fix some format error. test=develop
      
      * fix format errorr, add (). test=develop
      
      * change paddings type to support dynamically modify, test=develop
      
      * update padding type int other devices, test=develop
      
      * fix x8d build error on shared_ptr, test=ddevelop
      
      * fix formmat in operators pool_op.cc, test=develop
      a7f7d49b
  4. 19 11月, 2019 3 次提交
  5. 18 11月, 2019 2 次提交
  6. 16 11月, 2019 1 次提交
  7. 15 11月, 2019 1 次提交
    • J
      Add content-dnn ops (#2429) · 603b810f
      juncaipeng 提交于
      * add search_seq_depadding x86 and cuda
      * add match_matrix_tensor x86
      * add search_grnn x86, no test
      603b810f
  8. 14 11月, 2019 2 次提交
  9. 13 11月, 2019 4 次提交
  10. 12 11月, 2019 1 次提交
  11. 06 11月, 2019 2 次提交
  12. 05 11月, 2019 1 次提交
  13. 28 10月, 2019 1 次提交
    • H
      [LITE][XPU] initial support for XPU (#2202) · 06d058fe
      hong19860320 提交于
      * Initial support for XPU
      * Fix compiling errors of XPU
      * Move XPU op kernel bridges from backends to kernels to fix deps order
      * Change the namespace and directory of XPU bridges
      * Add XPU SDK
      * Fix header files and namespace of XPU SDK
      * Add unit tests for relu and conv2d ops
      * Restore the modification of paddle_api_test
      * Supports simple model which contains only a relu layer
      * Add compiling scripts for XPU
      * Fix compiling errors of XPU
      * Add comments for XPU LoadModel and BuildModel
      06d058fe
  14. 23 10月, 2019 1 次提交
  15. 22 10月, 2019 2 次提交
    • J
      Optimize quant_dequant (#2215) · f480d474
      juncaipeng 提交于
      * Add DeleteQuantOpFuser
      * Add fake_quantize_dequantize_moving_avg_abs_max_op
      * Add DeleteQuantDequantOpFuser
      f480d474
    • T
      Transformer pr (#2214) · f0a6c1eb
      TianXiaogang 提交于
      * feat: add beam_search_special function for support nlp model
      
      * fix: add beam_search_compute kernel input and output
      
      * feat: add assign op & copy_compute kernel
      
      * feat: add fill_const_batch_size_like op & kernel
      
      * feat: add layer_norm op and kernel and ut
      
      * fix: fix some bugs
          fix mul_op infer_shape bug when x_dim_idx = 2, x_dims.size()=3 & y_dim_idx = 1, y_dims.size()=2
          fix elementwise_compute bug when y axis is all 1
          fix beam_search choose math_func wrong bug
          fix layer_norm get attr bug
          fix fill_constant_batch_size_like shape_set bug
      
      * feat: add gather op and kernel & and transform ut
      
      * feats: add ops and fix bugs to support transformer op
             fix type_cast passes to skip `while`
             fix elementwise infer_shape bug when x.dims=3 and y.dims={1} & axis=0
             fix lookup_table compute bug
             fix read_from_array/beam_search/increment/compate/gather ops data_type problems
      
      * fix:
          transfomer ut add word read inferface
          fix copy/gather/norm/layer_norm include path problem
      
      * fix:debug info
      
      * fix: fix input reshape bug
      
      * fix: fix norm bug
      
      * style: style fix & test=develop
      
      * style: fix operators cmakelist
      
      * style: fix operators cmakelist; test=develop
      
      * fix and test=develop
      
      * fix and test=develop
      
      * style: style fix; test=develop
      f0a6c1eb
  16. 17 10月, 2019 1 次提交
  17. 15 10月, 2019 1 次提交
    • H
      [NPU] Fix and refine the supporting of multi NPU models (#2037) · 7a731b7f
      hong19860320 提交于
      * [NPU] Fix the bug of loading multi NPU models
      test=develop
      
      * [NPU] Use lite tensor to store NPU model, fix the management of multi NPU models, support loading NPU model from memory and reduce the modification of framework
      test=develop
      
      * [NPU] Remove redundant header files for NPU bridges,
      test=develop
      
      * [NPU] fix NPU deps
      test=develop
      
      * [NPU] refine the compiling script for NPU
      test=develop
      
      * [NPU] remove redundant subdirectory in lite/CMakeLists.txt
      test=develop
      
      * [NPU] Fix and refine NPU test case
      test=develop
      
      * [NPU] revoke the modification of other non-NPU modules
      test=develop
      
      * [NPU] Remove NPU bridges if target is tiny publish
      test=develop
      7a731b7f
  18. 14 10月, 2019 3 次提交
  19. 11 10月, 2019 1 次提交
    • Z
      CUDA: can run yolov3 int8 (#2172) · 7931104f
      Zhaolong Xing 提交于
      * add conv int8 support(in condition which the input or output channel not be the times of 4)
      add add_kernel for cuda.
      
      * can run yolov3 fp32
      test=develop
      
      * 1. fix bug with yolov3 run
      test=develop
      
      * can run yolov3 int8 test=develop
      7931104f
  20. 18 9月, 2019 1 次提交
    • X
      fix bias quantize error && fix clang build error (#2049) · 81dffbe8
      Xiaoyang LI 提交于
      * fix gemm_int8, gemv-int8 and conv-int8 math function, add float bias
      
      * change conv impl
      
      * neon int8 kernel support float bias
      
      * arm compute kernel support float bias
      
      * add math_test target
      
      * add tensor utils for testing, fix sgemm ut error
      
      * add gemm_int8 unit test, support float bias
      
      * fix build script
      
      * add conv compute unit test for arm
      
      * fix build script, test=develop
      
      * fix fp32 dw conv3x3s1, test=develop
      
      * add fp32 dw conv3x3s1, test=develop
      
      * add armv7 fp32 dw conv3x3s1, test=develop
      
      * add fp32 depthwise conv3x3s2, test=develop
      
      * fix fp32 conv3x3 depthwise build error, test=develop
      
      * fix gemm_like conv trans weights error, test=develop
      
      * fix int8 depthwise conv3x3 error, test=develop
      
      * turn on all test for arm fp32 conv, test=develop
      
      * fix int8 conv1x1 error
      
      * fix int8 direct conv3x3s1 error, test=develop
      
      * fix int8 direct conv3x3s2, test=develop
      
      * turn on all test for arm int8 conv, test=develop
      
      * fix int8 fc error, change mobilenetv1-int8 ground-truth result to fluid, test=develop
      
      * remove debug info, strip ut binary, test=develop
      
      * fix conv compute error, test=develop
      
      * change Init() to ReInitWhenNeeded(), test=develop
      
      * fix code style, test=develop
      
      * remote engine_test, test=develop
      
      * fix building server tests error, test=develop
      
      * fix sdot clang build error, test=develop
      
      * fix sgemm ut timeout error, test=develop
      
      * fix clang build error, test=develop
      
      * turn off math basic test due to ci time out, test=develop
      
      * fix conv_int8 ut error, test=develop
      81dffbe8
  21. 17 9月, 2019 2 次提交
  22. 16 9月, 2019 1 次提交
    • L
      Gru op (#2002) · eb42f9ee
      lhl960107 提交于
      * add x86 gru&&relu&&sequence_expand_as op test=develop
      eb42f9ee
  23. 12 9月, 2019 2 次提交
  24. 10 9月, 2019 1 次提交
  25. 09 9月, 2019 1 次提交
  26. 07 9月, 2019 1 次提交