1. 05 3月, 2020 1 次提交
  2. 29 2月, 2020 1 次提交
  3. 28 2月, 2020 1 次提交
    • M
      add sequence_conv op and arm kernel (#3016) · 0775140a
      mapingshuo 提交于
      * add sequence_conv op and arm kernel
      
      * add test, test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * test=develop
      
      * modify code style. test=develop
      
      * fix ut, test=develop
      
      * delete unused code, test=develop
      0775140a
  4. 17 2月, 2020 1 次提交
  5. 24 1月, 2020 1 次提交
  6. 16 1月, 2020 1 次提交
  7. 15 1月, 2020 1 次提交
  8. 14 1月, 2020 3 次提交
  9. 13 1月, 2020 2 次提交
  10. 10 1月, 2020 1 次提交
  11. 09 1月, 2020 1 次提交
  12. 08 1月, 2020 1 次提交
  13. 07 1月, 2020 1 次提交
  14. 03 1月, 2020 1 次提交
  15. 31 12月, 2019 3 次提交
    • W
      X86 and cuda compile simutaneously cmake .. -DCMAKE_BUILD_TYPE=RelWithDebInfo... · f1cedb8f
      Wilber 提交于
      X86 and cuda compile simutaneously cmake ..  -DCMAKE_BUILD_TYPE=RelWithDebInfo  -DWITH_MKL=ON           -DLITE_WITH_CUDA=ON           -DWITH_MKLDNN=OFF           -DLITE_WITH_X86=ON           -DLITE_WITH_PROFILE=OFF          -DWITH_LITE=OFF           -DLITE_WITH_LIGHT_WEIGHT_FRAMEWORK=OFF           -DWITH_PYTHON=OFF           -DWITH_TESTING=ON           -DLITE_WITH_ARM=OFF           -DLITE_ON_TINY_PUBLISH=OFF           -DCUDNN_ROOT=/usr/local/cudnn/           -DLITE_BUILD_EXTRA=ON (#2708)
      
      x86 and cuda compile simutaneously
      f1cedb8f
    • Z
      [XPU] bn unit test (#2706) · bc6d5adc
      zhupengyang 提交于
      test=develop
      bc6d5adc
    • H
      [LITE][NPU][XPU] Refine the registration and implementation of op bridges (#2700) · a29c84a2
      hong19860320 提交于
      * Fix the compiling error which occurs when specify the ddk_root path and build for huawei NPU.
      
      * Refine the registration of op bridges and make it similar to the registration of op and kernel.
      
      * Refine the interfaces of the graph and node for op bridges, and support creating constant and data node automatically according to the attribute 'persistable' of the target tensor.
      
      * Add the unit test of the scale and softmax op bridge for NPU.
      a29c84a2
  16. 26 12月, 2019 1 次提交
  17. 24 12月, 2019 3 次提交
  18. 23 12月, 2019 1 次提交
  19. 21 12月, 2019 1 次提交
  20. 20 12月, 2019 2 次提交
  21. 13 12月, 2019 1 次提交
  22. 10 12月, 2019 1 次提交
  23. 07 12月, 2019 1 次提交
    • J
      Support mask_rcnn (#2484) · c2f72cb3
      juncaipeng 提交于
      * add arm split lod tensor, test=develop
      
      * add arm merge lod tensor, test=develop
      
      * update split merge lod tensor, test=develop
      
      * add reduce_prob op, test=develop
      
      * support mask_rcnn succeed, test=develop
      c2f72cb3
  24. 16 11月, 2019 1 次提交
  25. 12 11月, 2019 1 次提交
  26. 07 11月, 2019 1 次提交
  27. 28 10月, 2019 1 次提交
    • H
      [LITE][XPU] initial support for XPU (#2202) · 06d058fe
      hong19860320 提交于
      * Initial support for XPU
      * Fix compiling errors of XPU
      * Move XPU op kernel bridges from backends to kernels to fix deps order
      * Change the namespace and directory of XPU bridges
      * Add XPU SDK
      * Fix header files and namespace of XPU SDK
      * Add unit tests for relu and conv2d ops
      * Restore the modification of paddle_api_test
      * Supports simple model which contains only a relu layer
      * Add compiling scripts for XPU
      * Fix compiling errors of XPU
      * Add comments for XPU LoadModel and BuildModel
      06d058fe
  28. 18 9月, 2019 1 次提交
    • X
      fix bias quantize error && fix clang build error (#2049) · 81dffbe8
      Xiaoyang LI 提交于
      * fix gemm_int8, gemv-int8 and conv-int8 math function, add float bias
      
      * change conv impl
      
      * neon int8 kernel support float bias
      
      * arm compute kernel support float bias
      
      * add math_test target
      
      * add tensor utils for testing, fix sgemm ut error
      
      * add gemm_int8 unit test, support float bias
      
      * fix build script
      
      * add conv compute unit test for arm
      
      * fix build script, test=develop
      
      * fix fp32 dw conv3x3s1, test=develop
      
      * add fp32 dw conv3x3s1, test=develop
      
      * add armv7 fp32 dw conv3x3s1, test=develop
      
      * add fp32 depthwise conv3x3s2, test=develop
      
      * fix fp32 conv3x3 depthwise build error, test=develop
      
      * fix gemm_like conv trans weights error, test=develop
      
      * fix int8 depthwise conv3x3 error, test=develop
      
      * turn on all test for arm fp32 conv, test=develop
      
      * fix int8 conv1x1 error
      
      * fix int8 direct conv3x3s1 error, test=develop
      
      * fix int8 direct conv3x3s2, test=develop
      
      * turn on all test for arm int8 conv, test=develop
      
      * fix int8 fc error, change mobilenetv1-int8 ground-truth result to fluid, test=develop
      
      * remove debug info, strip ut binary, test=develop
      
      * fix conv compute error, test=develop
      
      * change Init() to ReInitWhenNeeded(), test=develop
      
      * fix code style, test=develop
      
      * remote engine_test, test=develop
      
      * fix building server tests error, test=develop
      
      * fix sdot clang build error, test=develop
      
      * fix sgemm ut timeout error, test=develop
      
      * fix clang build error, test=develop
      
      * turn off math basic test due to ci time out, test=develop
      
      * fix conv_int8 ut error, test=develop
      81dffbe8
  29. 12 9月, 2019 1 次提交
  30. 09 9月, 2019 1 次提交
  31. 04 9月, 2019 1 次提交
  32. 02 9月, 2019 1 次提交
    • J
      Add ops and fix bugs for Faster RCNN (#1942) · 635b4958
      juncaipeng 提交于
      * add ops for faster rcnn
      
      * disable test for generate_proposals and roi_align, test=develop
      
      * remove .swp file
      
      * remove log in tensor slice
      
      * finish the unit test for roi_align, test=develop
      
      * add box_clip op and fix tensor slice bug
      
      * remove add four op twice
      
      * rewrite the implement for box_coder and sequence_expand, add faster_rcnn_test, test=develop
      
      * fix test bug of box_clip in x86 server, test=develop
      635b4958