1. 03 1月, 2020 1 次提交
  2. 31 12月, 2019 3 次提交
    • W
      X86 and cuda compile simutaneously cmake .. -DCMAKE_BUILD_TYPE=RelWithDebInfo... · f1cedb8f
      Wilber 提交于
      X86 and cuda compile simutaneously cmake ..  -DCMAKE_BUILD_TYPE=RelWithDebInfo  -DWITH_MKL=ON           -DLITE_WITH_CUDA=ON           -DWITH_MKLDNN=OFF           -DLITE_WITH_X86=ON           -DLITE_WITH_PROFILE=OFF          -DWITH_LITE=OFF           -DLITE_WITH_LIGHT_WEIGHT_FRAMEWORK=OFF           -DWITH_PYTHON=OFF           -DWITH_TESTING=ON           -DLITE_WITH_ARM=OFF           -DLITE_ON_TINY_PUBLISH=OFF           -DCUDNN_ROOT=/usr/local/cudnn/           -DLITE_BUILD_EXTRA=ON (#2708)
      
      x86 and cuda compile simutaneously
      f1cedb8f
    • Z
      [XPU] bn unit test (#2706) · bc6d5adc
      zhupengyang 提交于
      test=develop
      bc6d5adc
    • H
      [LITE][NPU][XPU] Refine the registration and implementation of op bridges (#2700) · a29c84a2
      hong19860320 提交于
      * Fix the compiling error which occurs when specify the ddk_root path and build for huawei NPU.
      
      * Refine the registration of op bridges and make it similar to the registration of op and kernel.
      
      * Refine the interfaces of the graph and node for op bridges, and support creating constant and data node automatically according to the attribute 'persistable' of the target tensor.
      
      * Add the unit test of the scale and softmax op bridge for NPU.
      a29c84a2
  3. 26 12月, 2019 1 次提交
  4. 25 12月, 2019 2 次提交
    • J
      fix op inputs and outputs type (#2647) · 168ce9a9
      juncaipeng 提交于
      * fix op inputs and outputs type, test=develop
      168ce9a9
    • Y
      [X86] Polish the implementation of fc and imporve the unittest (#2656) · 28481458
      Yiqun Liu 提交于
      * Remove GEMM padding in fc_compute.
      test=develop
      
      * Write a common ParallelFor function to run the for loop in parallel.
      
      * Add the codes of padding GEMM back in fc.
      
      * Refine the code of fc when padding_weight is false to avoid the definition of temporary Tensor.
      
      * Refine the unit test of fc and add testing case of padding and parallel.
      test=develop
      
      * Enable more test cases in common fc unittest, including padding and parallel for x86 target.
      
      * Remove the fc test under kernels/x86.
      test=develop
      
      * Disable relu in test of fc for non-x86 target.
      test=develop
      
      * Change the eps of arm.
      test=develop
      28481458
  5. 24 12月, 2019 5 次提交
  6. 23 12月, 2019 1 次提交
  7. 21 12月, 2019 1 次提交
  8. 20 12月, 2019 2 次提交
  9. 18 12月, 2019 1 次提交
  10. 13 12月, 2019 1 次提交
  11. 10 12月, 2019 1 次提交
  12. 07 12月, 2019 1 次提交
    • J
      Support mask_rcnn (#2484) · c2f72cb3
      juncaipeng 提交于
      * add arm split lod tensor, test=develop
      
      * add arm merge lod tensor, test=develop
      
      * update split merge lod tensor, test=develop
      
      * add reduce_prob op, test=develop
      
      * support mask_rcnn succeed, test=develop
      c2f72cb3
  13. 28 11月, 2019 1 次提交
  14. 27 11月, 2019 1 次提交
  15. 19 11月, 2019 1 次提交
  16. 16 11月, 2019 1 次提交
  17. 13 11月, 2019 2 次提交
  18. 12 11月, 2019 1 次提交
  19. 07 11月, 2019 1 次提交
  20. 06 11月, 2019 2 次提交
  21. 28 10月, 2019 1 次提交
    • H
      [LITE][XPU] initial support for XPU (#2202) · 06d058fe
      hong19860320 提交于
      * Initial support for XPU
      * Fix compiling errors of XPU
      * Move XPU op kernel bridges from backends to kernels to fix deps order
      * Change the namespace and directory of XPU bridges
      * Add XPU SDK
      * Fix header files and namespace of XPU SDK
      * Add unit tests for relu and conv2d ops
      * Restore the modification of paddle_api_test
      * Supports simple model which contains only a relu layer
      * Add compiling scripts for XPU
      * Fix compiling errors of XPU
      * Add comments for XPU LoadModel and BuildModel
      06d058fe
  22. 11 10月, 2019 1 次提交
  23. 23 9月, 2019 1 次提交
  24. 19 9月, 2019 1 次提交
  25. 18 9月, 2019 1 次提交
    • X
      fix bias quantize error && fix clang build error (#2049) · 81dffbe8
      Xiaoyang LI 提交于
      * fix gemm_int8, gemv-int8 and conv-int8 math function, add float bias
      
      * change conv impl
      
      * neon int8 kernel support float bias
      
      * arm compute kernel support float bias
      
      * add math_test target
      
      * add tensor utils for testing, fix sgemm ut error
      
      * add gemm_int8 unit test, support float bias
      
      * fix build script
      
      * add conv compute unit test for arm
      
      * fix build script, test=develop
      
      * fix fp32 dw conv3x3s1, test=develop
      
      * add fp32 dw conv3x3s1, test=develop
      
      * add armv7 fp32 dw conv3x3s1, test=develop
      
      * add fp32 depthwise conv3x3s2, test=develop
      
      * fix fp32 conv3x3 depthwise build error, test=develop
      
      * fix gemm_like conv trans weights error, test=develop
      
      * fix int8 depthwise conv3x3 error, test=develop
      
      * turn on all test for arm fp32 conv, test=develop
      
      * fix int8 conv1x1 error
      
      * fix int8 direct conv3x3s1 error, test=develop
      
      * fix int8 direct conv3x3s2, test=develop
      
      * turn on all test for arm int8 conv, test=develop
      
      * fix int8 fc error, change mobilenetv1-int8 ground-truth result to fluid, test=develop
      
      * remove debug info, strip ut binary, test=develop
      
      * fix conv compute error, test=develop
      
      * change Init() to ReInitWhenNeeded(), test=develop
      
      * fix code style, test=develop
      
      * remote engine_test, test=develop
      
      * fix building server tests error, test=develop
      
      * fix sdot clang build error, test=develop
      
      * fix sgemm ut timeout error, test=develop
      
      * fix clang build error, test=develop
      
      * turn off math basic test due to ci time out, test=develop
      
      * fix conv_int8 ut error, test=develop
      81dffbe8
  26. 17 9月, 2019 2 次提交
  27. 12 9月, 2019 2 次提交
  28. 10 9月, 2019 1 次提交