1. 24 Mar, 2020 1 commit
  2. 14 Feb, 2020 1 commit
    • [X86] Optimize gru and softmax (#2877) · 6b30c58a
      Committed by Yiqun Liu
      * Optimize softmax. When the input tensor is 2-D and axis is 1, there is no need to resize.
      
      * Optimize gru by avoiding calls to Tensor::Slice.
      test=develop
      
      * Remove a std::vector in softmax.
      test=develop
      
      * Define CalculateSeqWidth to get the width of a sequence.
      test=develop
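The softmax note in this commit says a 2-D input with axis 1 needs no resize. A minimal sketch of why, assuming a row-major float buffer: each row is already one contiguous softmax group, so the kernel can walk the rows directly. `SoftmaxRows` is a hypothetical name for illustration, not the function in the repository.

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper: softmax over each row of a row-major [rows x cols]
// buffer. With a 2-D input and axis == 1, every row is one softmax group,
// so no reshape into (outer, axis, inner) is required.
void SoftmaxRows(const float* in, float* out, std::size_t rows,
                 std::size_t cols) {
  if (cols == 0) return;
  for (std::size_t r = 0; r < rows; ++r) {
    const float* x = in + r * cols;
    float* y = out + r * cols;
    // Subtract the row maximum before exponentiating for numerical stability.
    const float max_v = *std::max_element(x, x + cols);
    float sum = 0.f;
    for (std::size_t c = 0; c < cols; ++c) {
      y[c] = std::exp(x[c] - max_v);
      sum += y[c];
    }
    for (std::size_t c = 0; c < cols; ++c) {
      y[c] /= sum;
    }
  }
}
```

Each output row sums to 1, and equal inputs in a row receive equal probabilities.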
  3. 13 Feb, 2020 1 commit
  4. 11 Feb, 2020 1 commit
  5. 02 Jan, 2020 1 commit
  6. 26 Dec, 2019 1 commit
  7. 25 Dec, 2019 1 commit
    • [X86] Polish the implementation of fc and improve the unit test (#2656) · a0f01efa
      Committed by Yiqun Liu
      * Remove GEMM padding in fc_compute.
      test=develop
      
      * Write a common ParallelFor function to run the for loop in parallel.
      
      * Add the GEMM padding code back to fc.
      
      * Refine the fc code for the case where padding_weight is false to avoid defining a temporary Tensor.
      
      * Refine the unit test of fc and add test cases for padding and parallel execution.
      test=develop
      
      * Enable more test cases in common fc unittest, including padding and parallel for x86 target.
      
      * Remove the fc test under kernels/x86.
      test=develop
      
      * Disable relu in test of fc for non-x86 target.
      test=develop
      
      * Change the eps for ARM.
      test=develop
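The commit above mentions a common ParallelFor function for running a for loop in parallel. A minimal sketch of that idea follows, assuming an OpenMP-based loop; the signature and the `SquareAll` usage helper are hypothetical and may differ from the repository's version. Without `-fopenmp` the pragma is ignored and the loop simply runs serially.

```cpp
#include <cstdint>
#include <vector>

// Hypothetical sketch: run body(i) for every i in [begin, end).
// The iterations must be independent of each other.
template <typename Body>
void ParallelFor(int64_t begin, int64_t end, const Body& body) {
#pragma omp parallel for
  for (int64_t i = begin; i < end; ++i) {
    body(i);
  }
}

// Usage example: each iteration writes a distinct element, so there is
// no data race between threads.
std::vector<int64_t> SquareAll(int64_t n) {
  std::vector<int64_t> out(static_cast<std::size_t>(n), 0);
  ParallelFor(0, n, [&out](int64_t i) {
    out[static_cast<std::size_t>(i)] = i * i;
  });
  return out;
}
```

Passing the loop body as a callable keeps the threading choice in one place, so a kernel such as fc can switch between serial and parallel execution without duplicating the loop.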
  8. 27 Nov, 2019 1 commit
  9. 22 Nov, 2019 1 commit
    • update pooling 2-padding to 4-padding (#2410) · 4bdb6171
      Committed by HappyAngel
      * fix pooling bug and speed
      
      * fix build error
      
      * delete VLOG in pool, test=develop
      
      * add openmp, test=develop
      
      * fix lite/kernels/arm/pool_compute_test basic_pooling compute error bug, test=develop
      
      * update pooling 2-pad to 4-pad, test=develop
      
      * fix 2-pad to 4-pad in operators/pool_op.h. AttachKernel sets param, so the 2-pad to 4-pad conversion should be placed in AttachKernel. test=develop
      
      * put 2-pad to 4-pad in AttachImpl, test=develop
      
      * according to reviews, fix some format error. test=develop
      
      * fix format error, add (). test=develop
      
      * change paddings type to support dynamic modification, test=develop
      
      * update padding type in other devices, test=develop
      
      * fix x86 build error on shared_ptr, test=develop
      
      * fix format in operators pool_op.cc, test=develop
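The 2-pad to 4-pad change in this commit can be sketched as follows. This is an illustration under stated assumptions: the 2-element form is taken to be {pad_h, pad_w} and the 4-element form {pad_top, pad_bottom, pad_left, pad_right}, and `ExpandPaddings` is a hypothetical name, not the function in operators/pool_op.h.

```cpp
#include <stdexcept>
#include <vector>

// Hypothetical helper: expand symmetric {pad_h, pad_w} paddings into the
// 4-element form {pad_top, pad_bottom, pad_left, pad_right}.
// The element ordering is an assumption made for this sketch.
std::vector<int> ExpandPaddings(const std::vector<int>& paddings) {
  if (paddings.size() == 4) {
    return paddings;  // Already in 4-pad form.
  }
  if (paddings.size() != 2) {
    throw std::invalid_argument("paddings must have 2 or 4 elements");
  }
  return {paddings[0], paddings[0], paddings[1], paddings[1]};
}
```

Doing the expansion once, when the operator attaches its parameters, means the kernels only ever have to handle the 4-element form.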
  10. 19 Nov, 2019 1 commit
  11. 07 Nov, 2019 1 commit
    • fix jit_matmul bug according to paddle pr#20948 test=develop (#2392) · fc722c6b
      Committed by Wilber
      Fix a jit::matmul bug. Input x has shape (m, k) and the weight has shape (k, n). When k < 512, m == 1, and n is a multiple of 16, the weight pointer is not updated correctly in the group calculation inside jit::matmul, which produces a diff in the result.
  12. 29 Oct, 2019 1 commit
    • Add tanh op and gelu op for x86 platform (#2265) · e3368aa4
      Committed by liu zhengxi
      * add tanh op on x86 platform and its unit test, test=develop
      
      * add gelu op on x86 platform and its unit tests, test=develop
      
      * update depends for math_function for activation for gelu, test=develop
  13. 23 Oct, 2019 1 commit
  14. 14 Oct, 2019 1 commit
  15. 19 Sep, 2019 2 commits
  16. 16 Sep, 2019 1 commit
    • Gru op (#2002) · 1cb36af6
      Committed by lhl960107
      * add x86 gru, relu, and sequence_expand_as ops, test=develop
  17. 12 Sep, 2019 1 commit
  18. 10 Sep, 2019 1 commit
  19. 07 Sep, 2019 1 commit
  20. 03 Sep, 2019 1 commit