1. 13 1月, 2020 1 次提交
  2. 10 1月, 2020 2 次提交
    • G
      [cherry-pick] Add FC padding, ernie test unit and layernorm parallel (#22198) · 3df38f5c
      GaoWei8 提交于
      * Optimize the kernel implementation of layernorm with openmp (#20895)
      
      * Add ernie c++ inference test (#21015)
      
      * Add ernie unit test
      test=develop
      
      * Add ernie unit test
      test=develop
      
      * Add ernie unit test
      test=develop
      
      * remove ngraph
      
      * optimize gpu test
      test=develop
      
      * optimize codes
      test=develop
      
      * fix cmake fails on inference_download_and_uncompress (#21185)
      
      * solve cmake fails on inference_download_and_uncompress
      test=develop
      
      * solve cmake fails on inference_download_and_uncompress
      test=develop
      
      * Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972)
      
      * Add fc padding to solve mkl performance
      test=develop
      
      * fix gpu pass and error information
      test=develop
      
      * fix fc_fuse_pass_test
      test=develop
      
      * fix error information
      test=develop
      
      * fix error information
      test=develop
      
      * fix name and add fc op padding test
      test=develop
      
      * fix attributes
      test=develop
      
      * optimize fc padding
      test=develop
      
      * fix test
      test=develop
      
      * Polish the codes of fc when needs padding (#21378)
      
      test=develop
      
      * Add ernie large c++ inference test (#21365)
      
      * add ernie-large test
      test=develop
      
      * add ernie large c++ inference test
      test=develop
      
      * Modify padding strategy: remove weight copy in fc padding (#21650)
      
      test=develop
      
      * optimize fc jit (#21878)
      
      test=develop
      Co-authored-by: NYihua Xu <yihuaxu@hotmail.com>
      3df38f5c
    • fix multi-thread error of fc_gru_fuse_pass.cc, test=develop (#21841) (#22185) · e8e12499
      石晓伟 提交于
      * fix multi-thread error of fc_gru_fuse_pass.cc, test=develop
      
      * export FLAGS and GLOG symbols, test=develop
      e8e12499
  3. 16 12月, 2019 1 次提交
  4. 09 12月, 2019 1 次提交
  5. 08 12月, 2019 1 次提交
  6. 06 12月, 2019 1 次提交
  7. 05 12月, 2019 1 次提交
  8. 04 12月, 2019 2 次提交
  9. 03 12月, 2019 2 次提交
  10. 02 12月, 2019 1 次提交
  11. 29 11月, 2019 2 次提交
  12. 25 11月, 2019 1 次提交
  13. 02 11月, 2019 1 次提交
  14. 01 11月, 2019 1 次提交
  15. 31 10月, 2019 1 次提交
  16. 20 10月, 2019 1 次提交
  17. 18 10月, 2019 2 次提交
  18. 16 10月, 2019 1 次提交
  19. 15 10月, 2019 1 次提交
  20. 14 10月, 2019 2 次提交
  21. 12 10月, 2019 1 次提交
  22. 10 10月, 2019 1 次提交
    • L
      [Cherry-pick] Add C-API for fluid inference api (#20259) · f72d82cc
      liu zhengxi 提交于
      * Add capi for fluid inference api (#20092)
      
      * add capi for fluid inference api, including AnalysisConfig, AnalysisPredictor, PaddleBuf, PaddleTensor, ZeroCopyTensor
      
      * add dll to inference capi (#20180)
      
      * add dll to inference capi, test=develop
      
      * add if win32 in cmakelists, test=develop
      f72d82cc
  23. 01 10月, 2019 1 次提交
  24. 27 9月, 2019 1 次提交
    • update operator compatible info, test=develop (#19978) · 01b9d079
      石晓伟 提交于
      * update operator compatible info, test=develop
      
      * revert cmake/version.cmake, test=develop
      
      * add unit_tests and fix bugs, test=develop
      
      * update ../paddle/fluid/framework/framework.proto, test=develop
      
      * fix bug of paddle/fluid/inference/api/analysis_predictor.cc, test=develop
      
      * update paddle/fluid/framework/version_test.cc, test=develop
      
      * add comments and rename interfaces, test=develop
      01b9d079
  25. 25 9月, 2019 2 次提交
  26. 21 9月, 2019 3 次提交
  27. 20 9月, 2019 1 次提交
  28. 19 9月, 2019 1 次提交
    • Y
      Add a pass to fuse fc+elementwise_add+layernorm (#19776) · 3cd985a6
      Yiqun Liu 提交于
      * Add fc_elementwise_layernorm_fuse pass and unittest.
      
      * Add fused_fc_elementwise_layernorm op and its GPU kernel.
      test=develop
      
      * Apply fc_elementwise_layernorm_fuse_pass to GPU inference.
      
      * Add the setting of attrs in the definition of binary_op.
      test=develop
      
      * Add comment.
      
      * Implement the unittest.
      test=develop
      
      * Change the unittest name of layer_norm.
      test=develop
      3cd985a6
  29. 18 9月, 2019 1 次提交
  30. 17 9月, 2019 2 次提交