1. 10 1月, 2020 1 次提交
    • G
      [cherry-pick] Add FC padding, ernie test unit and layernorm parallel (#22198) · 3df38f5c
      GaoWei8 提交于
      * Optimize the kernel implementation of layernorm with openmp (#20895)
      
      * Add ernie c++ inference test (#21015)
      
      * Add ernie unit test
      test=develop
      
      * Add ernie unit test
      test=develop
      
      * Add ernie unit test
      test=develop
      
      * remove ngraph
      
      * optimize gpu test
      test=develop
      
      * optimize codes
      test=develop
      
      * fix cmake fails on inference_download_and_uncompress (#21185)
      
      * solve cmake fails on inference_download_and_uncompress
      test=develop
      
      * solve cmake fails on inference_download_and_uncompress
      test=develop
      
      * Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972)
      
      * Add fc padding to solve mkl performance
      test=develop
      
      * fix gpu pass and error information
      test=develop
      
      * fix fc_fuse_pass_test
      test=develop
      
      * fix error information
      test=develop
      
      * fix error information
      test=develop
      
      * fix name and add fc op padding test
      test=develop
      
      * fix attributes
      test=develop
      
      * optimize fc padding
      test=develop
      
      * fix test
      test=develop
      
      * Polish the codes of fc when needs padding (#21378)
      
      test=develop
      
      * Add ernie large c++ inference test (#21365)
      
      * add ernie-large test
      test=develop
      
      * add ernie large c++ inference test
      test=develop
      
      * Modify padding strategy: remove weight copy in fc padding (#21650)
      
      test=develop
      
      * optimize fc jit (#21878)
      
      test=develop
      Co-authored-by: NYihua Xu <yihuaxu@hotmail.com>
      3df38f5c
  2. 09 12月, 2019 1 次提交
  3. 06 12月, 2019 1 次提交
  4. 02 12月, 2019 1 次提交
  5. 29 11月, 2019 1 次提交
  6. 25 11月, 2019 1 次提交
  7. 31 10月, 2019 1 次提交
  8. 20 10月, 2019 1 次提交
  9. 18 10月, 2019 1 次提交
  10. 16 10月, 2019 1 次提交
  11. 15 10月, 2019 1 次提交
  12. 14 10月, 2019 1 次提交
  13. 10 10月, 2019 1 次提交
    • L
      [Cherry-pick] Add C-API for fluid inference api (#20259) · f72d82cc
      liu zhengxi 提交于
      * Add capi for fluid inference api (#20092)
      
      * add capi for fluid inference api, including AnalysisConfig, AnalysisPredictor, PaddleBuf, PaddleTensor, ZeroCopyTensor
      
      * add dll to inference capi (#20180)
      
      * add dll to inference capi, test=develop
      
      * add if win32 in cmakelists, test=develop
      f72d82cc
  14. 25 9月, 2019 2 次提交
  15. 21 9月, 2019 2 次提交
  16. 17 9月, 2019 1 次提交
  17. 16 9月, 2019 1 次提交
    • Y
      Enhance fc_fuse_pass to enable fusing relu to fc_op (#19733) · c67c8758
      Yiqun Liu 提交于
      * Refine the codes related to fc op.
      
      * Add GPU implementation for fc functor.
      
      * Apply fc_fuse_pass in GPU inference.
      test=develop
      
      * Change the cmake for fc op.
      
      * Change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ.
      
      * Add an attribute to set the activation type in fc_op.
      
      * Enhance the unittest of fc_op.
      test=develop
      
      * Remove the declaration of FCOpGrad back to the header file.
      test=develop
      
      * Set default value for newly added arguments in test_fc_op.
      test=develop
      
      * Enhance fc_fuse_pass to enable fusing relu.
      
      * Allow print the shapes of var_desc in graph.
      test=develop
      
      * Enhance fc_fuse_pass_tester.
      
      * Remove the use of PADDLE_ENFORCE.
      test=develop
      
      * Correct the number of ops after fusing.
      test=develop
      
      * Fix a typo.
      test=develop
      
      * Set activation_type to null when there is no relu in fc.
      test=develop
      
      * Refine fc_fuse_pass's codes.
      
      * Enable the set of shape for tensor.
      
      * Refine repeated_fc_relu_pass and add unittest.
      test=develop
      c67c8758
  18. 03 9月, 2019 1 次提交
    • Y
      A a pass to enable the use of cudnn (#19346) · c5548178
      Yiqun Liu 提交于
      * Add a interface to enable cudnn for inference.
      
      * Add cudnn_placement_pass.
      test=develop
      
      * Set the default value of cudnn_enabled_op_types to null.
      test=develop
      
      * Write the common basic class, placement_pass_base, to refine the codes.
      test=develop
      
      * Call EnableCUDNN in unittest.
      test=develop
      
      * Refine cudnn_placement_pass tester.
      
      * Enable the testing of cudnn_placement_pass in inference's unittest.
      test=develop
      
      * Add the check of op kernels.
      test=develop
      c5548178
  19. 22 8月, 2019 1 次提交
  20. 15 8月, 2019 1 次提交
  21. 30 7月, 2019 1 次提交
  22. 11 7月, 2019 1 次提交
  23. 08 7月, 2019 2 次提交
  24. 05 7月, 2019 1 次提交
  25. 03 7月, 2019 2 次提交
  26. 02 7月, 2019 1 次提交
  27. 27 6月, 2019 1 次提交
    • L
      some fixes for int8 mobilenet_ssd tester (#18112) · 5fd68ac1
      lidanqing 提交于
      * some fixes for int8 mobilenet_ssd tester
      test=develop
      
      * change wrong data file name
      test=develop
      
      * change test images bin file from 200 images to 100 images
      
      * change directory existence to file existence during downloading
      test=develop
      
      * reuse download_data
      test=develop
      
      * run full dataset when iterations=0
      test=develop
      5fd68ac1
  28. 19 6月, 2019 2 次提交
  29. 16 6月, 2019 2 次提交
  30. 14 6月, 2019 1 次提交
    • L
      add Mobilienet ssd int8 analyzer tester (#18075) · 46625415
      lidanqing 提交于
      * add pascalvoc preprocess script and mobilenet-ssd analyzer_tester, wait 17737
      
      * change converting local dataset to downloading and converting tarfile
      test=develop
      
      * change the test data_path
      test=develop
      
      * change copyright (c) 2016 to copyright (c) 2019
      test=develop
      46625415
  31. 13 6月, 2019 2 次提交
  32. 11 6月, 2019 1 次提交
    • Update the Anakin interfaces for content-dnn and MLU (#17890) · bce259e5
      石晓伟 提交于
      * update anakin-engine interfaces for content-dnn
      
      test=develop
      
      * support only-gpu mode of Anakin
      
      modify eltwise parse
      
      test=develop
      
      * modification for thread-safe
      
      test=develop
      
      * Integrated template instance
      
      test=develop
      
      * increase template parameters
      
      test=develop
      
      * support MLU predictor
      
      test=develop
      
      * update anakin cmake files
      
      test=develop
      
      * update TargetWrapper::set_device
      
      * update the initialization of anakin subgraph
      
      test=develop
      
      * use the default constructor of base class
      
      test=develop
      bce259e5
  33. 06 6月, 2019 1 次提交