1. 28 6月, 2022 1 次提交
  2. 27 6月, 2022 1 次提交
    • S
      make inference_api_test compile with dynamic linking library (#41944) · f8b9073f
      Sing_chan 提交于
      * make inference_api_test use import_lib + dll to smaller its size;test=document_fix;test=windows_ci_inference
      
      * make place.h not export symbols; change the order of inference test dependencies
      
      * cant delete analysis pass ..in inference_analysis_test or build test_analyzer.exe will failed; temporarily comment test need paddle_inference_c
      
      * add onnx path; cp onnxruntime.dll to tests/api
      
      * make c_ut depend on paddle_inference_c_shared rather than paddle_inference_c
      
      * make c_ut depend .dll in windows and .lib in linux;path add capi_exp
      
      * init context pool by analysis predictor
      
      * make windows and linux both use shared; compile shared even on_infer off
      
      * move case:profile after case:fuse_statis to avoid context pool null problem
      
      * make c_ut depend .dll in windows and .lib in linux;path add capi_exp
      f8b9073f
  3. 14 6月, 2022 1 次提交
  4. 13 6月, 2022 1 次提交
  5. 04 6月, 2022 1 次提交
  6. 25 4月, 2022 1 次提交
  7. 14 4月, 2022 2 次提交
    • S
      FC+elementwise_add (residual connection) (#41776) · 92d8d0bc
      Sławomir Siwek 提交于
      * Change tensor name to match activation
      
      * declare fc_eltwise_add pass
      
      * merge conv_eltwise refactor PR
      
      * first compilable draft
      
      * unittest feedback tools
      
      * Fuse pass tester
      
      * Move IsReachable() to shared file
      
      * 100% coverage of fuse_pass_tester.cc
      
      * register pass
      
      * Add bias node
      
      * Improve unit tests / remove bias node from pattern
      
      * improve fc_eltwiseadd_unittest
      
      * cancel eltwise_add fuse if act is already fused
      
      * Add elementwise_input scale
      
      * Residual MVP
      
      * Add new FC attrs
      
      * Add more test cases
      
      * Add missing op attrs
      
      * Adapt code to new Elementwise pattern
      
      * reuse existing fcpattern
      
      * improve code style
      
      * remove unused arguments
      
      * fix typo
      
      * remove whitespace
      
      * remove int8 related code
      
      * Remove attributes from base ops
      
      * style
      
      * style check
      
      * Remove input from base op
      
      * Set attribute during fuse
      
      * ut timeout
      
      * download and test model
      
      * DRY
      
      * apply feedback from review
      
      * Style check
      
      * fix typo
      
      * cosmetic changes
      
      * explicitly set residual as output
      
      * VIT-OCR accuracy check
      
      * trigger CI
      
      * remove whitespaces
      
      * fix missing data file
      92d8d0bc
    • B
      add mkldnn int8 pass [step3] (#41599) · 8e2d4d30
      baoachun 提交于
      * add mkldnn int8 pass [step3]
      
      * Add test for compute_propagate_scales_mkldnn_pass
      
      * update pass
      
      * update api comment and python api
      Co-authored-by: Nwozna <joanna.wozna@intel.com>
      8e2d4d30
  8. 02 3月, 2022 1 次提交
  9. 01 3月, 2022 1 次提交
  10. 25 2月, 2022 1 次提交
  11. 23 2月, 2022 1 次提交
  12. 15 12月, 2021 1 次提交
  13. 07 12月, 2021 1 次提交
  14. 26 10月, 2021 1 次提交
  15. 14 9月, 2021 1 次提交
  16. 31 8月, 2021 1 次提交
  17. 27 8月, 2021 2 次提交
  18. 26 8月, 2021 1 次提交
    • S
      Add copy from tensor (#34406) · ac33c0ca
      Shang Zhizhou 提交于
      * add api
      
      * temp save
      
      * revert
      
      * copytocpu async ok
      
      * fix style
      
      * copy sync ok
      
      * fix compile error
      
      * fix compile error
      
      * api done
      
      * update python async api
      
      * fix compile
      
      * remove async python api; add c++ async unittest
      
      * remove python async api
      
      * update unittest
      
      * update unittest
      
      * add C++ unittest for copytensor
      
      * add unittest
      
      * update namespace utils to class TensorUtils
      
      * add unittest
      
      * update unittest
      
      * update unittest
      
      * update code style
      
      * update code style
      
      * update unittest
      ac33c0ca
  19. 09 6月, 2021 1 次提交
  20. 01 6月, 2021 1 次提交
  21. 26 5月, 2021 1 次提交
  22. 19 5月, 2021 1 次提交
  23. 24 4月, 2021 1 次提交
  24. 29 3月, 2021 1 次提交
  25. 25 2月, 2021 2 次提交
  26. 20 2月, 2021 1 次提交
  27. 03 2月, 2021 1 次提交
  28. 01 2月, 2021 1 次提交
  29. 21 1月, 2021 1 次提交
  30. 20 1月, 2021 1 次提交
  31. 29 12月, 2020 1 次提交
  32. 11 12月, 2020 1 次提交
  33. 02 12月, 2020 1 次提交
  34. 30 11月, 2020 1 次提交
  35. 27 11月, 2020 1 次提交
    • S
      detect tensorRT plugin fp16 in runtime (#27933) · b9e76a01
      Shang Zhizhou 提交于
      * remove -DSUPPORTS_CUDA_FP16 in cuda.cmake
      
      * comile with cuda9
      
      * add some unittest
      
      * notest;test=coverage
      
      * add unittest for trt plugin swish && split
      
      * update ernie unittest
      
      * fix some error message
      
      * remove repeated judgement of CUDA version in mbEltwiseLayerNormOpConverter
      
      * fix comile errror when CUDA_ARCH_NAME < Pascal"
      
      * fix comile error
      
      * update unittest timeout
      
      * compile with cuda9
      
      * update error msg
      
      * fix code style
      
      * add some comments
      
      * add define IF_CUDA_ARCH_SUPPORT_FP16
      
      * rename IF_CUDA_ARCH_SUPPORT_FP16 to CUDA_ARCH_FP16_SUPPORTED
      b9e76a01
  36. 26 11月, 2020 1 次提交
  37. 13 11月, 2020 1 次提交