1. 06 Feb, 2022 1 commit
  2. 28 Jan, 2022 1 commit
  3. 27 Jan, 2022 4 commits
    • [PluggableDevice] Add custom kernel support based on pten kernel management (#38848) · a8879215
      Aganlengzi committed
      * [Demo] custom kernel based on pten kernel
      
      * merge and npu custom work well
      
      * del comments
      
      * delete other code
      
      * fix CUDAContext
      
      * fix not found small_vector.h
      
      * support NPU
      
      * fix NPUContext
      
      * fix DeviceContext support
      
      * add UT
      
      * fix call
      
      * add UT
      
      * fix
      
      * fix for comments and ut
      
      * add MACRO control
      
      * fix multi input output
      
      * support env CUSTOM_DEVICE_ROOT
      
      * deal with special cases
      
      * fix for Windows
      
      * try coverage with test_custom_kernel_dot.py
      
      * fix test_custom_kernel_dot
      
      * fix test_custom_kernel_dot
      
      * fix merge
      
      * fix merge
      
      * fix CI
      
      * update
      
      * merge and fix
      
      * remove WITH_CUSTOM_KERNEL
      
      * fix merge
      
      * merge and fix
      
      * fix ut
      
      * fix ut for mac
      
      * add more UT
      
      * add more UT
      
      * fix
    • fix shuffle_channel_detect_pass (#39242) · af9ddeb7
      wenbin committed
      * shuffle channel pass
      
      * add ut
      
      * timeout fix
      
      * makefile fix
    • 9b79988c
    • [Paddle-Inference]: fix concat slice (#39096) · f080e8d5
      Wangzheee committed
      * Paddle-Inference:fix_concat_slice
      
      * Paddle-Inference:fix_concat_slice
      
      * Paddle-Inference:fix_concat_slice
      
      * Paddle-Inference:fix_concat_slice
      
      * [Paddle-Inference]: fix concat slice
      
      * [Paddle-Inference]: fix concat slice
      
      * [Paddle-Inference]: fix concat slice
  4. 26 Jan, 2022 2 commits
    • [pten] remove deprecated fluid op kernel for pten (#38842) · 3ab9aef1
      Leo Chen committed
      * update cmake file to remove fluid kernel
      
      * add pten declaration.h to where pybind.h used
      
      * fix sync_bn and tensorrt_engine
      
      * refine detection_library
      
      * fix interpreter_core
      
      * support eager legacy
      
      * fit eager legacy for pten
      
      * fall back to cpu if not found kernel
      
      * fix compile problem
      
      * fix compile problem
      
      * refine fallback logic
      
      * fit operator.run()
      
      * fix xpu compile
      
      * fit for new_exec
      
      * add REGISTER_OP_WITHOUT_GRADIENT
      
      * un-cache pt_kernel_context
      
      * fix compile
      
      * fix cudnn
      
      * fix compiling with on_infer
      
      * fix mkldnn
      
      * fix isfinite_v2
      
      * fix xpu problem
      
      * fix op_device
      
      * refine fallback for xpu
      
      * fix xpu compile
      
      * merge develop
      
      * refine code format
      
      * fix compile
      
      * fix compile
      
      * add data_transfer
      
      * fix PreparePtenData
      
      * fix cpu context
      
      * merge develop
      
      * fix compile
      
      * fix error device context
      
      * fix xpu
      
      * fix dev_ctx
    • support npu weight unified H2D copy before inference (#39160) · 106b5514
      baoachun committed
      * support npu weight unified H2D copy
      
      * remove redundant variable
  5. 25 Jan, 2022 3 commits
  6. 24 Jan, 2022 1 commit
  7. 18 Jan, 2022 4 commits
  8. 17 Jan, 2022 2 commits
    • disable unsupported trt dimension (#38962) · 55e9087f
      wenbin committed
      * develop test
      
      * throw
      
      * ne
      
      * wrong cnt
    • [Pten] Replace platform::Place to pten::Place. (#38899) · c48a9ad5
      Wilber committed
      * add pten::Place data structure.
      
      * update ci problem
      
      * fix ci problem
      
      * update
      
      * using platform::Place=pten::Place
      
      * remove BOOST_GET_CONST for CPUPlace and GPUPlace
      
      * compile pass 25%.
      
      * compile pass 45%
      
      * compile pass 60%
      
      * remove boost_get for xpu npu mlu and ipu
      
      * compile pass on cpu and gpu.
      
      * fix compile problem
      
      * fix compile error.
      
      * update
      
      * fix ci problem
      
      * update
      
      * ci approve
      
      * fix ci problem
      
      * fix ci eager test problem
      
      * remove BOOST_GET_CONST
      
      * fix npu compile
  9. 15 Jan, 2022 1 commit
  10. 14 Jan, 2022 1 commit
    • add flatten_contiguous_range OpConvert for Paddle-TRT (#38922) · 050aa6fe
      heliqi committed
      * add trt_convert_flatten_contiguous_rang op
      
      * trt version >7,support trt_convert_flatten_contiguous_rang
      
      * trt version >7,support trt_convert_flatten_contiguous_rang
      
      * trt version >7,support trt_convert_flatten_contiguous_rang
      
      * test case add trt version >=7 skip
  11. 13 Jan, 2022 3 commits
  12. 11 Jan, 2022 1 commit
  13. 10 Jan, 2022 1 commit
    • [Unify Tensors PR #5] framework::Tensor inherits from DenseTensor,test=allcases (#38632) · 5c73a6ea
      Zhanlue Yang committed
      * Added shared_ptr<Allocation> member & corresponding interfaces to Storage
      
      * Removed original pten::Allocation from Storage and adjusted the interfaces accordingly
      
      * Fixed issues with storage offset
      
      * Used place to malloc allocation for TensorStorage
      
      * [Unify Tensors PR #3]Ported framework::Tensor interfaces to pten::DenseTensor
      
      * Fixed issues with place
      
      * Added comments
      
      * Moved mutable_data with stream argument to DenseTensor
      
      * Added set_offset interface
      
      * Fixed CI issues,test=allcases
      
      * [Unify Tensors PR #4] Port LoDTensor interfaces to DenseTensor
      
      * Removed friend class EigenTensor/EigenMatrix/EigenVector from Tensor
      
      * Modified framework::Tensor to inherit from DenseTensor
      
      * Reverted changes to pten_layout() interface
      
      * Removed friend classes
      
      * Rearranged cfunction calls from tensor.data<void>() to tensor.data()
      
      * Fixed CI issues
      
      * Fixed lite issues
      
      * Fixed data() interface issues,test=allcases
      
      * Resolved IsInitialized() issues
      
      * Fixed ResetHolder() issues
      
      * Fixed MKLDNN & Storage issues
      
      * Resolved ShareBufferWith() issues
      
      * Fixed LoD issues
  14. 06 Jan, 2022 1 commit
  15. 05 Jan, 2022 2 commits
  16. 04 Jan, 2022 1 commit
  17. 31 Dec, 2021 1 commit
  18. 30 Dec, 2021 2 commits
  19. 23 Dec, 2021 2 commits
  20. 20 Dec, 2021 3 commits
  21. 17 Dec, 2021 2 commits
  22. 15 Dec, 2021 1 commit