1. 25 8月, 2022 1 次提交
    • K
      Transfer memcpy d2h from fluid to phi (#45150) · 0d14e74a
      kangguangli 提交于
      * transfer memcpy_d2h from fluid to phi
      
      * refine arg check and add comment
      
      * fix cannot fallback to phi kernel
      
      * fix gpu_context host alloc when tensor size = 0
      
      * add kernel for std::vector<DenseTensor> args
      
      * fix bugs in MemcpyD2HMultiIOKernel
      
      * remove useless header file
      
      * polish format
      
      * fix typo
      
      * add testcase for cudapinned place
      
      * refine check condition in test
      
      * polish error message
      
      * polish error message
      
      * remove header in fluid  directory
      
      * merge memcpy_h2d and memcpy_d2h into one file, change register method to simplify implementation
      
      * fix code style check
      0d14e74a
  2. 24 8月, 2022 1 次提交
  3. 23 8月, 2022 2 次提交
  4. 22 8月, 2022 1 次提交
  5. 19 8月, 2022 1 次提交
  6. 18 8月, 2022 2 次提交
  7. 17 8月, 2022 1 次提交
  8. 16 8月, 2022 2 次提交
  9. 15 8月, 2022 1 次提交
  10. 12 8月, 2022 1 次提交
    • S
      Offload calculations from matmul op to fuse pass (#44941) · acb78ea2
      Sławomir Siwek 提交于
      * remove v2_transpose_reshape
      
      * matmul_transpose_reshape
      
      * reshape_transpose_matmul
      
      * Add int8 support for matmulV2
      
      * restore ut
      
      * adjust old ut
      
      * restore parallel UT ruels
      
      * remove mkldnn code from base ops
      
      * move enforces to pass
      
      * remove duplicated functions
      
      * delete duplicated enforces
      
      * feedback from review
      
      * add comments to variables
      
      * enable eltwise support
      
      * dynamic attribute
      
      * remove fusepass tests from op test
      
      * remove fuse pass cases from op test
      
      * revert introduction of dynamic attributes
      
      * style
      Co-authored-by: Nwozna <joanna.wozna@intel.com>
      acb78ea2
  11. 10 8月, 2022 1 次提交
  12. 09 8月, 2022 1 次提交
  13. 08 8月, 2022 2 次提交
  14. 05 8月, 2022 2 次提交
  15. 03 8月, 2022 1 次提交
  16. 02 8月, 2022 1 次提交
  17. 29 7月, 2022 3 次提交
    • A
      add FLAGS_enable_api_kernel_fallback (#44706) · e439d735
      Aganlengzi 提交于
      * add FLAGS_enable_api_kernel_fallback
      
      * deal with more cases
      
      * add ut for coverage
      e439d735
    • L
      move CUDAStream to phi (#44529) · da3743fd
      Leo Chen 提交于
      * init
      
      * move CUDAStream to phi
      
      * fix compilation
      
      * merge develop
      
      * add stream_owned_ member
      
      * split cuda_stream.h
      
      * fix cpu compile
      
      * fix constructor
      
      * fix bug
      
      * fix windows compile
      
      * fix inference test_levit
      
      * fix windows tests
      da3743fd
    • Y
      [Phi] Add yaml for assign_value (#44596) · 88584396
      Yulong Ao 提交于
      * [Phi] Add yaml for assign_value
      
      * [Phi] Fix the bug of the assign api and modify the unittest
      
      * [Phi] Fix the bug when the tensor does not have the backend info
      
      * [Phi] Replace the functional-style cast init by the brace-init
      
      * [Phi] Cast the data explicitly
      88584396
  18. 28 7月, 2022 1 次提交
  19. 27 7月, 2022 1 次提交
  20. 26 7月, 2022 1 次提交
  21. 22 7月, 2022 1 次提交
    • A
      [CustomDevice] register Copy for custom device (#44200) · 3b0aa75e
      Aganlengzi 提交于
      * [CustomDevice] register Copy for custom device
      
      * [CustomDevice] register Copy for custom device
      
      * [CustomDevice] register Copy for custom device
      
      * merge and add uts
      
      * merge and add uts
      
      * fix for blocking and unittests coverage
      3b0aa75e
  22. 19 7月, 2022 1 次提交
  23. 18 7月, 2022 1 次提交
  24. 15 7月, 2022 1 次提交
  25. 14 7月, 2022 2 次提交
  26. 13 7月, 2022 1 次提交
  27. 12 7月, 2022 1 次提交
  28. 07 7月, 2022 1 次提交
  29. 04 7月, 2022 1 次提交
  30. 29 6月, 2022 1 次提交
  31. 28 6月, 2022 1 次提交
    • R
      Remove boost::variant (#43100) · b3cf28f8
      Ruibiao Chen 提交于
      * boost::variant -> paddle::variant
      
      * boost::variant.apply_visit -> paddle::visit
      
      * Update pybind_boost_hraders.h
      
      * Fix CINN compilation errors
      
      * Revert FetchResultType
      b3cf28f8
  32. 24 6月, 2022 1 次提交
    • Y
      [Phi]Change Copy from Kernel to basic component utils (#43622) · 2739bd73
      YuanRisheng 提交于
      * perfect copy
      
      * deal with conflict
      
      * deal with conflict
      
      * fix compile bugs
      
      * fix unittest bugs
      
      * change code format
      
      * deal with conflict
      
      * modify code by review
      
      * fix ce bugs
      
      * fix ce bugs
      
      * add lo
      
      * perfect code format
      
      * deal with conflicts
      2739bd73