1. 05 9月, 2022 1 次提交
    • H
      [phi] Migrate memcpy kernel to PHI, hold NPU op (#45622) · 2f19a364
      HongyuJia 提交于
      * migrate memcpy to phi
      
      * fix typo error
      
      * fix typo error
      
      * fix  bug and testcase
      
      * fix typo, uniform_random_kernel.cc header
      
      * fix Alloc pinned bug
      
      * change GPUContext::GetPinnedPlace
      
      * add GetPinnedPlace function
      
      * add GetPinnedPlace function
      
      * restore default throw error
      
      * fix Unimplemented error
      
      * skip StandaloneExecutor testcase
      
      * delete memcpy_sig
      2f19a364
  2. 04 9月, 2022 1 次提交
  3. 02 9月, 2022 1 次提交
  4. 01 9月, 2022 1 次提交
  5. 30 8月, 2022 2 次提交
    • Z
      Remove extra attribute in OpMaker (#44310) · fe321f9a
      zyfncg 提交于
      * add runtime config in phi
      
      * add runtime attr for op desc and op
      
      * fix no proto error
      
      * adjust opdesc set_attr impl
      
      * try to remove conv_op extra attrs
      
      * add init runtime attr map
      
      * change extra header path
      
      * fix runtime_attr
      
      * fix trace_op
      
      * fix bug of pass
      
      * fix merge conflict
      
      * fix dygraph attrs
      
      * fix bug of pass
      
      * fix dygraph bug
      
      * fix unittest module
      
      * delete extra attr default
      
      * fix dropout kernel
      
      * polish code
      
      * fix extra output of instance_norm
      
      * fix merge confilct
      
      * fix op_desc bug
      
      * add extra attr in yaml for conv3d_transpose
      
      * don't remove extra input and output
      
      * fix save_inference_model
      
      * fix bug of batch_norm
      
      * revert some change
      
      * polish log
      
      * polish code
      
      * add code comment
      Co-authored-by: NChen Weihang <chenweihang@baidu.com>
      fe321f9a
    • K
      fix memcpy_h2d bug related to cuda stream setting when allocate memory (#45450) · 10abdb8f
      kangguangli 提交于
      * fix memcpy_h2d bug related to cuda stream setting when allocate memory
      
      * add header file
      
      * fix compile error for cpu only
      10abdb8f
  6. 25 8月, 2022 2 次提交
    • F
      add support for double attributes (#45390) · efab2eb4
      Feiyu Chan 提交于
      efab2eb4
    • K
      Transfer memcpy d2h from fluid to phi (#45150) · 0d14e74a
      kangguangli 提交于
      * transfer memcpy_d2h from fluid to phi
      
      * refine arg check and add comment
      
      * fix cannot fallback to phi kernel
      
      * fix gpu_context host alloc when tensor size = 0
      
      * add kernel for std::vector<DenseTensor> args
      
      * fix bugs in MemcpyD2HMultiIOKernel
      
      * remove useless header file
      
      * polish format
      
      * fix typo
      
      * add testcase for cudapinned place
      
      * refine check condition in test
      
      * polish error message
      
      * polish error message
      
      * remove header in fluid  directory
      
      * merge memcpy_h2d and memcpy_d2h into one file, change register method to simplify implementation
      
      * fix code style check
      0d14e74a
  7. 24 8月, 2022 1 次提交
  8. 23 8月, 2022 2 次提交
  9. 22 8月, 2022 1 次提交
  10. 19 8月, 2022 1 次提交
  11. 18 8月, 2022 2 次提交
  12. 17 8月, 2022 1 次提交
  13. 16 8月, 2022 2 次提交
  14. 15 8月, 2022 1 次提交
  15. 12 8月, 2022 1 次提交
    • S
      Offload calculations from matmul op to fuse pass (#44941) · acb78ea2
      Sławomir Siwek 提交于
      * remove v2_transpose_reshape
      
      * matmul_transpose_reshape
      
      * reshape_transpose_matmul
      
      * Add int8 support for matmulV2
      
      * restore ut
      
      * adjust old ut
      
      * restore parallel UT ruels
      
      * remove mkldnn code from base ops
      
      * move enforces to pass
      
      * remove duplicated functions
      
      * delete duplicated enforces
      
      * feedback from review
      
      * add comments to variables
      
      * enable eltwise support
      
      * dynamic attribute
      
      * remove fusepass tests from op test
      
      * remove fuse pass cases from op test
      
      * revert introduction of dynamic attributes
      
      * style
      Co-authored-by: Nwozna <joanna.wozna@intel.com>
      acb78ea2
  16. 10 8月, 2022 1 次提交
  17. 09 8月, 2022 1 次提交
  18. 08 8月, 2022 2 次提交
  19. 05 8月, 2022 2 次提交
  20. 03 8月, 2022 1 次提交
  21. 02 8月, 2022 1 次提交
  22. 29 7月, 2022 3 次提交
    • A
      add FLAGS_enable_api_kernel_fallback (#44706) · e439d735
      Aganlengzi 提交于
      * add FLAGS_enable_api_kernel_fallback
      
      * deal with more cases
      
      * add ut for coverage
      e439d735
    • L
      move CUDAStream to phi (#44529) · da3743fd
      Leo Chen 提交于
      * init
      
      * move CUDAStream to phi
      
      * fix compilation
      
      * merge develop
      
      * add stream_owned_ member
      
      * split cuda_stream.h
      
      * fix cpu compile
      
      * fix constructor
      
      * fix bug
      
      * fix windows compile
      
      * fix inference test_levit
      
      * fix windows tests
      da3743fd
    • Y
      [Phi] Add yaml for assign_value (#44596) · 88584396
      Yulong Ao 提交于
      * [Phi] Add yaml for assign_value
      
      * [Phi] Fix the bug of the assign api and modify the unittest
      
      * [Phi] Fix the bug when the tensor does not have the backend info
      
      * [Phi] Replace the functional-style cast init by the brace-init
      
      * [Phi] Cast the data explicitly
      88584396
  23. 28 7月, 2022 1 次提交
  24. 27 7月, 2022 1 次提交
  25. 26 7月, 2022 1 次提交
  26. 22 7月, 2022 1 次提交
    • A
      [CustomDevice] register Copy for custom device (#44200) · 3b0aa75e
      Aganlengzi 提交于
      * [CustomDevice] register Copy for custom device
      
      * [CustomDevice] register Copy for custom device
      
      * [CustomDevice] register Copy for custom device
      
      * merge and add uts
      
      * merge and add uts
      
      * fix for blocking and unittests coverage
      3b0aa75e
  27. 19 7月, 2022 1 次提交
  28. 18 7月, 2022 1 次提交
  29. 15 7月, 2022 1 次提交
  30. 14 7月, 2022 2 次提交