1. 21 3月, 2023 1 次提交
    • iSerendipity's avatar
      [PHI decoupling] Move DataType* from paddle:experimental to phi namespace (#51716) · 4638a62e
      iSerendipity 提交于
      * move DataType from paddle::experimental to phi
      
      * convert namespace
      
      * convert namespace
      
      * convert namespace
      
      * clarify namespace
      
      * convert more datatype
      
      * Revert "convert more datatype"
      
      This reverts commit 083b462959e6a22d4d8767707b628b95b396642e.
      
      * convert more in auto_code_generator
      
      * fix conflicts for XPU
      
      * fix namespace conflicts
      
      * fix errors
      
      * Revert "fix errors"
      
      This reverts commit f9d9958b54ee32141112274c8a5c3c381ab0f876.
      
      * fix errors
      
      * fix formatting
      4638a62e
  2. 09 3月, 2023 1 次提交
    • TaoTao Li's avatar
      Add comm context manager, add phi broadcast op (#51072) · c191b707
      TaoTao Li 提交于
      * * add comm context for device context
      
      * add broadcast phi operator kernel and api
      
      * add broadcast support dtype, update ut
      
      * fix broadcast bfloat16 type
      
      * fix ut
      
      * update test_collective_broadcast_api timeout to 300
      c191b707
  3. 28 12月, 2022 1 次提交
    • L
      [new-exec] Ahead-Of-Time choosing kernel (#48789) · 63d2d722
      Leo Chen 提交于
      * add skip run
      
      * alloc minimum memory
      
      * skip check_size in Alloc
      
      * skip check_size in Alloc
      
      * skip check_size in Alloc
      
      * fix cases when tensor is initialized or empty
      
      * alloc empty output for place info
      
      * add test
      
      * increase timeout
      
      * format code
      
      * skip cpu
      
      * add cudnn_deterministic
      
      * fit for hostAlloc
      
      * follow comments
      
      * change check_size to fake_alloc
      63d2d722
  4. 18 11月, 2022 1 次提交
  5. 01 11月, 2022 1 次提交
    • C
      Adapting device-specific Extra Attributes for the PHI kernel (#46342) · c923e6c9
      Chen Weihang 提交于
      * add extra attr property set
      
      * add type_info for all context
      
      * add onednn context to all context
      
      * fix context compile error
      
      * simplify conv kernel args
      
      * pass runtime attr into dev_ctx
      
      * fix marco error
      
      * clear conv_grad_kernel extra args
      
      * merge conv_grad_grad into conv_grad
      
      * clear conv2d_grad_grad extra attrs
      
      * clear yaml and eager extra attr
      
      * fix conv1d error
      
      * change to thread local
      
      * fix npu compile failed
      
      * try to fix windows compile failed
      
      * add conv2d onednn phi kernel
      
      * fix ci bugs (#36)
      
      * fix compile bugs (#38)
      
      * fix extra input transform bug (#39)
      
      * support dynamic created attr (#40)
      
      * reset extra info gen code
      
      * rm conv_grad_grad kernel
      
      * reimpl pass attr adapting
      
      * add int attr support
      
      * remove vector inputnames creating
      
      * fix map at error
      
      * Update paddle/phi/kernels/onednn/conv_grad_kernel.cc
      Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
      
      * remove useless extra attrs
      
      * replace mkldnn_engine by onednn_engine
      Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
      Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
      c923e6c9
  6. 05 9月, 2022 1 次提交
    • H
      [phi] Migrate memcpy kernel to PHI, hold NPU op (#45622) · 2f19a364
      HongyuJia 提交于
      * migrate memcpy to phi
      
      * fix typo error
      
      * fix typo error
      
      * fix  bug and testcase
      
      * fix typo, uniform_random_kernel.cc header
      
      * fix Alloc pinned bug
      
      * change GPUContext::GetPinnedPlace
      
      * add GetPinnedPlace function
      
      * add GetPinnedPlace function
      
      * restore default throw error
      
      * fix Unimplemented error
      
      * skip StandaloneExecutor testcase
      
      * delete memcpy_sig
      2f19a364
  7. 07 7月, 2022 1 次提交
  8. 05 6月, 2022 1 次提交
  9. 12 4月, 2022 1 次提交
    • C
      [CustomOp] Add context pool unittests (#41085) · 59ec9599
      Chen Weihang 提交于
      * add context pool unittests
      
      * fix timeout
      
      * polish details
      
      * change option pos
      
      * add dll decl for wndows
      
      * fix pre-commit error
      
      * move dll_decl and export DeviceContext
      
      * replace lost dll_decl.h
      59ec9599
  10. 01 4月, 2022 1 次提交
    • W
      [Eager] Support pinned (#41035) · f3270fc8
      wanghuancoder 提交于
      * support pinned, test=develop
      
      * support async_write, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine,test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      f3270fc8
  11. 23 2月, 2022 1 次提交
  12. 20 2月, 2022 1 次提交
  13. 18 2月, 2022 1 次提交
  14. 17 2月, 2022 1 次提交
  15. 14 2月, 2022 1 次提交
  16. 06 2月, 2022 1 次提交
  17. 27 1月, 2022 1 次提交
  18. 25 1月, 2022 1 次提交
  19. 24 1月, 2022 1 次提交
    • [Refactoring Tensor PR #5] replace storage with pten allocation (#39085) · a56e16a7
      石晓伟 提交于
      * updates callers, test=develop
      
      * updates tensor, test=develop
      
      * fixes errors, test=develop
      
      * remove some dtypes, test=develop
      
      * fix errors in the base storage modification, test=develop
      
      * fixes a bug, test=develop
      
      * fixes the bugs in push the whole, test=develop
      
      * updates, test=develop
      
      * update
      
      * update, test=develop
      
      * fixes the mac-py3 CI, test=develop
      
      * remove the storage impl, test=develop
      
      * updates some codes, test=develop
      
      * update, test=develop
      
      * updates pten allocation, test=develop
      a56e16a7
  20. 21 1月, 2022 1 次提交
    • W
      [PTEN] Add cpu context (#38979) · 064bc4b8
      Wilber 提交于
      * add cpu_context.
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * fix ci problem
      
      * fix npu ci problem
      
      * update
      
      * fix ci compile
      064bc4b8