1. 04 8月, 2022 1 次提交
  2. 29 7月, 2022 1 次提交
    • L
      move CUDAStream to phi (#44529) · da3743fd
      Leo Chen 提交于
      * init
      
      * move CUDAStream to phi
      
      * fix compilation
      
      * merge develop
      
      * add stream_owned_ member
      
      * split cuda_stream.h
      
      * fix cpu compile
      
      * fix constructor
      
      * fix bug
      
      * fix windows compile
      
      * fix inference test_levit
      
      * fix windows tests
      da3743fd
  3. 19 7月, 2022 1 次提交
  4. 15 7月, 2022 1 次提交
  5. 14 7月, 2022 2 次提交
    • L
      refine allocation cmake (#44241) · dc5a0420
      Leo Chen 提交于
      * build into one static library
      
      * move memory/detail to memory/allocation
      
      * fix bug
      
      * fix profiler
      
      * fix framework_proto
      
      * fix deps
      
      * fix inference compilation
      
      * fix rocm compile
      
      * follow comments
      
      * fix buddy_allocator_test
      dc5a0420
    • Y
      [Phi]Improve the mechanism for mkldnn kernel in PHI (#43941) · e9b4d0be
      YuanRisheng 提交于
      * adapt mkldnn kernel in PHI
      
      * fix ci compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * delete comment
      
      * fix compile bugs in windows-inference
      
      * delete code for converage
      
      * modify code by review
      
      * modify code by review
      
      * add todo
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix unittest bugsx
      e9b4d0be
  6. 11 7月, 2022 1 次提交
  7. 07 7月, 2022 1 次提交
  8. 24 6月, 2022 2 次提交
  9. 13 6月, 2022 1 次提交
  10. 04 6月, 2022 1 次提交
  11. 12 5月, 2022 2 次提交
    • S
      Fix some typos in paddle/. (#42408) · 2012672c
      Shuangchi He 提交于
      2012672c
    • z8hanghuan's avatar
      add xpu buffer_reader, *test=kunlun (#42578) · cc343a41
      z8hanghuan 提交于
      * add xpu buffer_reader, *test=kunlun
      
      * xpu buffer_reader, use XPUDeviceGuard, *test=kunlun
      
      * modify xpu.cmake, *test=kunlun
      
      * modify xpu.cmake, *test=kunlun
      
      * modify xpu.cmake, *test=kunlun
      
      * add xpu buffer_reader, *test=kunlun
      
      * add xpu buffer reader, *test=kunlun
      
      * add xpu buffer reader, *test=kunlun
      cc343a41
  12. 14 4月, 2022 1 次提交
    • Y
      [Phi] Support construct Scalar by using Non-CPU Tensor (#41765) · 54ccc308
      YuanRisheng 提交于
      * support construct scalar using non-cpu tensor
      
      * fix bugs when run unittest
      
      * fix compile bugs
      
      * fix bugs when run ci
      
      * fix compile bugs
      
      * fix bugs when move copy
      
      * perfect unit test
      
      * perfect unittest
      
      * update according to comment
      
      * add target dependency
      
      * deal with conflict
      
      * fix bugs when run unit test
      
      * fix unit test bugs
      54ccc308
  13. 13 4月, 2022 2 次提交
  14. 31 3月, 2022 1 次提交
    • C
      Maintain old profiler (#41132) · a6bf2218
      chenjian 提交于
      * no
      
      * maintain old profiler
      
      * exclude new python record events for old profiler
      
      * maintain old profiler
      
      * maintain
      
      * maintain old profiler
      
      * maintain
      
      * fix cmakes
      a6bf2218
  15. 21 3月, 2022 1 次提交
  16. 08 3月, 2022 1 次提交
  17. 03 3月, 2022 1 次提交
  18. 24 2月, 2022 1 次提交
  19. 23 2月, 2022 1 次提交
  20. 22 2月, 2022 1 次提交
  21. 21 2月, 2022 1 次提交
  22. 14 2月, 2022 1 次提交
  23. 06 2月, 2022 1 次提交
  24. 30 1月, 2022 1 次提交
  25. 28 1月, 2022 1 次提交
  26. 27 1月, 2022 2 次提交
    • Q
      [MLU] add compile ci scripts for MLU, test=mlu_ci (#39122) · 56410b4a
      Qi Li 提交于
      56410b4a
    • A
      [PluggableDevice] Add custom kernel support based on pten kernel management (#38848) · a8879215
      Aganlengzi 提交于
      * [Demo] custom kernel based on pten kernel
      
      * merge and npu custom work well
      
      * del comments
      
      * delete other code
      
      * fix CUDAContext
      
      * fix not found small_vector.h
      
      * support NPU
      
      * fix NPUContext
      
      * fix DeviceContext support
      
      * add UT
      
      * fix call
      
      * add UT
      
      * fix
      
      * fix for comments and ut
      
      * add MACRO control
      
      * fix multi input output
      
      * support env CUSTOM_DEVICE_ROOT
      
      * deal with special cases
      
      * fix for Windows
      
      * try coverage with test_custom_kernel_dot.py
      
      * fix test_custom_kernel_dot
      
      * fix test_custom_kernel_dot
      
      * fix merge
      
      * fix merge
      
      * fix CI
      
      * update
      
      * merge and fix
      
      * remove WITH_CUSTOM_KERNEL
      
      * fix merge
      
      * merge and fix
      
      * fix ut
      
      * fix ut for mac
      
      * add more UT
      
      * add more UT
      
      * fix
      a8879215
  27. 26 1月, 2022 2 次提交
  28. 25 1月, 2022 2 次提交
  29. 21 1月, 2022 1 次提交
    • W
      [PTEN] Add cpu context (#38979) · 064bc4b8
      Wilber 提交于
      * add cpu_context.
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * fix ci problem
      
      * fix npu ci problem
      
      * update
      
      * fix ci compile
      064bc4b8
  30. 17 1月, 2022 1 次提交
    • W
      [Pten] Replace platform::Place to pten::Place. (#38899) · c48a9ad5
      Wilber 提交于
      * add pten::Place data structure.
      
      * update ci problem
      
      * fix ci problem
      
      * update
      
      * using platform::Place=pten::Place
      
      * remove BOOST_GET_CONST for CPUPlace and GPUPlace
      
      * compile pass 25%.
      
      * compile pass 45%
      
      * compile pass 60%
      
      * remove boost_get for xpu npu mlu and ipu
      
      * compile pass on cpu and gpu.
      
      * fix compile problem
      
      * fix compile error.
      
      * update
      
      * fix ci problem
      
      * update
      
      * ci approve
      
      * fix ci problem
      
      * fix ci eager test problem
      
      * remove BOOST_GET_CONST
      
      * fix npu compile
      c48a9ad5
  31. 12 1月, 2022 1 次提交
    • L
      Os info (#38779) · 0d8d1e0e
      liutiexing 提交于
      * add align for WorkQueue
      
      * add spinlock
      
      * merge develop
      
      * merge
      
      * Add EventsWaiter
      
      * Revert "Add EventsWaiter"
      
      This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.
      
      * os_info update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * fix
      
      * update
      
      * update for windows
      
      * fix windows
      
      * update
      
      * update
      Co-authored-by: Nliutiexing <liutiexing@google.com>
      0d8d1e0e
  32. 10 1月, 2022 1 次提交
    • L
      Profiler skeleton (#38826) · a8afed69
      liutiexing 提交于
      * add align for WorkQueue
      
      * add spinlock
      
      * merge develop
      
      * merge
      
      * Add EventsWaiter
      
      * Revert "Add EventsWaiter"
      
      This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.
      
      * profiler skeleton
      
      * update
      
      * update
      
      * update
      Co-authored-by: Nliutiexing <liutiexing@google.com>
      a8afed69
  33. 29 12月, 2021 1 次提交
    • L
      Make profiler better (#38280) · 851637fd
      liutiexing 提交于
      * add align for WorkQueue
      
      * add spinlock
      
      * merge develop
      
      * merge
      
      * Add EventsWaiter
      
      * Revert "Add EventsWaiter"
      
      This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.
      
      * update OS info
      
      * split host_event_recorder
      
      * split host_event_recorder
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      Co-authored-by: Nliutiexing <liutiexing@google.com>
      851637fd