1. 02 4月, 2022 2 次提交
  2. 01 4月, 2022 1 次提交
  3. 31 3月, 2022 1 次提交
    • L
      [new-exec] fit mkldnn op (#41058) · 02cf6764
      Leo Chen 提交于
      * fix bug that some op has no op_role attr
      
      * add mkldnn support for new executor
      
      * fit for mkldnn data_transfer
      
      * fit for mkldnn data_transfer
      02cf6764
  4. 27 3月, 2022 2 次提交
  5. 24 3月, 2022 1 次提交
    • L
      Refine events waiter (#40876) · 36ee6dd3
      liutiexing 提交于
      * add align for WorkQueue
      
      * add spinlock
      
      * merge develop
      
      * merge
      
      * Add EventsWaiter
      
      * Add EventsWaiter
      
      * update
      
      * Revert "Add EventsWaiter"
      
      This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.
      
      * update
      
      * update Error MSG
      
      * update EventsWaiter
      
      * update
      Co-authored-by: Nliutiexing <liutiexing@google.com>
      36ee6dd3
  6. 23 3月, 2022 1 次提交
  7. 22 3月, 2022 1 次提交
  8. 28 2月, 2022 2 次提交
  9. 21 2月, 2022 1 次提交
    • C
      Update record interface using part2 (#39694) · c984cd85
      chenjian 提交于
      * fix RecordEvent interface
      
      * modify default level to 4
      
      * update interface use
      
      * add const default trace level
      
      * update record event interface using
      
      * update record event interface using
      
      * update operator.cc
      
      * update part2
      
      * update part1
      
      * fix include profiler.h header in ps server
      
      * fix include profiler.h header in ps server
      
      * fix profiler.h header
      c984cd85
  10. 20 2月, 2022 1 次提交
  11. 17 2月, 2022 1 次提交
  12. 14 2月, 2022 1 次提交
  13. 11 2月, 2022 1 次提交
  14. 08 2月, 2022 1 次提交
  15. 26 1月, 2022 1 次提交
    • L
      [pten] remove deprecated fluid op kernel for pten (#38842) · 3ab9aef1
      Leo Chen 提交于
      * update cmake file to remove fluid kernel
      
      * add pten declaration.h to where pybind.h used
      
      * fix sync_bn and tensorrt_engine
      
      * refine detection_library
      
      * fix interpreter_core
      
      * support eager legacy
      
      * fit eager legacy for pten
      
      * fall back to cpu if not found kernel
      
      * fix compile problem
      
      * fix compile problem
      
      * refine fallback logic
      
      * fit operator.run()
      
      * fix xpu compile
      
      * fit for new_exec
      
      * add REGISTER_OP_WITHOUT_GRADIENT
      
      * un-cache pt_kernel_context
      
      * fix compile
      
      * fix cudnn
      
      * fix compiling with on_infer
      
      * fix mkldnn
      
      * fix isfinite_v2
      
      * fix xpu problem
      
      * fix op_device
      
      * refine fallback for xpu
      
      * fix xpu compile
      
      * merge develop
      
      * refine code format
      
      * fix compile
      
      * fix compile
      
      * add data_transfer
      
      * fix PreparePtenData
      
      * fix cpu context
      
      * merge develop
      
      * fix compile
      
      * fix error device context
      
      * fix xpu
      
      * fix dev_ctx
      3ab9aef1
  16. 25 1月, 2022 1 次提交
    • W
      [Move selected_rows PR #3] Change the relationship of [include/Cmake]. (#39128) · 2bafd338
      Weilong Wu 提交于
      * Added selected_rows and rw_lock to pten
      
      * Renamed the unit test target to fix CI
      
      * Removed Class SelectedRows in Fluid, changed include/cmake relationship, use pten::SelectedRows in Fluid
      
      * Remove rw_lock.h,rw_lock_test.cc in fluid
      
      * Use pten::RWLock and pten::AutoRDLock, fix CI
      
      * Use pten::SelectedRows
      
      * Use pten::SelectedRows
      
      * Fix to pass NPU CI
      
      * Use pten::SelectedRows, to pass NPU CI
      
      * To fix NPU CI
      
      * To fix NPU CI again
      2bafd338
  17. 15 1月, 2022 1 次提交
  18. 07 1月, 2022 1 次提交
  19. 30 12月, 2021 1 次提交
  20. 28 12月, 2021 2 次提交
    • F
      Utilize StreamSafeCUDAAllocator to support fast GC in new executor (#37642) · 0c7153a4
      From00 提交于
      * fix reshape move storage error
      
      * remove needless set type
      
      * alloc tensor by shared storage
      
      * Utilize StreamSafeCUDAAllocator to support fast GC in new executor
      
      * Fix compile error for Windows and ROCm
      
      * Fix compile error for Windows
      
      * Modify UT stream_safe_cuda_alloc_test
      
      * Modify UT stream_safe_cuda_alloc_test
      
      * Rewrite fast GC
      
      * Rewrite fast GC
      
      * Fix compile error for BOOST_GET_CONST
      
      * Fix compile error for BOOST_GET_CONST
      
      * Changes default stream for StreamSafeCUDAAllocator
      
      * Fix a small CI error
      
      * Remove some redundant code
      
      * Fix conflict
      
      * Fix compile error for ROCm
      
      * Fix Windoes CI error
      
      * Fix CI error
      
      * Remove some unnecessary code
      
      * Fix CI error
      
      * Add UT for fast GC
      
      * Fix CI error
      
      * add device-agnostic stream class
      
      * add stream.h
      
      * fix ut
      
      * fix cpu compile
      
      * Use RWLock in GetAllocator
      
      * Fix CI error
      Co-authored-by: NChen Weihang <chenweihang@baidu.com>
      Co-authored-by: Nzhiqiu <chenqiuliang@baidu.com>
      0c7153a4
    • L
      [new-exec] add completion_nofifier (#38447) · 404a4a6a
      Leo Chen 提交于
      * add completion_nofifier
      
      * fix bug
      
      * unregist event waiter
      404a4a6a
  21. 23 12月, 2021 1 次提交
    • L
      Upgrade work queue (#38335) · 198d11be
      liutiexing 提交于
      * add align for WorkQueue
      
      * add spinlock
      
      * merge develop
      
      * merge
      
      * Add EventsWaiter
      
      * Revert "Add EventsWaiter"
      
      This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.
      
      * update EventsWater
      
      * fix
      
      * split workqueue files
      
      * add more tests
      
      * fix
      
      * bugfix
      
      * bugfix
      
      * update
      Co-authored-by: Nliutiexing <liutiexing@google.com>
      198d11be
  22. 03 12月, 2021 2 次提交
  23. 01 12月, 2021 1 次提交
    • L
      HostEventRecorder (#37629) · feda7c1d
      liutiexing 提交于
      * add align for WorkQueue
      
      * add spinlock
      
      * merge develop
      
      * merge
      
      * Add EventsWaiter
      
      * Revert "Add EventsWaiter"
      
      This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.
      
      * update HostEventTracer
      
      * update HostEventTracer
      
      * fix c++17
      
      * update
      
      * update
      
      * update
      
      * update
      
      * fix bug
      Co-authored-by: Nliutiexing <liutiexing@google.com>
      feda7c1d
  24. 26 11月, 2021 1 次提交
  25. 25 11月, 2021 1 次提交
  26. 24 11月, 2021 1 次提交
  27. 23 11月, 2021 1 次提交
  28. 22 11月, 2021 1 次提交
  29. 17 11月, 2021 2 次提交
  30. 15 11月, 2021 2 次提交
    • L
      [new-exec] fix stream analysis (#37161) · 584b4b24
      Leo Chen 提交于
      * fix revord_event
      
      * refine class Instruction
      
      * refine Instruction and InterpreterCore
      
      * make instruction and operator_base consistent
      
      * support NoNeedBufferVar in stream_analyzer
      
      * fix place of event
      
      * add vlog before continue
      584b4b24
    • W
      fix 3 bug of new_executor (#37142) · 8358d614
      wanghuancoder 提交于
      * fix 3 bug, test=develop
      
      * refine, test=develop
      8358d614
  31. 12 11月, 2021 1 次提交
    • L
      Refine new executor (#37074) · 1fe4513c
      Leo Chen 提交于
      * split declaration and implementation
      
      * remove initdevices
      
      * refine VariableMetaInfo
      
      * add ut
      
      * fix compile
      1fe4513c
  32. 10 11月, 2021 2 次提交