- 14 Feb 2022 (2 commits)
Committed by liutiexing
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * add log for Executor * Avoid thread reconstruction when EOF Co-authored-by: liutiexing <liutiexing@google.com>
Committed by Chen Weihang
* add has_attr for arg map context * skip useless attr now * skip attr if not exists * fix typo
- 11 Feb 2022 (2 commits)
Committed by liutiexing
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * add log for Executor Co-authored-by: liutiexing <liutiexing@google.com>
Committed by Leo Chen
- 08 Feb 2022 (1 commit)
Committed by Chen Weihang
* adapt selectedrows in execution * impl selected rows branch * support selectedrow in infershape utils * fix device compile failed * fix new exe test failed * revert some changes
- 30 Jan 2022 (1 commit)
Committed by Leo Chen
- 28 Jan 2022 (2 commits)
Committed by liutiexing
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * split template * Add Profiler and HostTracer * update * update * update * update * fix cmake Co-authored-by: liutiexing <liutiexing@google.com>
Committed by liutiexing
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * Set thread name for WorkQueue Co-authored-by: liutiexing <liutiexing@google.com>
- 26 Jan 2022 (1 commit)
Committed by Leo Chen
* update cmake file to remove fluid kernel * add pten declaration.h to where pybind.h used * fix sync_bn and tensorrt_engine * refine detection_library * fix interpreter_core * support eager legacy * fit eager legacy for pten * fall back to cpu if not found kernel * fix compile problem * fix compile problem * refine fallback logic * fit operator.run() * fix xpu compile * fit for new_exec * add REGISTER_OP_WITHOUT_GRADIENT * un-cache pt_kernel_context * fix compile * fix cudnn * fix compiling with on_infer * fix mkldnn * fix isfinite_v2 * fix xpu problem * fix op_device * refine fallback for xpu * fix xpu compile * merge develop * refine code format * fix compile * fix compile * add data_transfer * fix PreparePtenData * fix cpu context * merge develop * fix compile * fix error device context * fix xpu * fix dev_ctx
- 25 Jan 2022 (1 commit)
Committed by Weilong Wu
* Added selected_rows and rw_lock to pten * Renamed the unit test target to fix CI * Removed Class SelectedRows in Fluid, changed include/cmake relationship, use pten::SelectedRows in Fluid * Remove rw_lock.h,rw_lock_test.cc in fluid * Use pten::RWLock and pten::AutoRDLock, fix CI * Use pten::SelectedRows * Use pten::SelectedRows * Fix to pass NPU CI * Use pten::SelectedRows, to pass NPU CI * To fix NPU CI * To fix NPU CI again
- 17 Jan 2022 (1 commit)
Committed by Wilber
* add pten::Place data structure. * update ci problem * fix ci problem * update * using platform::Place=pten::Place * remove BOOST_GET_CONST for CPUPlace and GPUPlace * compile pass 25%. * compile pass 45% * compile pass 60% * remove boost_get for xpu npu mlu and ipu * compile pass on cpu and gpu. * fix compile problem * fix compile error. * update * fix ci problem * update * ci approve * fix ci problem * fix ci eager test problem * remove BOOST_GET_CONST * fix npu compile
- 15 Jan 2022 (1 commit)
Committed by Chen Weihang
* remove cached kernel context * revert dataloader format change
- 13 Jan 2022 (1 commit)
Committed by Chen Weihang
* fix mkldnn invalid infershape * add unittest for mkldnn in new executor * add import os
- 10 Jan 2022 (3 commits)
Committed by liutiexing
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * profiler skeleton * update * update * update Co-authored-by: liutiexing <liutiexing@google.com>
Committed by Zhanlue Yang
* Added shared_ptr<Allocation> member & corresponding interfaces to Storage * Removed original pten::Allocation from Storage and adjusted the interfaces accordingly * Fixed issues with storage offset * Used place to malloc allocation for TensorStorage * [Unify Tensors PR #3] Ported framework::Tensor interfaces to pten::DenseTensor * Fixed issues with place * Added comments * Moved mutable_data with stream argument to DenseTensor * Added set_offset interface * Fixed CI issues, test=allcases * [Unify Tensors PR #4] Port LoDTensor interfaces to DenseTensor * Removed friend class EigenTensor/EigenMatrix/EigenVector from Tensor * Modified framework::Tensor to inherit from DenseTensor * Reverted changes to pten_layout() interface * Removed friend classes * Rearranged cfunction calls from tensor.data<void>() to tensor.data() * Fixed CI issues * Fixed lite issues * Fixed data() interface issues, test=allcases * Resolved IsInitialized() issues * Fixed ResetHolder() issues * Fixed MKLDNN & Storage issues * Resolved ShareBufferWith() issues * Fixed LoD issues
Committed by Chen Weihang
* unify infer_shape func calling * support set grad infer shape fn for custom op * unify infershape in new executor and eager * remove todo comment * revert infershape in operator
- 07 Jan 2022 (1 commit)
Committed by Leo Chen
- 30 Dec 2021 (1 commit)
Committed by xiongkun
* fix wait for tiexing * fix word2vec model. new_exe now supports EOF Exception in ReadOp
- 28 Dec 2021 (2 commits)
Committed by From00
* fix reshape move storage error * remove needless set type * alloc tensor by shared storage * Utilize StreamSafeCUDAAllocator to support fast GC in new executor * Fix compile error for Windows and ROCm * Fix compile error for Windows * Modify UT stream_safe_cuda_alloc_test * Modify UT stream_safe_cuda_alloc_test * Rewrite fast GC * Rewrite fast GC * Fix compile error for BOOST_GET_CONST * Fix compile error for BOOST_GET_CONST * Changes default stream for StreamSafeCUDAAllocator * Fix a small CI error * Remove some redundant code * Fix conflict * Fix compile error for ROCm * Fix Windows CI error * Fix CI error * Remove some unnecessary code * Fix CI error * Add UT for fast GC * Fix CI error * add device-agnostic stream class * add stream.h * fix ut * fix cpu compile * Use RWLock in GetAllocator * Fix CI error Co-authored-by: Chen Weihang <chenweihang@baidu.com> Co-authored-by: zhiqiu <chenqiuliang@baidu.com>
Committed by Leo Chen
* add completion_nofifier * fix bug * unregist event waiter
- 23 Dec 2021 (1 commit)
Committed by liutiexing
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * update EventsWaiter * fix * split workqueue files * add more tests * fix * bugfix * bugfix * update Co-authored-by: liutiexing <liutiexing@google.com>
- 16 Dec 2021 (1 commit)
Committed by Leo Chen
* fix cmake * not check execution time
- 15 Dec 2021 (1 commit)
Committed by Leo Chen
* refine test * add download_program target * update ut code * refine code * disable profiler * add comments * refine cmake * skip coverage ci
- 14 Dec 2021 (1 commit)
Committed by Aurelius84
- 13 Dec 2021 (1 commit)
Committed by xiongkun
* fix single card 8 unittests in new executor * fix * fix
- 06 Dec 2021 (1 commit)
Committed by Leo Chen
* enable sequential_run for standalone_executor * add ut * fix ut
- 03 Dec 2021 (2 commits)
- 02 Dec 2021 (1 commit)
Committed by Leo Chen
- 01 Dec 2021 (1 commit)
Committed by liutiexing
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * update HostEventTracer * update HostEventTracer * fix c++17 * update * update * update * update * fix bug Co-authored-by: liutiexing <liutiexing@google.com>
- 30 Nov 2021 (1 commit)
Committed by xiongkun
* add scope_guard * 1. fix control flow cases 2. fix calc_gradient
- 29 Nov 2021 (2 commits)
Committed by wanghuancoder
Committed by wanghuancoder
* support fetch lodtensor array, test=develop * refine, test=develop * refine, test=develop * refine, test=develop
- 26 Nov 2021 (1 commit)
Committed by wanghuancoder
* clear local scope every step, test=develop * refine, test=develop * refine, test=develop
- 25 Nov 2021 (1 commit)
Committed by xiongkun
* clear LoDTensorArray * fix bugs * fix * fix gpu
- 24 Nov 2021 (3 commits)
Committed by Aurelius84
Committed by Leo Chen
Committed by Aurelius84
- 23 Nov 2021 (2 commits)
Committed by wanghuancoder
Committed by Leo Chen
* sync scope and variable_scope when init executor * set var_desc for new var