- 02 Apr, 2022 2 commits
- 01 Apr, 2022 1 commit
Committed by Leo Chen
* move WaitEvent/RecordEvent into try-catch * refine supportNpu
- 31 Mar, 2022 1 commit
Committed by Leo Chen
* fix bug that some op has no op_role attr * add mkldnn support for new executor * fit for mkldnn data_transfer * fit for mkldnn data_transfer
- 27 Mar, 2022 2 commits
Committed by Leo Chen
* fit for mkldnn and inplace op * fix compile * refine ut * register op version * fix inplace op * fix transfer_layout
Committed by From00
* Make StreamSafeCUDAAllocator compatible with NaiveBestFit strategy * Set FLAGS_use_stream_safe_cuda_allocator to false * Update * Remove unnecessary code * Fix CI errors * Add UT
- 24 Mar, 2022 1 commit
Committed by liutiexing
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Add EventsWaiter * update * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * update * update Error MSG * update EventsWaiter * update Co-authored-by: liutiexing <liutiexing@google.com>
- 23 Mar, 2022 1 commit
Committed by Leo Chen
- 22 Mar, 2022 1 commit
Committed by Leo Chen
* async prepare deps * fix bug that std::future is not set * add ut * refine code * fix standalone ut * disable prof
- 28 Feb, 2022 2 commits
Committed by liutiexing
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * Add host_trace_level env variable * Revert "Optimize perf of softmax_with_cross_entropy (#39553)" This reverts commit bbe5228c. Co-authored-by: liutiexing <liutiexing@google.com> Co-authored-by: ZzSean <18818272991@163.com>
Committed by Chen Weihang
* rename pten_utils to phi_utils * rename pten_utils target * rename Pten to Phi * replace pten with phi * resolve conflict
- 21 Feb, 2022 1 commit
Committed by chenjian
* fix RecordEvent interface * modify default level to 4 * update interface use * add const default trace level * update record event interface using * update record event interface using * update operator.cc * update part2 * update part1 * fix include profiler.h header in ps server * fix include profiler.h header in ps server * fix profiler.h header
- 20 Feb, 2022 1 commit
Committed by Chen Weihang
* rename pten dir to phi * rename namespace to phi * rename infrt pten dir to phi * resolve conflict * rename pten to phi in cmake * revert all infrt change * change needed files * fix infrt failed * fix inference failed
- 17 Feb, 2022 1 commit
Committed by Leo Chen
* relocate code of interpretercore gc
- 14 Feb, 2022 1 commit
Committed by liutiexing
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * add log for Executor * Avoid thread reconsruction when EOF Co-authored-by: liutiexing <liutiexing@google.com>
- 11 Feb, 2022 1 commit
Committed by liutiexing
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * add log for Executor Co-authored-by: liutiexing <liutiexing@google.com>
- 08 Feb, 2022 1 commit
Committed by Chen Weihang
* adapt selectedrows in execution * impl selected rows branch * support selectedrow in infershape utils * fix device compile failed * fix new exe test failed * revert some changes
- 26 Jan, 2022 1 commit
Committed by Leo Chen
* update cmake file to remove fluid kernel * add pten declaration.h to where pybind.h used * fix sync_bn and tensorrt_engine * refine detection_library * fix interpreter_core * support eager legacy * fit eager legacy for pten * fall back to cpu if not found kernel * fix compile problem * fix compile problem * refine fallback logic * fit operator.run() * fix xpu compile * fit for new_exec * add REGISTER_OP_WITHOUT_GRADIENT * un-cache pt_kernel_context * fix compile * fix cudnn * fix compiling with on_infer * fix mkldnn * fix isfinite_v2 * fix xpu problem * fix op_device * refine fallback for xpu * fix xpu compile * merge develop * refine code format * fix compile * fix compile * add data_transfer * fix PreparePtenData * fix cpu context * merge develop * fix compile * fix error device context * fix xpu * fix dev_ctx
- 25 Jan, 2022 1 commit
Committed by Weilong Wu
* Added selected_rows and rw_lock to pten * Renamed the unit test target to fix CI * Removed Class SelectedRows in Fluid, changed include/cmake relationship, use pten::SelectedRows in Fluid * Remove rw_lock.h,rw_lock_test.cc in fluid * Use pten::RWLock and pten::AutoRDLock, fix CI * Use pten::SelectedRows * Use pten::SelectedRows * Fix to pass NPU CI * Use pten::SelectedRows, to pass NPU CI * To fix NPU CI * To fix NPU CI again
- 15 Jan, 2022 1 commit
Committed by Chen Weihang
* remove cached kernel context * revert dataloader format change
- 07 Jan, 2022 1 commit
Committed by Leo Chen
- 30 Dec, 2021 1 commit
Committed by xiongkun
* fix wait for tiexing * fix work2vec model. new_exe support EOF Exception in ReadOp now
- 28 Dec, 2021 2 commits
Committed by From00
* fix reshape move storage error * remove needless set type * alloc tensor by shared storage * Utilize StreamSafeCUDAAllocator to support fast GC in new executor * Fix compile error for Windows and ROCm * Fix compile error for Windows * Modify UT stream_safe_cuda_alloc_test * Modify UT stream_safe_cuda_alloc_test * Rewrite fast GC * Rewrite fast GC * Fix compile error for BOOST_GET_CONST * Fix compile error for BOOST_GET_CONST * Changes default stream for StreamSafeCUDAAllocator * Fix a small CI error * Remove some redundant code * Fix conflict * Fix compile error for ROCm * Fix Windoes CI error * Fix CI error * Remove some unnecessary code * Fix CI error * Add UT for fast GC * Fix CI error * add device-agnostic stream class * add stream.h * fix ut * fix cpu compile * Use RWLock in GetAllocator * Fix CI error Co-authored-by: Chen Weihang <chenweihang@baidu.com> Co-authored-by: zhiqiu <chenqiuliang@baidu.com>
Committed by Leo Chen
* add completion_nofifier * fix bug * unregist event waiter
- 23 Dec, 2021 1 commit
Committed by liutiexing
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * update EventsWater * fix * split workqueue files * add more tests * fix * bugfix * bugfix * update Co-authored-by: liutiexing <liutiexing@google.com>
- 03 Dec, 2021 2 commits
- 01 Dec, 2021 1 commit
Committed by liutiexing
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * update HostEventTracer * update HostEventTracer * fix c++17 * update * update * update * update * fix bug Co-authored-by: liutiexing <liutiexing@google.com>
- 26 Nov, 2021 1 commit
Committed by wanghuancoder
* clear local scope every setp, test=develop * refine,test=develop * refine, test=develop
- 25 Nov, 2021 1 commit
Committed by xiongkun
* clear LoDTensorArray * fix bugs * fix * fix gpu
- 24 Nov, 2021 1 commit
Committed by Aurelius84
- 23 Nov, 2021 1 commit
Committed by wanghuancoder
- 22 Nov, 2021 1 commit
Committed by Leo Chen
- 17 Nov, 2021 2 commits
- 15 Nov, 2021 2 commits
Committed by Leo Chen
* fix revord_event * refine class Instruction * refine Instruction and InterpreterCore * make instruction and operator_base consistent * support NoNeedBufferVar in stream_analyzer * fix place of event * add vlog before continue
Committed by wanghuancoder
* fix 3 bug, test=develop * refine, test=develop
- 12 Nov, 2021 1 commit
Committed by Leo Chen
* split declaration and implementation * remove initdevices * refine VariableMetaInfo * add ut * fix compile
- 10 Nov, 2021 2 commits
Committed by Aurelius84
* Simplify constructor of InterpreterCore * fix bool * clean code
Committed by wanghuancoder
* cancle thread when exit, test=develop * gc to unique_ptr, test=develop * refine, test=develop * fix namespace, test=develop