- 28 1月, 2022 7 次提交
-
-
由 Chen Weihang 提交于
* update forward argument mapping * fix compile failed * fix test failed
-
由 liutiexing 提交于
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * split template * Add Profiler and HostTracer * update * update * update * updateg * fix cmake Co-authored-by: Nliutiexing <liutiexing@google.com>
-
由 liutiexing 提交于
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * Set thread name for WorkQueue Co-authored-by: Nliutiexing <liutiexing@google.com>
-
由 YuanRisheng 提交于
-
由 wenbin 提交于
* slice * shuffle pass enhancement
-
由 Fan Zhang 提交于
* [PSLIB] Add Metrics Module, Support User-defined Add Metric * [PSLIB] Modify According to CI * [PSLIB] Modify According to CI * [PSLIB] Modify According to CI * [PSLIB] Modify According to CI Coverage * [PSLIB] Modify According to CI * [PSLIB] Modify According to CI * [PSLIB] Modify According to CI * [PSLIB] Modify According to CI * [PSLIB] Modify According to CI * [PSLIB] Modify According to CI Coverage * [PSLIB] Modify According to CI Coverage * [PSLIB] Modify According to CI Coverage * modify role_maker * update CMakeLists.txt
-
由 zyfncg 提交于
* remove remake densetensor * fix eager test error * fix bug in eager * implement AllocateFrom * remove WriteBackOutput * fix problem of eager Co-authored-by: Nzkh2016 <zhangkaihuo@baidu.com>
-
- 27 1月, 2022 4 次提交
-
-
由 zyfncg 提交于
* remove remake densetensor * fix eager test error * fix bug in eager
-
由 Aganlengzi 提交于
* [Demo] custom kernel based on pten kernel * merge and npu custom work well * del comments * delete other code * fix CUDAContext * fix not found small_vector.h * support NPU * fix NPUContext * fix DeviceContext support * add UT * fix call * add UT * fix * fix for comments and ut * add MACRO control * fix multi input output * support env CUSTOM_DEVICE_ROOT * deal with special cases * fix for Windows * try coverage with test_custom_kernel_dot.py * fix test_custom_kernel_dot * fix test_custom_kernel_dot * fix merge * fix merge * fix CI * update * merge and fix * remove WITH_CUSTOM_KERNEL * fix merge * merge and fix * fix ut * fix ut for mac * add more UT * add more UT * fix
-
由 wenbin 提交于
* shuffle channel pass * add ut * timeout fix * makefile fix
-
由 Thunderbrook 提交于
* compile for afs api * with pslib
-
- 26 1月, 2022 4 次提交
-
-
由 Leo Chen 提交于
* update cmake file to remove fluid kernel * add pten declaration.h to where pybind.h used * fix sync_bn and tensorrt_engine * refine detection_library * fix interpreter_core * support eager legacy * fit eager legacy for pten * fall back to cpu if not found kernel * fix compile problem * fix compile problem * refine fallback logic * fit operator.run() * fix xpu compile * fit for new_exec * add REGISTER_OP_WITHOUT_GRADIENT * un-cache pt_kernel_context * fix compile * fix cudnn * fix compiling with on_infer * fix mkldnn * fix isfinite_v2 * fix xpu problem * fix op_device * refine fallback for xpu * fix xpu compile * merge develop * refine code format * fix compile * fix compile * add data_transfer * fix PreparePtenData * fix cpu context * merge develop * fix compile * fix error device context * fix xpu * fix dev_ctx
-
由 Allen Guo 提交于
* sync misc changes * apply comments 01 * fix compile error * remove is_ipu_place check * add authors Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: NAllen Guo <alleng@graphcore.ai> Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> Co-authored-by: NHaicheng Jiang <haichengj@graphcore.ai> Co-authored-by: NHan Zhao <hanzhao@graphcore.ai> * sync changes * restore cmake * update ir cmake and setup.py * update inference_lib cmake * split PR Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> Co-authored-by: NHaicheng Jiang <haichengj@graphcore.ai> Co-authored-by: NHan Zhao <hanzhao@graphcore.ai>
-
由 石晓伟 提交于
-
由 Chen Weihang 提交于
* infermeta context init design * support infermeta called in fluid op * add hasattr and attr methods * add dygraah GetVarPtrs support * rename arg_map_context to arg_map_utils * add registry for arg map func * resolve conflit * refactor op utils design * polish meta config * fix details * remove hasattr method * resolve conflit * revert cmake order change * revert some change * change init pos * fix compile faileed * fix typo * fix inference failed * fix windows ccompile failed * polish format Co-authored-by: NWang Huan <wanghuan29@baidu.com>
-
- 25 1月, 2022 4 次提交
-
-
由 yaoxuefeng 提交于
-
由 Weilong Wu 提交于
* Added selected_rows and rw_lock to pten * Renamed the unit test target to fix CI * Removed Class SelectedRows in Fluid, changed include/cmake relationship, use pten::SelectedRows in Fluid * Remove rw_lock.h,rw_lock_test.cc in fluid * Use pten::RWLock and pten::AutoRDLock, fix CI * Use pten::SelectedRows * Use pten::SelectedRows * Fix to pass NPU CI * Use pten::SelectedRows, to pass NPU CI * To fix NPU CI * To fix NPU CI again
-
由 Wilber 提交于
-
由 xiongkun 提交于
* transfer: string tinyformat errors and part of enforce into pten * remove comment * fix by code review * assert is not compile in -DNDEBUG * add string as dependences of paddle_inference
-
- 24 1月, 2022 4 次提交
-
-
由 sneaxiy 提交于
-
由 Jacek Czaja 提交于
* - more unlikely * - compilation fix * - removed redundant definition * - fix * - Fixes * - compilation fix for windows
-
由 Feiyu Chan 提交于
* migration of functors in paddle/fluid/operators/eigen and paddle/fluid/platform/eigen_ext.h * update path of data types like float16.h in includes in extensions.h
-
由 Wilber 提交于
* move dynload from fluid to pten. * fix ci compile * fix windows ci compile. * update * update * fix compile error
-
- 22 1月, 2022 2 次提交
-
-
由 Chen Weihang 提交于
-
由 Chen Weihang 提交于
* add attr for arg map context * add argument fn declare * add attr test for get attr value method * polish details
-
- 21 1月, 2022 5 次提交
-
-
由 chentianyu03 提交于
-
由 Weilong Wu 提交于
-
由 fwenguang 提交于
* [MLU]add mlu ci dockerfile * fix comment * add cncl
-
由 Aurelius84 提交于
* Migrate Dim and DDim from paddle::framework into pten namespace * fix paddle::framework::Array * fix framework::Array
-
由 Wilber 提交于
* add cpu_context. * update * update * update * update * update * fix ci problem * fix npu ci problem * update * fix ci compile
-
- 20 1月, 2022 2 次提交
-
-
由 Aurelius84 提交于
* Migrate bfloat16/float16/complex from platform into pten::common * fix typo * fix code style
-
由 zyfncg 提交于
* remove MakePtenTensor in BuildKernelContext * fix a bug caused by storage * remove WriteBackOutput in dynamic and static mode * fix complie error of std::max * fix complie error of std::max * fix date_type bug * fix memory alloc bug * add some debug info * fix compile problem * fix problem of data_type check * comment out some unreached code
-
- 18 1月, 2022 3 次提交
-
-
由 Sławomir Siwek 提交于
* Mish * Change exp() library * mish fuse pass * mish attrs * fixes * mishop maker * remove attrs * mish kernal for bf16 * fc+mish fuse * fix code format error * Resolve merge conflicts * Update mish operator version * update mish variable to new naming convention
-
由 Zhanlue Yang 提交于
* Merged LoDTensor with Tensor,test=allcases * Patched python level LoDTensor * Patched python level LoDTensor * Merge Tensor into DenseTensor * Fixed namespace issues,test=allcases * Fixed merge issues * Fixed inference issues * Fixed NPU test issues * Fixed merge issues
-
由 zmxdream 提交于
* update * fix ps_gpu_wrapper. test=develop * fix ps_gpu_wrapper. test=develop
-
- 17 1月, 2022 5 次提交
-
-
由 CtfGo 提交于
collecting input variables that only shape needed of each subgraph that compiled by CINN in build_cinn_pass, and expose them to memory optimization of framework passes by declaringDECLARE_INPLACE_OP_INFERER in cinn_launch op.
-
由 Wilber 提交于
* add pten::Place data structure. * update ci problem * fix ci problem * update * using platform::Place=pten::Place * remove BOOST_GET_CONST for CPUPlace and GPUPlace * compile pass 25%. * compile pass 45% * compile pass 60% * remove boost_get for xpu npu mlu and ipu * compile pass on cpu and gpu. * fix compile problem * fix compile error. * update * fix ci problem * update * ci approve * fix ci problem * fix ci eager test problem * remove BOOST_GET_CONST * fix npu compile
-
由 Allen Guo 提交于
Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: NAllen Guo <alleng@graphcore.ai> Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> Co-authored-by: NHaicheng Jiang <haichengj@graphcore.ai> Co-authored-by: NHan Zhao <hanzhao@graphcore.ai> Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> Co-authored-by: NHaicheng Jiang <haichengj@graphcore.ai> Co-authored-by: NHan Zhao <hanzhao@graphcore.ai>
-
由 Allen Guo 提交于
* update ipu releated passes Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: NAllen Guo <alleng@graphcore.ai> Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> Co-authored-by: NHaicheng Jiang <haichengj@graphcore.ai> Co-authored-by: NHan Zhao <hanzhao@graphcore.ai> * remove ipu_pass_base * update error msg * update error msg 02 * split pr 01 * restore ipu_pass_base Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> Co-authored-by: NHaicheng Jiang <haichengj@graphcore.ai> Co-authored-by: NHan Zhao <hanzhao@graphcore.ai>
-
由 sneaxiy 提交于
* add no reduce mode for pe * add NoReduce ut
-