- 11 11月, 2022 2 次提交
-
-
由 czr-gc 提交于
* feat(ipu): add model_runtime backend support in IPU. * fix(ipu_executor): fix error message format. * fix(ipu_executor): fix format. * fix(ipu_executor): fix format again. * fix(ipu_executor): fix format again. * fix(ipu_executor): fix format again.
-
由 james 提交于
phi::Alloc() complains about missing device_allocator_
-
- 10 11月, 2022 1 次提交
-
-
由 james 提交于
* XPU support eager mode * add unittest for XPU eager mode * minor bugfix * minor bugfix, test=kunlun * correct copyright info * 1. remove unsed vars/funcs 2. ProcessGroupBKCL inherit from ProcessGroupStream * bugfix for fp16 in eager mode multi-card, test=kunlun * rebase & fix a few issues * use new processgroup interface, test=kunlun * fix compile issue, test=kunlun
-
- 09 11月, 2022 1 次提交
-
-
由 Jacek Czaja 提交于
* first commit - more fixes - compilation fix - compilation fix - fix - another fix - yet another fix - Fix - fix to fused ops - compilation fix - compilation fix - another compilation fix - another fix - fix - fix - fix - fix - yet another fix - fix - fix - cosmetic fix :- lint - Revert some changes (to be brought back later) - fix to build - Added prototype of slice - fix compilation fix - compilation fix - fix - fix - Fix - fix fix modified: cmake/flags.cmake * lint * rerun of CI * - Fix * - lint * - lint2
-
- 08 11月, 2022 2 次提交
-
-
由 zhangyikun02 提交于
-
由 zhangyikun02 提交于
-
- 07 11月, 2022 5 次提交
-
-
由 Hui Zhang 提交于
* suqeeze2 transpose2 fuse onednn * format * fix output shape * fix conflict * format * format * remove useless * remove log * simply pass * fix comment * fix * fix msg * fix error msg * format
-
由 QingshuChen 提交于
*test=kunlun
-
由 ykkk2333 提交于
add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun (#47368) * add stat tool * add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun
-
由 ronnywang 提交于
-
由 HongyuJia 提交于
* move cudnn hardcode outside GetExpectedKernelType * add header file * debug * update interpreter_util with hardcode * update interpreter_util headerfile * solve activation hardcode * debug with CI * add mkldnn_op_list header file * temporarily uncomment mkldnn * temporarily uncomment mkldnn * delete sequence_softmax cudnn hardcode * add hardcode to data_transfer.cc * update data_transfer headerfile * try fix segment fault * update cudnn&miopen_helper * reset HasAttr of DygraphExctnCtx * debug, this commit should pass all CI * debug should pass CI, temporarily disable activation * debug should pass CI * fix default_attr=nullptr bug * clean debug code * Call SetDnnFallback function in the base class * activation fallback to plain kernel * fix default GetExpectedKernelType find wrong kernel * search cudnn kernel instead of fallback * fix cudnn_handle bug * remove tanh use_cudnn * restore tanh use_cudnn * debug tanh * fix tanh bug * delete activation cudnn kernel * polish code
-
- 05 11月, 2022 1 次提交
-
-
由 Yiqun Liu 提交于
-
- 04 11月, 2022 3 次提交
-
-
由 houj04 提交于
* [XPU] add cumsum op. test=kunlun * try to fix linker. test=kunlun * try to fix linker. test=kunlun * try to fix linker. test=kunlun * debug. test=kunlun * update xpu.cmake. remove unnecessary codes. test=kunlun.
-
由 ykkk2333 提交于
-
由 jakpiase 提交于
* tmp save * minor chnage * CI fix * added FC optimizations * latest update * CI fix * fixed bug with fusing fc
-
- 03 11月, 2022 1 次提交
-
-
由 zhouweiwei2014 提交于
-
- 02 11月, 2022 4 次提交
-
-
由 Yiqun Liu 提交于
Improve the tool for checking nan and inf, and support to compute the max, min and mean of output tensor. (#47095) * Improve the tool for checking nan and inf, and support to compute the max, min and mean of output tensor. * Add a FLAGS to control whether abort when meets inf/nan and polish codes. * Fix unittest. * Change the computing of mean.
-
由 houj04 提交于
* [XPU] add int64 support for slice and subtract. test=kunlun * try to fix xpu compile. test=kunlun * try to fix xpu compile. test=kunlun * try to fix xpu compile. test=kunlun * remove unnecessary modification. test=kunlun
-
由 Tian Zheng 提交于
* Add build option for CUDNN Frontend API * Fix review comments * Change namespace for cudnn_frontend.h
- 01 11月, 2022 2 次提交
-
-
由 HongyuJia 提交于
* move cudnn hardcode outside GetExpectedKernelType * add header file * debug * update interpreter_util with hardcode * update interpreter_util headerfile * solve activation hardcode * debug with CI * add mkldnn_op_list header file * temporarily uncomment mkldnn * temporarily uncomment mkldnn * delete sequence_softmax cudnn hardcode * add hardcode to data_transfer.cc * update data_transfer headerfile * try fix segment fault * update cudnn&miopen_helper * reset HasAttr of DygraphExctnCtx * debug, this commit should pass all CI * debug should pass CI, temporarily disable activation * debug should pass CI * fix default_attr=nullptr bug * clean debug code
-
由 Chen Weihang 提交于
* add extra attr property set * add type_info for all context * add onednn context to all context * fix context compile error * simplify conv kernel args * pass runtime attr into dev_ctx * fix marco error * clear conv_grad_kernel extra args * merge conv_grad_grad into conv_grad * clear conv2d_grad_grad extra attrs * clear yaml and eager extra attr * fix conv1d error * change to thread local * fix npu compile failed * try to fix windows compile failed * add conv2d onednn phi kernel * fix ci bugs (#36) * fix compile bugs (#38) * fix extra input transform bug (#39) * support dynamic created attr (#40) * reset extra info gen code * rm conv_grad_grad kernel * reimpl pass attr adapting * add int attr support * remove vector inputnames creating * fix map at error * Update paddle/phi/kernels/onednn/conv_grad_kernel.cc Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com> * remove useless extra attrs * replace mkldnn_engine by onednn_engine Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com> Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
-
- 27 10月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* add predictor_engine * add predictor_engine * fix zero shape * fix lodTensor * fix unittest * fix code style * update CmakeList
-
- 26 10月, 2022 2 次提交
-
-
由 Sławomir Siwek 提交于
* fc/matmuls + scale fuse pass * remove double-extension * add unit tests * comments from review * codestyle * add pass to int8 list * new codestyle * attr name typo
-
由 HongyuJia 提交于
* remove prior_box mkldnn hard code * add header file * simplify PD_VISIT_TYPE * decouple dependency between prior_box and density_prior_box * fix pragma omp parallel error * bypass #pragma omp_parallel_for error * polish code * remove visit_type headerfile * polish codestyle * polish codestyle * try fix CI error * add testcase, datatype=float64 * reset test_prior_box testcase * add datacheck to DenseTensor * update template name * call prior_box with macro expand
-
- 25 10月, 2022 2 次提交
- 24 10月, 2022 1 次提交
-
-
由 HongyuJia 提交于
* delete GetExpectedKernelType mkldnn of mul_grad * update mkldnn_op_list, remove mul_grad * delete GetExpectedKernelType mkldnn of mul
-
- 21 10月, 2022 1 次提交
-
-
由 Yuanle Liu 提交于
* fix nvprof_nvtx_push interface bug
-
- 20 10月, 2022 1 次提交
-
-
由 HongyuJia 提交于
* remove fc mkldnn hardcode * remove useless enum of kFCMKLDNN * fix macro error * update operators.cmake
-
- 19 10月, 2022 2 次提交
-
-
由 Yuanle Liu 提交于
-
由 Leo Chen 提交于
* clean unused code: piece.cc/h * clean usage
-
- 18 10月, 2022 1 次提交
-
-
由 HongyuJia 提交于
-
- 17 10月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* namespace modify * update by comment
-
- 15 10月, 2022 1 次提交
-
-
由 HongyuJia 提交于
-
- 13 10月, 2022 2 次提交
-
-
由 Leo Chen 提交于
-
由 HongyuJia 提交于
* remove PADDLE_WITH_MKLDNN, test white_list=abs * fix unique_ptr * fix op.Type() * remove TODO in kernel_dispatch.h * remove IndicateVarDataType function, update white_list * remove mkldnn hard code * add comments * fix == * update mkldnn_op_list * delete hard code of OPs * update mkldnn_op_list * update mkldnn_op_list, remove interp * add error check for ExecutionContext * update mkldnn_op_list, remove transpose2_grad * remove interpolate mkldnn * remove fill_constant mkldnn * opt HasAttr in DygraphExecutionContext * deprecated commit, test mkldnn_white_list * deprecated commit, test mkldnn_white_list * deprecated commit, test mkldnn_black_list * update mkldnn_op_list, add assert error op * solve cudnn related op * fix error * add mkldnn fallback in phi_utils.cc * remove mkldnn fallback in phi_utils.cc * opt code implementation * polish Copyright License
-
- 11 10月, 2022 2 次提交
-
-
由 Wen Sun 提交于
-
由 Chen Weihang 提交于
* remove using lodtensor part1 * polish history code format
-
- 10 10月, 2022 1 次提交
-
-
由 Sylwester Fraczek 提交于
* Add methods that find input or output name by var name * kind of bugfix - initialize variables * ci fix * review fixed
-