- 15 9月, 2022 6 次提交
-
-
由 HongyuJia 提交于
* refine PADDLE_WITH_MKLDNN code * fix data_norm_op * polish addmm_op
-
由 Jacek Czaja 提交于
* - mul & matmul changes - fix - bs16 correction of strides * - cosmetic fixes * - lint * - fix * - fix * - format -> mem_desc * - fix * - fix * - fix * - fix * - fix
-
由 zyfncg 提交于
* clear extra attrs of elementwise op in opmaker * fix op_debug_string_test * fix bug of grad_add * fix sort of runtime attrs
-
由 WangZhen 提交于
Support 0 shapes input Tensor for MKL slice kernel
-
由 Nyakku Shigure 提交于
-
由 weishengying 提交于
-
- 14 9月, 2022 8 次提交
-
-
由 Jiabin Yang 提交于
* support bmm and bmm_grad in xpu * add error removal * test=kunlun * refactor code for better structure * test=kunlun * add fp16 kernel for bmm * test=kunlun
-
由 Nyakku Shigure 提交于
* [CodeStyle] trim trailing whitespace in .md and .rst * empty commit, test=document_fix
-
由 Leo Guo 提交于
Migrate scale and scatter to phi, and modify the code style for roi_align_kernel. test=kunlun (#45938)
-
由 ykkk2333 提交于
-
由 sneaxiy 提交于
* fix distributed_fused_lamb nan * remove CUDA_ASSERT
-
由 Yiqun Liu 提交于
-
由 Xiaoxu Chen 提交于
* add reduce_mean,reduce_sum primitive ops * add ne_p gt_p primitive operators * add ge_p abs_p primitive oparators
-
由 Chenxiao Niu 提交于
-
- 13 9月, 2022 3 次提交
- 10 9月, 2022 1 次提交
-
-
由 qipengh 提交于
-
- 09 9月, 2022 7 次提交
-
-
由 duanyanhui 提交于
* make memcpy op to support custom device * fix bug
-
由 Leo Chen 提交于
* add operator<< for BuildStrategy * add fake_coalesce * fit allreduce mode for new_exe * remove dubeg code * follow comments
-
由 Chen Weihang 提交于
* migrate load kernel * remove load op * fix test failed
-
由 xiaoxiaohehe001 提交于
-
由 ronnywang 提交于
* [CustomDevice] add dy2static support * update
-
由 Chen Weihang 提交于
* add fusion dir and fuse_softmax_mask kernel * remove fusion kernel dir * migrate infershape * fix code errror
-
由 sneaxiy 提交于
-
- 08 9月, 2022 7 次提交
-
-
由 piotrekobi 提交于
* gaussian random * mkldnn to onednn renaming * fix merge conflicts * remove fluid code * onednn renaming * Move classes from mkldnn_reuse.h to onednn_reuse.h * Migrate pool+grad, clip+grad and cast oneDNN kernels to PHI * Refactor grad kernels into separate files * Fix CI failures * Fix Codestyle * Implement reviewer suggestions * Add new lines after includes for readability Co-authored-by: NSilv3S <slawomir.siwek@intel.com>
-
由 Leo Guo 提交于
-
由 HongyuJia 提交于
-
由 taixiurong 提交于
* add gemm_epilogue * xpu-paddlepaddle-40 [任务] fused_gemm_epilogue 支持 test=kunlun
-
由 TeFeng Chen 提交于
-
由 xiongkun 提交于
* stop pass filter int32/int16/int64/bool inputs in cond_op * fix bugs: except block 0, the backward vars and forward vars exist in different blocks. * fix code by review
-
由 sneaxiy 提交于
-
- 07 9月, 2022 8 次提交
-
-
由 Chen Weihang 提交于
* add save kernel * add save_sr_kernel * remove original save_op * add save gpu kernel * remove combine kernel * add port.h include * add save selected rows test * remove useless kernel.h
-
由 Yuang Liu 提交于
-
由 WangZhen 提交于
* Construct exec and ctx only once in cond op to speed up * Fix construct function error
-
由 Wilber 提交于
-
由 piotrekobi 提交于
* gaussian random * mkldnn to onednn renaming * fix merge conflicts * Migrate reduce_op oneDNN kernels to phi * Remove unnecessary header * remove fluid code * onednn renaming * Change std::vector<int64_t> to IntArray * Fix code style * Move classes from mkldnn_reuse.h to onednn_reuse.h * Move more functions from mkldnn_helper.h to onednn_helpper.h * Change MKLDNN to OneDNN in VLOG message * Implement reviewer suggestions Co-authored-by: NSilv3S <slawomir.siwek@intel.com>
-
由 WangZhen 提交于
Adapt tensor output_size for conv2d_transpose and depthwise_conv2d_transpose
-
由 Yuang Liu 提交于
-
由 zyfncg 提交于
* clear extra attrs of reduce op in opmaker * fix reduce_mean
-