- 30 11月, 2022 1 次提交
-
-
由 zhangyikun02 提交于
-
- 29 11月, 2022 1 次提交
-
-
由 haosicheng 提交于
-
- 28 11月, 2022 3 次提交
-
-
由 huangjiyi 提交于
* rm fluid “xpu_header.h” deps in phi * move part of xpu_op_list.h from fluid to phi * add fluid xpu_op_list deps * add glog deps for xpu_op_list in phi * fix PR-CI-Kunlun
-
由 zyfncg 提交于
* add fluid_op_name_map * rename some kernel name * add comments for op-kernel map * refine map name of op to kernel
-
由 haosicheng 提交于
-
- 24 11月, 2022 2 次提交
-
-
由 zhangyikun02 提交于
-
由 zhangyikun02 提交于
-
- 23 11月, 2022 2 次提交
-
-
由 ykkk2333 提交于
* add stat tool * add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun * add masked_selected_grad kernel,test=kunlun
-
由 zhangyikun02 提交于
-
- 21 11月, 2022 2 次提交
-
-
由 wanghuancoder 提交于
* refine reduce_all
-
由 taixiurong 提交于
-
- 18 11月, 2022 2 次提交
-
-
由 james 提交于
* correct sync behavior for XPU distributed training XPU support event mechanism similar to cuda event, so it is advisable to use an event to sync compute/comm streams for performance. However this mechanism is never fully tested, and inconsistent loss/ending_epochs are reported. Therefore, this PR replaces event sync with stream waiting as a temporary solution. * remove compile warning
-
由 zhangyikun02 提交于
-
- 17 11月, 2022 2 次提交
-
-
由 YuanRisheng 提交于
* standard api * fix xpu bugs
-
由 taixiurong 提交于
-
- 16 11月, 2022 1 次提交
-
-
由 ykkk2333 提交于
* add stat tool * add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun * embedding and embedding_grad add int32 input, test=kunlun
-
- 15 11月, 2022 1 次提交
-
-
由 zhouweiwei2014 提交于
-
- 11 11月, 2022 1 次提交
-
-
由 zhouweiwei2014 提交于
-
- 10 11月, 2022 6 次提交
-
-
由 zhangyikun02 提交于
-
由 YuanRisheng 提交于
* standard api * fix sparse bugs * fix xpu bugs, test=kunlun * remove hard code for custom unittest * open ci, test=kunlun * deal with conflict
-
由 Wang Xin 提交于
* remove fluid/framework/generator.h from phi * fix PR-CI-Kunlun-KP-Build fail
-
由 huangjiyi 提交于
[PHI Decoupling] remove "paddle/fluid/platform/float16.h" and "paddle/fluid/platform/for_range.h" in phi. (#47817) * rm "paddle/fluid/platform/float16.h" in phi * rm "paddle/fluid/platform/for_range.h" in phi
-
由 zhouweiwei2014 提交于
-
由 james 提交于
* XPU support eager mode * add unittest for XPU eager mode * minor bugfix * minor bugfix, test=kunlun * correct copyright info * 1. remove unsed vars/funcs 2. ProcessGroupBKCL inherit from ProcessGroupStream * bugfix for fp16 in eager mode multi-card, test=kunlun * rebase & fix a few issues * use new processgroup interface, test=kunlun * fix compile issue, test=kunlun
-
- 09 11月, 2022 1 次提交
-
-
由 Wang Xin 提交于
* remove framework/data_type.h from phi * fix CI fail: map proto::VarType to phi::DataType * refactor code to add more detailed comments
-
- 08 11月, 2022 2 次提交
-
-
由 zhangyikun02 提交于
-
由 zhangyikun02 提交于
-
- 07 11月, 2022 2 次提交
-
-
由 QingshuChen 提交于
*test=kunlun
-
由 ykkk2333 提交于
add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun (#47368) * add stat tool * add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun
-
- 04 11月, 2022 3 次提交
-
-
由 houj04 提交于
* [XPU] add cumsum op. test=kunlun * try to fix linker. test=kunlun * try to fix linker. test=kunlun * try to fix linker. test=kunlun * debug. test=kunlun * update xpu.cmake. remove unnecessary codes. test=kunlun.
-
由 ykkk2333 提交于
-
由 zhangyikun02 提交于
-
- 03 11月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
-
- 02 11月, 2022 4 次提交
-
-
由 zhangbo9674 提交于
* fix ci bug * test
-
由 YuanRisheng 提交于
* Standardise batch norm * standardize conv3d and depwise_conv2d * fix ci bugs
-
由 zhouweiwei2014 提交于
-
由 houj04 提交于
* [XPU] add int64 support for slice and subtract. test=kunlun * try to fix xpu compile. test=kunlun * try to fix xpu compile. test=kunlun * try to fix xpu compile. test=kunlun * remove unnecessary modification. test=kunlun
-
- 01 11月, 2022 3 次提交
-
-
由 YuanRisheng 提交于
* standard_api * add hardtanh
-
由 Wang Xin 提交于
-
由 Chen Weihang 提交于
* add extra attr property set * add type_info for all context * add onednn context to all context * fix context compile error * simplify conv kernel args * pass runtime attr into dev_ctx * fix marco error * clear conv_grad_kernel extra args * merge conv_grad_grad into conv_grad * clear conv2d_grad_grad extra attrs * clear yaml and eager extra attr * fix conv1d error * change to thread local * fix npu compile failed * try to fix windows compile failed * add conv2d onednn phi kernel * fix ci bugs (#36) * fix compile bugs (#38) * fix extra input transform bug (#39) * support dynamic created attr (#40) * reset extra info gen code * rm conv_grad_grad kernel * reimpl pass attr adapting * add int attr support * remove vector inputnames creating * fix map at error * Update paddle/phi/kernels/onednn/conv_grad_kernel.cc Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com> * remove useless extra attrs * replace mkldnn_engine by onednn_engine Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com> Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
-