- 07 11月, 2022 12 次提交
-
-
由 Wang Xin 提交于
-
由 QingshuChen 提交于
*test=kunlun
-
由 ykkk2333 提交于
add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun (#47368) * add stat tool * add roll and roll_grad kernels and strided_slice and strided_slice_grad kernels, test=kunlun
-
由 wanghuancoder 提交于
* refine python lib link
-
由 Yuanle Liu 提交于
-
由 ronnywang 提交于
-
由 WangZhen 提交于
* Get three grad lists in CPP to avoid gpu idle time * Support legacy mode
-
由 HongyuJia 提交于
* move cudnn hardcode outside GetExpectedKernelType * add header file * debug * update interpreter_util with hardcode * update interpreter_util headerfile * solve activation hardcode * debug with CI * add mkldnn_op_list header file * temporarily uncomment mkldnn * temporarily uncomment mkldnn * delete sequence_softmax cudnn hardcode * add hardcode to data_transfer.cc * update data_transfer headerfile * try fix segment fault * update cudnn&miopen_helper * reset HasAttr of DygraphExctnCtx * debug, this commit should pass all CI * debug should pass CI, temporarily disable activation * debug should pass CI * fix default_attr=nullptr bug * clean debug code * Call SetDnnFallback function in the base class * activation fallback to plain kernel * fix default GetExpectedKernelType find wrong kernel * search cudnn kernel instead of fallback * fix cudnn_handle bug * remove tanh use_cudnn * restore tanh use_cudnn * debug tanh * fix tanh bug * delete activation cudnn kernel * polish code
-
由 Qi Li 提交于
-
由 Wen Sun 提交于
-
由 Sławomir Siwek 提交于
* init changes * bnorm * method signature * change order * bnorm * removed unused args
-
由 Sławomir Siwek 提交于
* remove fwd funcs * migrate conv grads
-
- 05 11月, 2022 1 次提交
-
-
由 Yiqun Liu 提交于
-
- 04 11月, 2022 13 次提交
-
-
由 LiYuRio 提交于
-
由 zyfncg 提交于
* generate static graph code for some activation op * fix bug * fix infermeta of selected_rows
-
由 Jacek Czaja 提交于
* slice & mul & requantize * - Fix to requentize test
-
由 shentanyue 提交于
-
由 LiYuRio 提交于
-
由 houj04 提交于
* [XPU] add cumsum op. test=kunlun * try to fix linker. test=kunlun * try to fix linker. test=kunlun * try to fix linker. test=kunlun * debug. test=kunlun * update xpu.cmake. remove unnecessary codes. test=kunlun.
-
由 Sławomir Siwek 提交于
-
由 Piotr Paturej 提交于
* add extra attr property set * add type_info for all context * add onednn context to all context * fix context compile error * simplify conv kernel args * pass runtime attr into dev_ctx * fix marco error * clear conv_grad_kernel extra args * merge conv_grad_grad into conv_grad * clear conv2d_grad_grad extra attrs * clear yaml and eager extra attr * fix conv1d error * change to thread local * fix npu compile failed * try to fix windows compile failed * add conv2d onednn phi kernel * fix ci bugs (#36) * fix compile bugs (#38) * fix extra input transform bug (#39) * support dynamic created attr (#40) * reset extra info gen code * rm conv_grad_grad kernel * reimpl pass attr adapting * add int attr support * remove vector inputnames creating * fix map at error * Update paddle/phi/kernels/onednn/conv_grad_kernel.cc Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com> * remove useless extra attrs * replace mkldnn_engine by onednn_engine * Migrate pool+grad to PHI * Update paddle/fluid/operators/mkldnn/test_mkldnn_op_nhwc.cc Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com> * Update paddle/phi/kernels/onednn/pool_grad_kernel.cc Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com> * Update paddle/phi/kernels/onednn/pool_kernel.cc Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com> Co-authored-by: NChen Weihang <chenweihang@baidu.com> Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com> Co-authored-by: NChen Weihang <chenwhpro@163.com> Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
-
由 Sławomir Siwek 提交于
* add extra attr property set * add type_info for all context * add onednn context to all context * fix context compile error * simplify conv kernel args * pass runtime attr into dev_ctx * fix marco error * clear conv_grad_kernel extra args * merge conv_grad_grad into conv_grad * clear conv2d_grad_grad extra attrs * remove redundant imports * migrate softmax * clear yaml and eager extra attr * fix conv1d error * change to thread local * fix npu compile failed * try to fix windows compile failed * add conv2d onednn phi kernel * fix ci bugs (#36) * fix compile bugs (#38) * fix extra input transform bug (#39) * support dynamic created attr (#40) * reset extra info gen code * rm conv_grad_grad kernel * reimpl pass attr adapting * add int attr support * remove vector inputnames creating * merge dev * fix map at error * adjust attribute * adapt funcs to PHI * init * adjust imports * support postops * format codeblocks * revert changes to softmax Co-authored-by: NChen Weihang <chenweihang@baidu.com> Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
-
由 LiYuRio 提交于
-
由 ykkk2333 提交于
-
由 jakpiase 提交于
* tmp save * minor chnage * CI fix * added FC optimizations * latest update * CI fix * fixed bug with fusing fc
-
由 wanghuancoder 提交于
* fix cc_library link python lib
-
- 03 11月, 2022 10 次提交
-
-
由 Wangzheee 提交于
* disable_lookup_table_v2
-
由 Piotr Paturej 提交于
* Fix dnnl errors in elementwise_sub tests * Fix model accuracy attempt * Add new fix * Add proper fix * Refactor by removing code repetition
-
由 Ruibiao Chen 提交于
* Dispath computation OPs before communication in standalone executor * Update code * Fix CI errors * Improve performance of coalesce_tensor and depend OP in standalone executor * pre-commit check
-
由 zhouweiwei2014 提交于
-
由 yeliang2258 提交于
* add constant_folding_pass pass for mkldnn int8 * update UpdateScaleOpInOutScales
-
由 Leo Chen 提交于
-
由 HongyuJia 提交于
* opt CanMKLDNNBeUsed performance * fix nullptr bug * fix OpBase default_attrs=nullptr bug * fix OpBase default_attrs=nullptr bug * fix OpBase default_attrs=nullptr bug
-
由 Sławomir Siwek 提交于
* add extra attr property set * add type_info for all context * add onednn context to all context * fix context compile error * simplify conv kernel args * pass runtime attr into dev_ctx * fix marco error * clear conv_grad_kernel extra args * merge conv_grad_grad into conv_grad * clear conv2d_grad_grad extra attrs * remove redundant imports * migrate softmax * clear yaml and eager extra attr * fix conv1d error * change to thread local * fix npu compile failed * try to fix windows compile failed * add conv2d onednn phi kernel * fix ci bugs (#36) * fix compile bugs (#38) * fix extra input transform bug (#39) * support dynamic created attr (#40) * reset extra info gen code * rm conv_grad_grad kernel * reimpl pass attr adapting * add int attr support * remove vector inputnames creating * merge dev * fix map at error * adjust attribute * adapt funcs to PHI Co-authored-by: NChen Weihang <chenweihang@baidu.com> Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
-
由 wenbin 提交于
-
由 Wang Xin 提交于
* remove unused-variable warning in linux * fix unused-variable error in GpuPS
-
- 02 11月, 2022 4 次提交
-
-
由 Zhang Jun 提交于
* add bilinear OutSize
-
由 丁一 提交于
-
由 Ruibiao Chen 提交于
* Dispath computation OPs before communication in standalone executor * Update code * Fix CI errors