- 04 Nov, 2022 10 commits
-
-
Committed by houj04
* [XPU] add cumsum op. test=kunlun * try to fix linker. test=kunlun * try to fix linker. test=kunlun * try to fix linker. test=kunlun * debug. test=kunlun * update xpu.cmake. remove unnecessary codes. test=kunlun.
-
Committed by pangyoki
-
Committed by Sławomir Siwek
-
Committed by Piotr Paturej
* add extra attr property set * add type_info for all context * add onednn context to all context * fix context compile error * simplify conv kernel args * pass runtime attr into dev_ctx * fix macro error * clear conv_grad_kernel extra args * merge conv_grad_grad into conv_grad * clear conv2d_grad_grad extra attrs * clear yaml and eager extra attr * fix conv1d error * change to thread local * fix npu compile failed * try to fix windows compile failed * add conv2d onednn phi kernel * fix ci bugs (#36) * fix compile bugs (#38) * fix extra input transform bug (#39) * support dynamic created attr (#40) * reset extra info gen code * rm conv_grad_grad kernel * reimpl pass attr adapting * add int attr support * remove vector inputnames creating * fix map at error * Update paddle/phi/kernels/onednn/conv_grad_kernel.cc Co-authored-by: Sławomir Siwek <slawomir.siwek@intel.com> * remove useless extra attrs * replace mkldnn_engine by onednn_engine * Migrate pool+grad to PHI * Update paddle/fluid/operators/mkldnn/test_mkldnn_op_nhwc.cc Co-authored-by: Sławomir Siwek <slawomir.siwek@intel.com> * Update paddle/phi/kernels/onednn/pool_grad_kernel.cc Co-authored-by: Sławomir Siwek <slawomir.siwek@intel.com> * Update paddle/phi/kernels/onednn/pool_kernel.cc Co-authored-by: Sławomir Siwek <slawomir.siwek@intel.com> Co-authored-by: Chen Weihang <chenweihang@baidu.com> Co-authored-by: YuanRisheng <yuanrisheng@baidu.com> Co-authored-by: Chen Weihang <chenwhpro@163.com> Co-authored-by: Sławomir Siwek <slawomir.siwek@intel.com>
-
Committed by Sławomir Siwek
* add extra attr property set * add type_info for all context * add onednn context to all context * fix context compile error * simplify conv kernel args * pass runtime attr into dev_ctx * fix macro error * clear conv_grad_kernel extra args * merge conv_grad_grad into conv_grad * clear conv2d_grad_grad extra attrs * remove redundant imports * migrate softmax * clear yaml and eager extra attr * fix conv1d error * change to thread local * fix npu compile failed * try to fix windows compile failed * add conv2d onednn phi kernel * fix ci bugs (#36) * fix compile bugs (#38) * fix extra input transform bug (#39) * support dynamic created attr (#40) * reset extra info gen code * rm conv_grad_grad kernel * reimpl pass attr adapting * add int attr support * remove vector inputnames creating * merge dev * fix map at error * adjust attribute * adapt funcs to PHI * init * adjust imports * support postops * format codeblocks * revert changes to softmax Co-authored-by: Chen Weihang <chenweihang@baidu.com> Co-authored-by: YuanRisheng <yuanrisheng@baidu.com>
-
Committed by LiYuRio
-
Committed by ykkk2333
-
Committed by jakpiase
* tmp save * minor change * CI fix * added FC optimizations * latest update * CI fix * fixed bug with fusing fc
-
Committed by zhangyikun02
-
Committed by wanghuancoder
* fix cc_library link python lib
-
- 03 Nov, 2022 19 commits
-
-
Committed by Wangzheee
* disable_lookup_table_v2
-
Committed by zhouweiwei2014
-
Committed by wanghuancoder
-
Committed by Piotr Paturej
* Fix dnnl errors in elementwise_sub tests * Fix model accuracy attempt * Add new fix * Add proper fix * Refactor by removing code repetition
-
Committed by Tian Zheng
-
Committed by Ruibiao Chen
* Dispatch computation OPs before communication in standalone executor * Update code * Fix CI errors * Improve performance of coalesce_tensor and depend OP in standalone executor * pre-commit check
-
Committed by zhouweiwei2014
-
Committed by yeliang2258
* add constant_folding_pass pass for mkldnn int8 * update UpdateScaleOpInOutScales
-
Committed by Nyakku Shigure
* [CodeStyle][py2][U008] remove unnecessary args in `super()` * remove remained args * revert changes in test_pylayer_op * Revert "revert changes in test_pylayer_op" This reverts commit ff185a9ae738afac3b0264f61bde6c6b7f72e7c4. * revert some changes in example code
-
Committed by Leo Chen
-
Committed by HongyuJia
* opt CanMKLDNNBeUsed performance * fix nullptr bug * fix OpBase default_attrs=nullptr bug * fix OpBase default_attrs=nullptr bug * fix OpBase default_attrs=nullptr bug
-
Committed by sneaxiy
-
Committed by Sławomir Siwek
* add extra attr property set * add type_info for all context * add onednn context to all context * fix context compile error * simplify conv kernel args * pass runtime attr into dev_ctx * fix macro error * clear conv_grad_kernel extra args * merge conv_grad_grad into conv_grad * clear conv2d_grad_grad extra attrs * remove redundant imports * migrate softmax * clear yaml and eager extra attr * fix conv1d error * change to thread local * fix npu compile failed * try to fix windows compile failed * add conv2d onednn phi kernel * fix ci bugs (#36) * fix compile bugs (#38) * fix extra input transform bug (#39) * support dynamic created attr (#40) * reset extra info gen code * rm conv_grad_grad kernel * reimpl pass attr adapting * add int attr support * remove vector inputnames creating * merge dev * fix map at error * adjust attribute * adapt funcs to PHI Co-authored-by: Chen Weihang <chenweihang@baidu.com> Co-authored-by: YuanRisheng <yuanrisheng@baidu.com>
-
Committed by zhangkaihuo
-
Committed by pangyoki
* add cudnn into whl package * add cudnn dso into whl package * let WITH_CUDNN_DSO be consistent with WITH_GPU * fix WITH_CUDNN_DSO in paddle_build
-
Committed by zhouweiwei2014
-
Committed by wenbin
-
Committed by Wang Xin
* remove unused-variable warning in linux * fix unused-variable error in GpuPS
-
Committed by YuanRisheng
-
- 02 Nov, 2022 11 commits
-
-
Committed by Zhang Jun
* add bilinear OutSize
-
Committed by Leo Chen
-
Committed by zhangbo9674
* fix ci bug * test
-
Committed by 丁一
-
Committed by Ruibiao Chen
* Dispatch computation OPs before communication in standalone executor * Update code * Fix CI errors
-
Committed by Tao Luo
-
Committed by Chen Weihang
* add storage properties for npu * fix compile failed * fix api name mismatch * polish design
-
Committed by YuanRisheng
* Standardise batch norm * standardize conv3d and depwise_conv2d * fix ci bugs
-
Committed by zhouweiwei2014
-
Committed by Yiqun Liu
Improve the tool for checking nan and inf, and support computing the max, min and mean of the output tensor. (#47095) * Improve the tool for checking nan and inf, and support computing the max, min and mean of the output tensor. * Add a FLAGS to control whether to abort when inf/nan is encountered, and polish code. * Fix unittest. * Change the computation of the mean.