- 04 Nov, 2022 1 commit
-
-
Committed by Piotr Paturej
* add extra attr property set * add type_info for all context * add onednn context to all context * fix context compile error * simplify conv kernel args * pass runtime attr into dev_ctx * fix marco error * clear conv_grad_kernel extra args * merge conv_grad_grad into conv_grad * clear conv2d_grad_grad extra attrs * clear yaml and eager extra attr * fix conv1d error * change to thread local * fix npu compile failed * try to fix windows compile failed * add conv2d onednn phi kernel * fix ci bugs (#36) * fix compile bugs (#38) * fix extra input transform bug (#39) * support dynamic created attr (#40) * reset extra info gen code * rm conv_grad_grad kernel * reimpl pass attr adapting * add int attr support * remove vector inputnames creating * fix map at error * Update paddle/phi/kernels/onednn/conv_grad_kernel.cc Co-authored-by: Sławomir Siwek <slawomir.siwek@intel.com> * remove useless extra attrs * replace mkldnn_engine by onednn_engine * Migrate pool+grad to PHI * Update paddle/fluid/operators/mkldnn/test_mkldnn_op_nhwc.cc Co-authored-by: Sławomir Siwek <slawomir.siwek@intel.com> * Update paddle/phi/kernels/onednn/pool_grad_kernel.cc Co-authored-by: Sławomir Siwek <slawomir.siwek@intel.com> * Update paddle/phi/kernels/onednn/pool_kernel.cc Co-authored-by: Sławomir Siwek <slawomir.siwek@intel.com> Co-authored-by: Chen Weihang <chenweihang@baidu.com> Co-authored-by: YuanRisheng <yuanrisheng@baidu.com> Co-authored-by: Chen Weihang <chenwhpro@163.com> Co-authored-by: Sławomir Siwek <slawomir.siwek@intel.com>
-
- 03 Nov, 2022 2 commits
-
-
Committed by zhouweiwei2014
-
Committed by Sławomir Siwek
* add extra attr property set * add type_info for all context * add onednn context to all context * fix context compile error * simplify conv kernel args * pass runtime attr into dev_ctx * fix marco error * clear conv_grad_kernel extra args * merge conv_grad_grad into conv_grad * clear conv2d_grad_grad extra attrs * remove redundant imports * migrate softmax * clear yaml and eager extra attr * fix conv1d error * change to thread local * fix npu compile failed * try to fix windows compile failed * add conv2d onednn phi kernel * fix ci bugs (#36) * fix compile bugs (#38) * fix extra input transform bug (#39) * support dynamic created attr (#40) * reset extra info gen code * rm conv_grad_grad kernel * reimpl pass attr adapting * add int attr support * remove vector inputnames creating * merge dev * fix map at error * adjust attribute * adapt funcs to PHI Co-authored-by: Chen Weihang <chenweihang@baidu.com> Co-authored-by: YuanRisheng <yuanrisheng@baidu.com>
-
- 02 Nov, 2022 1 commit
-
-
Committed by Tian Zheng
* Add build option for CUDNN Frontend API * Fix review comments * Change namespace for cudnn_frontend.h
-
- 01 Nov, 2022 1 commit
-
-
Committed by Chen Weihang
* add extra attr property set * add type_info for all context * add onednn context to all context * fix context compile error * simplify conv kernel args * pass runtime attr into dev_ctx * fix marco error * clear conv_grad_kernel extra args * merge conv_grad_grad into conv_grad * clear conv2d_grad_grad extra attrs * clear yaml and eager extra attr * fix conv1d error * change to thread local * fix npu compile failed * try to fix windows compile failed * add conv2d onednn phi kernel * fix ci bugs (#36) * fix compile bugs (#38) * fix extra input transform bug (#39) * support dynamic created attr (#40) * reset extra info gen code * rm conv_grad_grad kernel * reimpl pass attr adapting * add int attr support * remove vector inputnames creating * fix map at error * Update paddle/phi/kernels/onednn/conv_grad_kernel.cc Co-authored-by: Sławomir Siwek <slawomir.siwek@intel.com> * remove useless extra attrs * replace mkldnn_engine by onednn_engine Co-authored-by: YuanRisheng <yuanrisheng@baidu.com> Co-authored-by: Sławomir Siwek <slawomir.siwek@intel.com>
-
- 31 Oct, 2022 1 commit
-
-
Committed by ronnywang
* [CustomDevice] GetCCLComm add custom device support * update * update * update
-
- 27 Oct, 2022 1 commit
-
-
Committed by Jacek Czaja
* - halfway transforming transpose grad - Fixes - buildable * - lint * rerunning the process
-
- 25 Oct, 2022 1 commit
-
-
Committed by jakpiase
* return proper state * fix for dims * fix
-
- 19 Oct, 2022 1 commit
-
-
Committed by Yuanle Liu
-
- 17 Oct, 2022 2 commits
-
-
Committed by RedContritio
-
Committed by duanyanhui
* add singleton to custom device * Update custom_device.cc Init device_init_flag_ in default
-
- 11 Oct, 2022 1 commit
-
-
Committed by Chen Weihang
* remove using lodtensor part1 * polish history code format
-
- 10 Oct, 2022 1 commit
-
-
Committed by Paulina Gacek
* op migrated, Copy(OneDNNContext, ...) added * mutable_data & op registration in fluid removed * refactoring * OneDNNGetDataType to uppercase * missing cpu check added, handler moved to .h file * name changed to transpose_grad * Copy changed back to TensorCopy * Resizing corrected, Copy(OneDNNContext) removed
-
- 28 Sep, 2022 1 commit
-
-
Committed by Chen Weihang
* remove needless using tensor * remove needless using tensor * resolve conflict * replace tensor using * fix format error * revert needless changing * fix rocm and npu compile error * fix cinn compile error * fix format error * fix mkldnn format error * fix mkldnn format error * fix cinn compile error * fix cinn compile error * fix cinn compile error * resolve conflict
-
- 22 Sep, 2022 2 commits
-
-
Committed by Piotr Paturej
* Convert slice+grad oneDNN fluid kernels to PHI * Change mutable_data to Alloc * Refactor licences
-
Committed by Hui Zhang
* fix sub sign reverse for mkldnn * refactor code as comment * remove useless * format code
-
- 21 Sep, 2022 1 commit
-
-
Committed by Piotr Paturej
-
- 20 Sep, 2022 2 commits
-
-
Committed by Sławomir Siwek
* init * remove softmaxop * merge dev * correct dir * style
-
Committed by Piotr Paturej
* Convert split, pad and pad3d kernels * Convert slice+grad oneDNN fluid kernels to PHI * change out->mutable_data to dev_ctx.Alloc
-
- 18 Sep, 2022 1 commit
-
-
Committed by RichardWooSJTU
-
- 16 Sep, 2022 2 commits
-
-
Committed by sneaxiy
* support int64 non-broadcast * support broadcast case for int64 index * fix bug * support more Arity * remove some codes * upgrade patchelf to v0.15.0 to pass CI build * fix bug * fix patchelf installation * add debug flags * remove useless codes * fix viterbi_decode and set_value op uts * remove always enable int64
-
Committed by ronnywang
* [CustomDevice] add custom_device_resource_pool & device_event_custom_device * update * update * update * update
-
- 14 Sep, 2022 1 commit
-
-
Committed by JingZhuangzhuang
* Delay TensorRT registry * Add unused define * Fix TensorRT test * fix function to reference * Update trt_plugin.h
-
- 08 Sep, 2022 1 commit
-
-
Committed by piotrekobi
* gaussian random * mkldnn to onednn renaming * fix merge conflicts * remove fluid code * onednn renaming * Move classes from mkldnn_reuse.h to onednn_reuse.h * Migrate pool+grad, clip+grad and cast oneDNN kernels to PHI * Refactor grad kernels into separate files * Fix CI failures * Fix Codestyle * Implement reviewer suggestions * Add new lines after includes for readability Co-authored-by: Silv3S <slawomir.siwek@intel.com>
-
- 06 Sep, 2022 1 commit
-
-
Committed by Wilber
-
- 05 Sep, 2022 2 commits
-
-
Committed by piotrekobi
* gaussian random * mkldnn to onednn renaming * fix merge conflicts * remove fluid code * onednn renaming * Move classes from mkldnn_reuse.h to onednn_reuse.h * Move more functions from mkldnn_helper.h to onednn_helpper.h * Change MKLDNN to OneDNN in VLOG message Co-authored-by: Silv3S <slawomir.siwek@intel.com>
-
Committed by sneaxiy
-
- 04 Sep, 2022 1 commit
-
-
Committed by Sławomir Siwek
* gaussian random * mkldnn to onednn renaming * fix merge conflicts * remove fluid code * onednn renaming * change header path * change fluid import to phi
-
- 02 Sep, 2022 1 commit
-
-
Committed by kangguangli
-
- 01 Sep, 2022 1 commit
-
-
Committed by Leo Chen
* refine cmake of framework * add deps for dense tensor * fix deps * remove alloc(ctx) * add depends on mkldnn
-
- 24 Aug, 2022 1 commit
-
-
Committed by Rayman
* [Hackathon No.34] Optimize poisson op * [poisson] code style fix * modify code style * prevent from big number * modify code style * modify code style * modify import * modify import * modify code style
-
- 23 Aug, 2022 1 commit
-
-
Committed by ronnywang
* [CustomDevice] add profiler apis * migrate CalculateEstOccupancy into cuda_tracer * update * add ut
-
- 22 Aug, 2022 1 commit
-
-
Committed by ronnywang
-
- 18 Aug, 2022 1 commit
-
-
Committed by zhangxiaoci
* change to async mode for xpu multi-card training in static graph mode * minor bugfix * irrelevant. move to another pr * move change to other pr * fix stream issue * fix 'stream not meet with current context' error * fix branch diverge, test=kunlun
-
- 10 Aug, 2022 2 commits
-
-
Committed by zhangxiaoci
* add macro control in enforce_xpu.h, test=kunlun * minor bugfix * minor bugfix
-
Committed by Leo Chen
* set cuda device before run * add header file * fix compile
-
- 05 Aug, 2022 1 commit
-
-
Committed by Qi Li
-
- 01 Aug, 2022 2 commits
-
-
Committed by zhouweiwei2014
-
Committed by Wilber
* infer context fix place error. * update * update
-
- 29 Jul, 2022 1 commit
-
-
Committed by Leo Chen
* init * move CUDAStream to phi * fix compilation * merge develop * add stream_owned_ member * split cuda_stream.h * fix cpu compile * fix constructor * fix bug * fix windows compile * fix inference test_levit * fix windows tests
-