- 09 4月, 2023 1 次提交
-
-
由 ronnywang 提交于
* [PHI CAPI] support complex dtype kernel * update
-
- 07 4月, 2023 1 次提交
-
-
由 Wang Xin 提交于
-
- 06 4月, 2023 1 次提交
-
-
由 Sławomir Siwek 提交于
* replace matmul with matmul_v2 in fuse passes * Remove fusion logic from matmul * removing fusion methods * add proper name * adjust namespaces * clean attrs in python tests * delete checkpoint and restore matmul version * remove unused code * matmul and reshape/transpose fuses migrated * split MatmulOneDNN headers * fuse activation and eltwise_add * add fuse_activation * matmul_transpose_reshape/reshape_transpose_matmul * matmul + elementwise_add (fused) * activation temporary modifciation * restore matmul(v1) version 0 * merge newest develop * remove depedency from other PR * revert pbtxt * remove placeholders from matmul_v2 * add description in OPMaker * remove matmul_v2_op.h and all depedencies * remove dims changing in base op * add possibility to fuse already fused_matmul * restart broken CI * Empty-Commit * revert matmul_utils.h * codestyle * adjust imports * add pbtxt file * 100% matmul unit tests coverage * trigger CI with minimal changes to develop * adjust changes to develop * add fused_matmul op * inherit base ops * add "v2" * move OPMaker * Gradually add fused_matmul files * second batch of fused_matmul changes * split infershapes of matmul_v2 and fused_matmul * merge code from other PR * 2023 * inherit fused_matmul from matmul_v2 * Update paddle/phi/backends/onednn/onednn_reuse.h Co-authored-by: NTomasz Socha <tomasz.socha@intel.com> * Update paddle/phi/kernels/fusion/onednn/fused_matmul_kernel.cc Co-authored-by: NTomasz Socha <tomasz.socha@intel.com> * resolve conflicts * codestyle * simplify isgemmlinear * 2023 * remove import * reuse methods * matmul_v2_mkldnn cleanup * simplify ExecuteMatMulV1Grad * matmul refactored * fc * SetOutMemDescWithLogicalLayoutFusesSupport * matmul_v2 * alpha support * group repetetive funcs * matmul utils * execute matmul methods * restore registered kernel names * split header and impl files * remove double negatives * reduce numer of modified files * adjust ExecuteMatmul * add scales for ut * dates * limit number of modified files * fluid imports * remove alpha * codestyle --------- Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
-
- 03 4月, 2023 3 次提交
-
-
由 engineer1109 提交于
-
由 risemeup1 提交于
* fix_gcc12_error * fix_gcc12_error * fix gcc12_error * fix_gcc12_error
-
由 wz1qqx 提交于
-
- 31 3月, 2023 4 次提交
-
-
由 csy0225 提交于
-
由 YuanRisheng 提交于
* remove distribute * fix py3 bugs * fix gpu-ps bugs * fix compile bugs * fix unittest bugs
-
由 houj04 提交于
-
由 zhupengyang 提交于
-
- 30 3月, 2023 1 次提交
-
-
由 ykkk2333 提交于
-
- 29 3月, 2023 1 次提交
-
-
由 chenxujun 提交于
-
- 27 3月, 2023 2 次提交
-
-
由 Xinyu Chen 提交于
-
由 Leo Guo 提交于
instance_norm_grad kernel. Fix bugs that the data type of input is different from output in reduce_sum kernel. test=kunlun
-
- 24 3月, 2023 1 次提交
-
-
由 YuanRisheng 提交于
* decouple memory copy * fix ci bugs * fix ci compile bugs * fix rocm compile * fix ci bugs * decouple memory * deal with conflict * fix xpu compile bugs * fix xpu bugs * deal with xpu bugs * fix cmake bugs * fix windows bugs * fix ci bugs * fix ci bugs * delete redundance code * add code for pybind * fix py3 bugs * fix ci bugs
-
- 23 3月, 2023 1 次提交
-
-
由 ZhouMengLei1999 提交于
-
- 22 3月, 2023 2 次提交
-
-
由 YangQun 提交于
* support 0-d tensor for element wise unary ops * fix python code style check * fix approval check * support 0-d tensor for onednn softmax and logsoftmax kernels * fix commnets * fix some unittests
-
由 Sławomir Siwek 提交于
* extract common methods to reuse * add header for transpose ops * fused_transpose * Split big function * transpose2 tests * fused_transpose * Apply extra attributes * add pbtxt file * update pbtxt * Merge develop * add more strict op compats * code style * remove mkldnn_data_type * unify SetOutMemDescWithReshape2FuseSupport * adjust quantize-dequantize for transpose * remove appendact * transpose2 quantization * fix int8 tests * adjust transpose_op to current develop * delete fusion code from transpose_kernel * add fused transpose to NHWC unittest * change order
-
- 21 3月, 2023 2 次提交
-
-
由 iSerendipity 提交于
* move DataType from paddle::experimental to phi * convert namespace * convert namespace * convert namespace * clarify namespace * convert more datatype * Revert "convert more datatype" This reverts commit 083b462959e6a22d4d8767707b628b95b396642e. * convert more in auto_code_generator * fix conflicts for XPU * fix namespace conflicts * fix errors * Revert "fix errors" This reverts commit f9d9958b54ee32141112274c8a5c3c381ab0f876. * fix errors * fix formatting
-
由 houj04 提交于
* [XPU] add fp16 support for compare ops. * fix ci.
-
- 20 3月, 2023 4 次提交
-
-
由 duanyanhui 提交于
-
由 Zhang Na 提交于
-
由 mayang002 提交于
-
由 ykkk2333 提交于
* add xpu tile and concat kernel int64, test=kunlun * fix previous xpu dataoader bug, and add maxpool3dgrad special dim support, test=kunlun
-
- 16 3月, 2023 1 次提交
-
-
由 Huang Jiyi 提交于
* remove fluid thread_data_registry * update * fix bug
-
- 15 3月, 2023 4 次提交
-
-
由 limingshu 提交于
-
由 iSerendipity 提交于
* Revert "Revert "【Hackathon No.67】remove operator.h in blas.h (#50989)" (#51467)" This reverts commit b9d91531. * remove cout * add header * fix missing header * fix refer fluid error * fix missing header * 更新 repeat_interleave_grad_kernel_impl.h Change to phi style datatype. * 更新 repeat_interleave_grad_kernel_impl.h Fix missing header * datatype fluid -> phi * paddle::experimental -> phi * fix reference error * fix reference error * fix reference error * fix errors * fix missing FLAGS * fix missing headers * fix missing headers * fix missing headers * fix missing headers * fix missing header * fix missing header * fix errors
-
由 zhangyuqin1998 提交于
* Delete randperm raw op * fix
-
由 pangyoki 提交于
-
- 14 3月, 2023 1 次提交
-
-
由 pangyoki 提交于
* cuda graph support multi-stream for new executor * fix windows compile error * delete create_cuda_graph_stream
-
- 13 3月, 2023 5 次提交
-
-
由 iSerendipity 提交于
* Replace paddle::experimental::DataType as phi::DataType * restore custom_device.cc
-
由 Huang Jiyi 提交于
* platform::CUDAPinnedDeviceContext -> phi::GPUPinnedContext * replace platform::TraceEventCollector
-
由 Sławomir Siwek 提交于
* mkldnn->onednn * fused softplus op + kernel * remove extra attributes * add missing handler * change var name
-
由 houj04 提交于
* [XPU] add increment op. * fix ci
-
由 RuohengMa 提交于
* [XPU] add reduce_min_grad XPU kernel * add unittest for reduce_min xpu op
-
- 10 3月, 2023 5 次提交
-
-
由 ykkk2333 提交于
-
由 shentanyue 提交于
-
由 QingshuChen 提交于
-
由 ykkk2333 提交于
-
由 zhangyikun02 提交于
-