- 23 5月, 2023 2 次提交
-
-
由 weishengying 提交于
-
由 Wang Xin 提交于
* static graph autogen code support for pad3d op * bug fixed * add ut for pad3d mkldnn op * fix coverage * fix bug * fix bug * Delete test_pad3d_mkldnn_op.py
-
- 19 5月, 2023 2 次提交
-
-
由 Galaxy1458 提交于
-
由 Galaxy1458 提交于
-
- 18 5月, 2023 1 次提交
-
-
由 Hulek 提交于
* Fused elementwises kernels and ops * change fuse pass name * adjust .pbtxt files * adjust quantization attributes * add missing arguments and fix others, review fixed * simplify fused kernel registration * fix elementwise unit tests * reuse one fused elementwise op * adjust proto * Add supported datatypes * Change 'Scale' to 'scale' in tests, change some tests to onednn * Revert breaking changes * Fix unit tests * Delete obsolete test cases * Delete commented out code * Fix codestyle * delete temporary condition * fix conflicts and delete duplicate fusing * Fix code after merge * Move tests to new directory * fix tests volatility * Rename test_elementwise_add_onednn_op.py to test_elementwise_add_mkldnn_op.py * Update CMakeLists.txt add mkldnn op test --------- Co-authored-by: NSilv3S <slawomir.siwek@intel.com>
-
- 15 5月, 2023 3 次提交
-
-
由 huangjiyi 提交于
* update * fix bug * fix output type def
-
由 Galaxy1458 提交于
* test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop
-
由 Galaxy1458 提交于
* test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop
-
- 11 5月, 2023 1 次提交
-
-
由 Galaxy1458 提交于
* test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop
-
- 26 4月, 2023 1 次提交
-
-
由 Galaxy1458 提交于
* test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop
-
- 24 4月, 2023 2 次提交
-
-
由 zhouweiwei2014 提交于
-
由 YangQun 提交于
* support 0d tensor for shape and squeeze onednn kernel * set python api for shape op ut
-
- 17 4月, 2023 1 次提交
-
-
由 zhangyuqin1998 提交于
-
- 14 4月, 2023 2 次提交
-
-
由 YangQun 提交于
[Zero-Dim] support 0-D tensor for reduce/reshape/stack/prelu/expand_v2/gaussion onednn kernels (#52185) * support 0-D tensor for reduce/reshape/stack/prelu/expand_v2/gaussion ops * fix gaussian random mkldnn op ut
-
由 zhangyuqin1998 提交于
-
- 13 4月, 2023 2 次提交
-
-
由 HongyuJia 提交于
* [enforce.h Decouple logging.h] Delete glog/logging.h from enforce.h * Add logging.h for profiler.cc * Add logging.h for gloo_utils.h * Add logging.h for addmm_kernel_impl.h * Add logging.h for addmm_grad_kernel_impl.h * Add logging.h for p_send_kernel.cu * Add logging.h for determinant_grad_kernel_impl.h * Add logging.h for p_recv_kernel.cu * Add logging.h for elementwise_grad_base.h * Add logging.h for transfer_layout_kernel.cc * Add logging.h for eigvals_kernel.cc and index_select_impl.h * Add logging.h for all files in kernel directory * Add logging.h for xpu_info.cc * Add logging.h for xpu
-
由 zhangyuqin1998 提交于
-
- 06 4月, 2023 1 次提交
-
-
由 Sławomir Siwek 提交于
* replace matmul with matmul_v2 in fuse passes * Remove fusion logic from matmul * removing fusion methods * add proper name * adjust namespaces * clean attrs in python tests * delete checkpoint and restore matmul version * remove unused code * matmul and reshape/transpose fuses migrated * split MatmulOneDNN headers * fuse activation and eltwise_add * add fuse_activation * matmul_transpose_reshape/reshape_transpose_matmul * matmul + elementwise_add (fused) * activation temporary modifciation * restore matmul(v1) version 0 * merge newest develop * remove depedency from other PR * revert pbtxt * remove placeholders from matmul_v2 * add description in OPMaker * remove matmul_v2_op.h and all depedencies * remove dims changing in base op * add possibility to fuse already fused_matmul * restart broken CI * Empty-Commit * revert matmul_utils.h * codestyle * adjust imports * add pbtxt file * 100% matmul unit tests coverage * trigger CI with minimal changes to develop * adjust changes to develop * add fused_matmul op * inherit base ops * add "v2" * move OPMaker * Gradually add fused_matmul files * second batch of fused_matmul changes * split infershapes of matmul_v2 and fused_matmul * merge code from other PR * 2023 * inherit fused_matmul from matmul_v2 * Update paddle/phi/backends/onednn/onednn_reuse.h Co-authored-by: NTomasz Socha <tomasz.socha@intel.com> * Update paddle/phi/kernels/fusion/onednn/fused_matmul_kernel.cc Co-authored-by: NTomasz Socha <tomasz.socha@intel.com> * resolve conflicts * codestyle * simplify isgemmlinear * 2023 * remove import * reuse methods * matmul_v2_mkldnn cleanup * simplify ExecuteMatMulV1Grad * matmul refactored * fc * SetOutMemDescWithLogicalLayoutFusesSupport * matmul_v2 * alpha support * group repetetive funcs * matmul utils * execute matmul methods * restore registered kernel names * split header and impl files * remove double negatives * reduce numer of modified files * adjust ExecuteMatmul * add scales for ut * dates * limit number of modified files * fluid imports * remove alpha * codestyle --------- Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
-
- 04 4月, 2023 1 次提交
-
-
由 Ruibiao Chen 提交于
* Improve new executor static build * Skip GC for static build * Skip infershape for static build * Handle read_op * Add fused_attention to OpsWithFluidKernelNeedMoveToPhi * Fix argsort typos * Add sequence_pool to OpsWithFluidKernelNeedMoveToPhi * Fix skip share lod errors * Fix errors for adam * Fix errors for eigvals, memcpy and fake_quantize * Add static_build.cc * Add black list * Fix CI errors * Fix CI errors * Fix CI errors * Fix TensorArray * Fix TensorArray * Add update_loss_scaling to OpsNeedSetOutputDtypeWhenRegisterPhiKernel * Fix copy * Fix errors * Fix momentum * Skip mkldnn * Fix CI errors * Fix c_sync_calc_stream_op * Fix CINN * Fix while op * All CI pass, disable FLAGS to merge code, enable it after more tests in future * Add UTs * Fix typos * Fix typos * Add mkldnn UT * Remove mkldnn test * Fix typos * Fix dist test * Fix typos * Fix CI errors * Fix CI errors * Add UTs * Fix typos * Fix typos * Add sparse tests * ToComplexType -> ToComplex * Add test_matmul_op_static_build to disable_win_inference_test
-
- 29 3月, 2023 1 次提交
-
-
由 zengshao0622 提交于
* pad3d add unittests of fp16 and bf16 * pad3d add unittests of fp16 and bf16 * fix cuda place * fix random to uniform * fix class name * fix fp16 max relative error to 1.5e-3 * add dytpe register for onednn * add pad uint16 check of common.py * remove check_eager * test_check_grad --> test_check_grad_normal
-
- 27 3月, 2023 2 次提交
-
-
由 Xinyu Chen 提交于
-
由 Sławomir Siwek 提交于
* extract Op and OPMaker to .h * extend pattern for fused_op * set "with_residual" default to false * adjust fuse passes * remove fc+eltwise flag * fused_output_scale * activation attrs * remove extra attrs * fix int8/bf16 unit tests * simplify RecomputeOutputDims * remove unused method * Add description for attributes * add extra check * adjust op compats * update quantize test * fix protobuf parsing error * fix int8 performance * fused elementwises * merge develop * remove activation * restore activation for existing add/sub ops
-
- 22 3月, 2023 2 次提交
-
-
由 YangQun 提交于
* support 0-d tensor for element wise unary ops * fix python code style check * fix approval check * support 0-d tensor for onednn softmax and logsoftmax kernels * fix commnets * fix some unittests
-
由 Sławomir Siwek 提交于
* extract common methods to reuse * add header for transpose ops * fused_transpose * Split big function * transpose2 tests * fused_transpose * Apply extra attributes * add pbtxt file * update pbtxt * Merge develop * add more strict op compats * code style * remove mkldnn_data_type * unify SetOutMemDescWithReshape2FuseSupport * adjust quantize-dequantize for transpose * remove appendact * transpose2 quantization * fix int8 tests * adjust transpose_op to current develop * delete fusion code from transpose_kernel * add fused transpose to NHWC unittest * change order
-
- 21 3月, 2023 1 次提交
-
-
由 iSerendipity 提交于
* move DataType from paddle::experimental to phi * convert namespace * convert namespace * convert namespace * clarify namespace * convert more datatype * Revert "convert more datatype" This reverts commit 083b462959e6a22d4d8767707b628b95b396642e. * convert more in auto_code_generator * fix conflicts for XPU * fix namespace conflicts * fix errors * Revert "fix errors" This reverts commit f9d9958b54ee32141112274c8a5c3c381ab0f876. * fix errors * fix formatting
-
- 15 3月, 2023 1 次提交
-
-
由 zhangyuqin1998 提交于
* Delete hardswish_raw op * fix ut
-
- 13 3月, 2023 2 次提交
-
-
由 YuanRisheng 提交于
* remove transpose infershape * fix ci bugs * fix ci bugs * delete transpose infershape * fix ci bugs * fix ci bugs
-
由 Sławomir Siwek 提交于
* mkldnn->onednn * fused softplus op + kernel * remove extra attributes * add missing handler * change var name
-
- 10 3月, 2023 1 次提交
-
-
由 HappyHeavyRain 提交于
* Add function node in phi_kernel for MKLDNN * fix the bug in 'BuildInferVarKernelContext' * add infer_varkernel_utils.cc * fix the bug:the first two parametes of 'BuildInferVarKernelContext' can't be template variable * change the code according to first review * change the code according to first review * change the mode of paddle_build.sh * change 'infer_var_kernel_fn_' to 'get_kerneltype_forvar_fn_' * add the error information * fix NotFound infomation warning * fix NotFound infomation warning * fix NotFound infomation warning
-
- 09 3月, 2023 1 次提交
-
-
由 zhouweiwei2014 提交于
-
- 06 3月, 2023 3 次提交
-
-
由 傅剑寒 提交于
* add bf16 fp16 type support for interpolate * add bf16 fp16 support for interpolate in phi on cpu
-
由 Huang Jiyi 提交于
* move DeviceContextPool to phi * add EmplaceExternalContextFunc * update namespace * update cmake * fix bugs and create context_pool_impl.h * replace platform::is_xxx_place * fix bugs * update generator * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix enforce usage * Revert "fix enforce usage" This reverts commit 5f521f08a69713cee506e64a00ec6d9fba709e27. * fix bugs * rm XPUDeviceContext and CustomDeviceContext * fix bugs * fix fix context init bug * fix bugs after merge * fix bugs * fix name * fix mutable_data * update and fix bugs * fix bugs * update * fix bugs * fix name * fix bugs * merge * fix bugs * create context_pool in phi/backends * create context_pool in phi/backends * fix bugs * fix xpu bugs * fix rocm bugs * fix bugs * fix bugs * fix bugs * fix xpu bugs * update * update * fix bugs * fix bugs
-
由 Sławomir Siwek 提交于
* matmul refactored * fc * SetOutMemDescWithLogicalLayoutFusesSupport * matmul_v2 * alpha support * group repetetive funcs * matmul utils * execute matmul methods * restore registered kernel names * split header and impl files * remove double negatives * increase coverage * add onednn tests to ctest * remove fusion logic from base matmuls
-
- 15 2月, 2023 1 次提交
-
-
由 YuanRisheng 提交于
* move profiler * add file * fix mac compile bugs * fix ci bugs * fix mac bugs * fix ci bugs * fix compile bugs * perfect code according comment
-
- 08 2月, 2023 1 次提交
-
-
由 Paulina Gacek 提交于
* QuantTranpose pattern is being found by pass * quant + transpose fuse * code style changes * UT written, reorder fixed * Dequantize + transpose2 fuse added * pass name changed * UT added & shift corrected * got rid of redundancy * review changes * AsIntermediate corrected * compat added
-
- 06 2月, 2023 2 次提交
-
-
由 zyfncg 提交于
* remove extra input of conv2d * fix bug * fix unittest bug * adjust conv2d.pbtxt * fix cpu_quantize_pass_tester * revert use_addto of conv2d * fix runtime attribute * fix bug * recover force_fp32_output in conv2d * refine error info * fix bug
-
由 engineer1109 提交于
-
- 03 2月, 2023 1 次提交
-
-
由 Sławomir Siwek 提交于
* replace matmul with matmul_v2 in fuse passes * Remove fusion logic from matmul * removing fusion methods * add proper name * adjust namespaces * clean attrs in python tests * delete checkpoint and restore matmul version * remove unused code * matmul and reshape/transpose fuses migrated * split MatmulOneDNN headers * fuse activation and eltwise_add * add fuse_activation * matmul_transpose_reshape/reshape_transpose_matmul * matmul + elementwise_add (fused) * activation temporary modifciation * merge newest develop * remove depedency from other PR * revert pbtxt * remove placeholders from matmul_v2 * add description in OPMaker * remove matmul_v2_op.h and all depedencies * remove dims changing in base op * add possibility to fuse already fused_matmul * restart broken CI * Empty-Commit * revert matmul_utils.h * codestyle * adjust imports * add pbtxt file * 100% matmul unit tests coverage * trigger CI with minimal changes to develop * adjust changes to develop * add fused_matmul op * inherit base ops * add "v2" * move OPMaker * Gradually add fused_matmul files * second batch of fused_matmul changes * split infershapes of matmul_v2 and fused_matmul * inherit fused_matmul from matmul_v2 * Update paddle/phi/backends/onednn/onednn_reuse.h Co-authored-by: NTomasz Socha <tomasz.socha@intel.com> * Update paddle/phi/kernels/fusion/onednn/fused_matmul_kernel.cc Co-authored-by: NTomasz Socha <tomasz.socha@intel.com> --------- Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
-
- 20 1月, 2023 1 次提交
-
-
由 jakpiase 提交于
* fix for matmul_grad * another fix for matmul_grad * fix
-
- 12 1月, 2023 1 次提交
-
-
由 YuanRisheng 提交于
* rename kernel * delete sig * modify code according comment * fix ci bugs
-