- 06 4月, 2023 11 次提交
-
-
由 Sławomir Siwek 提交于
* replace matmul with matmul_v2 in fuse passes * Remove fusion logic from matmul * removing fusion methods * add proper name * adjust namespaces * clean attrs in python tests * delete checkpoint and restore matmul version * remove unused code * matmul and reshape/transpose fuses migrated * split MatmulOneDNN headers * fuse activation and eltwise_add * add fuse_activation * matmul_transpose_reshape/reshape_transpose_matmul * matmul + elementwise_add (fused) * activation temporary modifciation * restore matmul(v1) version 0 * merge newest develop * remove depedency from other PR * revert pbtxt * remove placeholders from matmul_v2 * add description in OPMaker * remove matmul_v2_op.h and all depedencies * remove dims changing in base op * add possibility to fuse already fused_matmul * restart broken CI * Empty-Commit * revert matmul_utils.h * codestyle * adjust imports * add pbtxt file * 100% matmul unit tests coverage * trigger CI with minimal changes to develop * adjust changes to develop * add fused_matmul op * inherit base ops * add "v2" * move OPMaker * Gradually add fused_matmul files * second batch of fused_matmul changes * split infershapes of matmul_v2 and fused_matmul * merge code from other PR * 2023 * inherit fused_matmul from matmul_v2 * Update paddle/phi/backends/onednn/onednn_reuse.h Co-authored-by: NTomasz Socha <tomasz.socha@intel.com> * Update paddle/phi/kernels/fusion/onednn/fused_matmul_kernel.cc Co-authored-by: NTomasz Socha <tomasz.socha@intel.com> * resolve conflicts * codestyle * simplify isgemmlinear * 2023 * remove import * reuse methods * matmul_v2_mkldnn cleanup * simplify ExecuteMatMulV1Grad * matmul refactored * fc * SetOutMemDescWithLogicalLayoutFusesSupport * matmul_v2 * alpha support * group repetetive funcs * matmul utils * execute matmul methods * restore registered kernel names * split header and impl files * remove double negatives * reduce numer of modified files * adjust ExecuteMatmul * add scales for ut * dates * limit number of modified files * fluid imports * remove alpha * codestyle --------- Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
-
由 Sonder 提交于
* add kernel functions * update kernel functions * update func parameters' name * create codes for gpu device * 调整文件位置 * fix include error * remove dependent files to phi/ * restore fused_attention_op.cu * fix dependence errors * fix dependence errors * fix include error * fix all depandence errors[build success] * remove useless include * recover useless include * use phi::ToNCCLDataType * fix namespace * update new register code * fix error in fused_gemm_epilogue_utils * fix error in FusedAttentionKernel parm * finish fused_attention registe code[build success] * add paddle::optional * add sig file * fix build error * fix a include error * update CMkaeList * fix parameter sequence * add include file * update #if before include * fix grammly error * update codes for DropoutParam * remove const cast * trans some fluid api to phi api * add #if * update test code * update test codes * recover test codes * trans fused_attention to fluid * move #endif to end * move #endif * delete useless files * use fused attention utils and recover random seed * remove fluid include in phi
-
由 scotty 提交于
-
由 RedContritio 提交于
-
由 RedContritio 提交于
-
由 张春乔 提交于
-
由 Jiabin Yang 提交于
-
由 陈沧夜 提交于
-
由 RedContritio 提交于
-
由 RedContritio 提交于
* support auto generate static for randint * move seed from extra to attrs
-
由 RedContritio 提交于
-
- 04 4月, 2023 7 次提交
-
-
由 lzydev 提交于
* autogen embedding * deal * fix bug in CompatMetaTensor::share_lod
-
由 huangjiyi 提交于
* update * fix bug * update * fix bug
-
由 Ruibiao Chen 提交于
* Improve new executor static build * Skip GC for static build * Skip infershape for static build * Handle read_op * Add fused_attention to OpsWithFluidKernelNeedMoveToPhi * Fix argsort typos * Add sequence_pool to OpsWithFluidKernelNeedMoveToPhi * Fix skip share lod errors * Fix errors for adam * Fix errors for eigvals, memcpy and fake_quantize * Add static_build.cc * Add black list * Fix CI errors * Fix CI errors * Fix CI errors * Fix TensorArray * Fix TensorArray * Add update_loss_scaling to OpsNeedSetOutputDtypeWhenRegisterPhiKernel * Fix copy * Fix errors * Fix momentum * Skip mkldnn * Fix CI errors * Fix c_sync_calc_stream_op * Fix CINN * Fix while op * All CI pass, disable FLAGS to merge code, enable it after more tests in future * Add UTs * Fix typos * Fix typos * Add mkldnn UT * Remove mkldnn test * Fix typos * Fix dist test * Fix typos * Fix CI errors * Fix CI errors * Add UTs * Fix typos * Fix typos * Add sparse tests * ToComplexType -> ToComplex * Add test_matmul_op_static_build to disable_win_inference_test
-
由 cyberslack_lee 提交于
* bce_loss * fix error * fix * fix * fix * reslove confilict
-
由 lzydev 提交于
* fix bug of redefine use_equal_all * fix bug of redefine use_equal_all
-
由 huangjiyi 提交于
* update * fix bug * fix bug * revert diag_op * revert expand_op and expand_as_op * fix bug * fix bug
-
由 zhangyuqin1998 提交于
* rename_bilinear_tensor_product * fix
-
- 03 4月, 2023 9 次提交
-
-
由 engineer1109 提交于
-
由 chenxujun 提交于
-
由 RedContritio 提交于
* support auto generate static for gaussian (gaussian_random) * move gaussian_random_batch_size_like Kernels from gaussian_random_op.* to gaussian_random_batch_size_like_op.*
-
由 gouzil 提交于
* add autogen code support for accuracy * fix input
-
由 risemeup1 提交于
* fix_gcc12_error * fix_gcc12_error * fix gcc12_error * fix_gcc12_error
-
由 risemeup1 提交于
* fix error,test=document_fix * test * fix gcc12_error * fix gcc12_error * fix gcc12_error * fix_gcc12_py3_error * fix_range-loop-construct_error * fix_gcc12_error
-
由 xiaoyuanzi914 提交于
* add autogen code support for auc_op * update --------- Co-authored-by: Nwqgo <1552367872@qq.com>
-
由 Young-Flash 提交于
-
由 LoneRanger 提交于
【PaddlePaddle Hackathon 4】No.56 : add fp16 test and bf16 test for diag, diagonal, fill and fill_diagonal_tensor (#51649)
-
- 01 4月, 2023 1 次提交
-
-
由 jjyaoao 提交于
* Delete the /paddle/fluid/platform/device/npu directory * clear Cmakelists * Try removing npu in the header file
-
- 31 3月, 2023 9 次提交
-
-
由 RedContritio 提交于
-
由 huangjiyi 提交于
* update bipartite_match * update * fix bug * fix test * fix bug * fix Kunlun-KP-Build * Revert "fix Kunlun-KP-Build" This reverts commit ceab63cc23079fd6839c826bb52db893fb056355. * update
-
由 jameszhang 提交于
* [kunlun] prevent numerical overflow in collective softmax_with_ce * add fix in another branch
-
由 chenjian 提交于
* first commit * add registry * add unit test * fix format * add unit test * fix bug * replace unsuqeeze to reshape * fix * fix unit test * update test * update test * fix unit test * fix * fix
-
由 zyfncg 提交于
* add yaml for some op * fix inplace_abn * fix test_leaky_relu_grad_grad_functor * fix yaml * fix typo
-
由 YuanRisheng 提交于
* remove distribute * fix py3 bugs * fix gpu-ps bugs * fix compile bugs * fix unittest bugs
-
由 engineer1109 提交于
fix with auto&
-
由 张春乔 提交于
* autofix Co-authored-by: NLiyulingyue <83450930+Liyulingyue@users.noreply.github.com> * revert changes in python/paddle/distributed/fleet/utils/hybrid_parallel_util.py * empty commit, trigger ci * fix test_slice --------- Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
-
由 Yuang Liu 提交于
-
- 30 3月, 2023 3 次提交
-
-
由 zhupengyang 提交于
-
由 Feiyu Chan 提交于
1. add type caster for paddle's complex type, to allow pybind to automatically cast it with python's complex type; 2. add complex64 and complex128 data type for `libpaddle.Tensor`'s element get and set(which is required to perturb an element to get the numerical derivative) 3. add support for cuda pinned place in `libpaddle.Tensor` element get and set --- 4. fix a bug in op code generation.(Creation of output folder in concurrent with parsing op yamls.)
-
由 Wang Xin 提交于
* add autogen code support for spectral_norm * bug fixed * fix PR-CI-Static-Check fail
-