- 07 4月, 2023 11 次提交
-
-
由 kangguangli 提交于
* remove run_program * remove FLAGS_USE_STANDALONE_EXECUTOR
-
由 cyberslack_lee 提交于
-
由 Zhang Zheng 提交于
* [AMP OP&Test] Fix the logic of calling infer_dtype func in op test * add fp16
-
由 WangZhen 提交于
-
由 Guanghua Yu 提交于
-
由 TaoTao Li 提交于
fix merge conflicts
-
由 risemeup1 提交于
* remove zeros * remove zeros * apply gcc12 to py3 * apply gcc12 to py3 * fluid api clear * fluid api clean * fluid api clean
-
由 feifei-111 提交于
* fix dy2s grad name parse * pre-commit * bug fix * Fix grad/ error * Format code --------- Co-authored-by: N0x45f <wangzhen45@baidu.com>
-
由 Roc 提交于
* fix mkdir * update
-
由 Happyd99 提交于
* [Test MV] standalone_executor * update as * update as * update codestyle
-
由 Wang Xin 提交于
-
- 06 4月, 2023 12 次提交
-
-
由 ceci3 提交于
-
由 Sławomir Siwek 提交于
* replace matmul with matmul_v2 in fuse passes * Remove fusion logic from matmul * removing fusion methods * add proper name * adjust namespaces * clean attrs in python tests * delete checkpoint and restore matmul version * remove unused code * matmul and reshape/transpose fuses migrated * split MatmulOneDNN headers * fuse activation and eltwise_add * add fuse_activation * matmul_transpose_reshape/reshape_transpose_matmul * matmul + elementwise_add (fused) * activation temporary modifciation * restore matmul(v1) version 0 * merge newest develop * remove depedency from other PR * revert pbtxt * remove placeholders from matmul_v2 * add description in OPMaker * remove matmul_v2_op.h and all depedencies * remove dims changing in base op * add possibility to fuse already fused_matmul * restart broken CI * Empty-Commit * revert matmul_utils.h * codestyle * adjust imports * add pbtxt file * 100% matmul unit tests coverage * trigger CI with minimal changes to develop * adjust changes to develop * add fused_matmul op * inherit base ops * add "v2" * move OPMaker * Gradually add fused_matmul files * second batch of fused_matmul changes * split infershapes of matmul_v2 and fused_matmul * merge code from other PR * 2023 * inherit fused_matmul from matmul_v2 * Update paddle/phi/backends/onednn/onednn_reuse.h Co-authored-by: NTomasz Socha <tomasz.socha@intel.com> * Update paddle/phi/kernels/fusion/onednn/fused_matmul_kernel.cc Co-authored-by: NTomasz Socha <tomasz.socha@intel.com> * resolve conflicts * codestyle * simplify isgemmlinear * 2023 * remove import * reuse methods * matmul_v2_mkldnn cleanup * simplify ExecuteMatMulV1Grad * matmul refactored * fc * SetOutMemDescWithLogicalLayoutFusesSupport * matmul_v2 * alpha support * group repetetive funcs * matmul utils * execute matmul methods * restore registered kernel names * split header and impl files * remove double negatives * reduce numer of modified files * adjust ExecuteMatmul * add scales for ut * dates * limit number of modified files * fluid imports * remove alpha * codestyle --------- Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
-
由 Sonder 提交于
* add kernel functions * update kernel functions * update func parameters' name * create codes for gpu device * 调整文件位置 * fix include error * remove dependent files to phi/ * restore fused_attention_op.cu * fix dependence errors * fix dependence errors * fix include error * fix all depandence errors[build success] * remove useless include * recover useless include * use phi::ToNCCLDataType * fix namespace * update new register code * fix error in fused_gemm_epilogue_utils * fix error in FusedAttentionKernel parm * finish fused_attention registe code[build success] * add paddle::optional * add sig file * fix build error * fix a include error * update CMkaeList * fix parameter sequence * add include file * update #if before include * fix grammly error * update codes for DropoutParam * remove const cast * trans some fluid api to phi api * add #if * update test code * update test codes * recover test codes * trans fused_attention to fluid * move #endif to end * move #endif * delete useless files * use fused attention utils and recover random seed * remove fluid include in phi
-
由 scotty 提交于
-
由 Nyakku Shigure 提交于
-
由 Zhang Zheng 提交于
-
由 Kang Zhao 提交于
* feat: add relu composite rule * feat: add relu composite rule, maximum op * feat: add relu composite rule, maximum op * feat: add relu composite rule, polish comments * feat: add relu composite rule, polish comments * feat: add relu composite rule, add python api of relu * feat: add relu composite rule, commit hook * fix: maximum type error & ban cinn test * fix: maximum input sequence bugs * resolve conflicts * fix: code style bugs * add: relu fp16 test * feat: add rsqrt composite rule * feat: add rsqrt composite rule * resolve conflicts of composite rule * fix: delete check eager * feat: add roll grad composite rule * fix minus shift * fix test roll op
-
由 jiangcheng 提交于
* [CINN] disable CINN test_mean_op unittest to pass CINN CI * disable test_mean_op for pass ci
-
由 sneaxiy 提交于
* fix flash attn * fix another API
-
由 Kim Yann 提交于
* rem is_compiled_with_npu * rem nup related code * make lint happy * rem test * remove some tests * Update grad_scaler.py * fix an error
-
由 LoneRanger 提交于
* add fp16 and bf16 for eye and frame * fix bug * fix bug * fix bug * Update test_frame_op.py fix code style * fix bug * fix bug
-
由 WJJ1995 提交于
* fixed glog * add * add bfloat16 test for logical op * rm useless code * add uint16 * deal with comments * fixed code style * fixed code style * fixed for ci * deal with comments * fixed for ci
-
- 05 4月, 2023 1 次提交
-
-
由 zhouweiwei2014 提交于
-
- 04 4月, 2023 12 次提交
-
-
由 yuehuayingxueluo 提交于
* add gloo gather * add gloo_tools * fix CI bug * use gloo gather * remove redundant code * fix process_group_gloo.py * rename send_recv * fix conflict * fix conflict * fix codestyle * fix CI bug * add PADDLE_ENFORCE_NE
-
由 Tian 提交于
-
由 chenxujun 提交于
* Add pool3d lgamma masked_select tests * Fix code
-
由 JYChen 提交于
-
由 Nyakku Shigure 提交于
Co-authored-by: Nxiongkun <xiongkun03@baidu.com>
-
由 wanghuancoder 提交于
-
由 Ruibiao Chen 提交于
* Improve new executor static build * Skip GC for static build * Skip infershape for static build * Handle read_op * Add fused_attention to OpsWithFluidKernelNeedMoveToPhi * Fix argsort typos * Add sequence_pool to OpsWithFluidKernelNeedMoveToPhi * Fix skip share lod errors * Fix errors for adam * Fix errors for eigvals, memcpy and fake_quantize * Add static_build.cc * Add black list * Fix CI errors * Fix CI errors * Fix CI errors * Fix TensorArray * Fix TensorArray * Add update_loss_scaling to OpsNeedSetOutputDtypeWhenRegisterPhiKernel * Fix copy * Fix errors * Fix momentum * Skip mkldnn * Fix CI errors * Fix c_sync_calc_stream_op * Fix CINN * Fix while op * All CI pass, disable FLAGS to merge code, enable it after more tests in future * Add UTs * Fix typos * Fix typos * Add mkldnn UT * Remove mkldnn test * Fix typos * Fix dist test * Fix typos * Fix CI errors * Fix CI errors * Add UTs * Fix typos * Fix typos * Add sparse tests * ToComplexType -> ToComplex * Add test_matmul_op_static_build to disable_win_inference_test
-
由 LoneRanger 提交于
* relocate debugger.py * fix bug * fix bug * fix bug * fix bug
-
由 Jiabin Yang 提交于
* fix_prim * fix bug * add note * fix logic * fix * add note * fix check * fix bug * fix bug * fix bug * add debug * fix check * fix bug * sync print log * fix test case * change default * support jit save with fuse * add more check * sync with pr 52120 * add more ut --------- Co-authored-by: Ncyber-pioneer <chenzhuo@tju.edu.cn>
-
由 Kai Song 提交于
-
由 LoneRanger 提交于
* remove op.py * [Zero-Dim] change Tensor.numpy() usage to other equivalent usage, avoid hack (#52197) * [BugFix] fix compute error in fused_dropout_add (#52261) * fix bg * add utest * add utest * [CodeStyle][UP034] remove (()) cases (#52060) * add up34 * modify var name in loop * revert changes in test_slice * Revert "modify var name in loop" This reverts commit 6d748e371afb417054ed0c6b36fd11e87959a90d. * temporarily ignore test_slice.py * add comment * empty commit, re-trigger all ci * fix inc --------- Co-authored-by: NSigureMo <sigure.qaq@gmail.com> * [AMP OP&Test] add unittest for log_softmax (#52264) * Fix_Linux_[-Wterminate]warning (#52186) * [CustomOP Inplace] Automap inplace dtype and shape, prepare for vector<Tensor> output (#52214) * [CustomOP Inplace] Automap inplace dtype and shape, prepare for vector<Tensor> output * delete dtype,shape func of multi_inplace op * [CustomOP Inplace] Automap inplace dtype and shape, support vector<Tensor> output * [CustomOP Inplace] Auto-generate python API for inplace vector<Tensor> output * [AMP OP&Test] add float16 optest for reshape_op (#51678) * [AMP OP&Test] add float16 optest for reshape_op * add public_python_api * [AMP OP&Test] Add fp16/bf16 to clip op (#52158) * add fp16/bf16 to clip op * fix as reviewed * update test_clip_op.py * update test_clip_op.py * fix bug * fix code style * fix bug * fix bug --------- Co-authored-by: Zhou Wei <1183042833@qq.com> Co-authored-by: NShenLiang <1422485404@qq.com> Co-authored-by: N张春乔 <83450930+Liyulingyue@users.noreply.github.com> Co-authored-by: NSigureMo <sigure.qaq@gmail.com> Co-authored-by: NCcc <52520497+juncaipeng@users.noreply.github.com> Co-authored-by: NGalaxy1458 <55453380+Galaxy1458@users.noreply.github.com> Co-authored-by: NHongyuJia <jiahongyu@baidu.com> Co-authored-by: Nzhaoyingli <86812880+zhaoyinglia@users.noreply.github.com> Co-authored-by: Nwuyefeilin <30919197+wuyefeilin@users.noreply.github.com>
-
由 Jiabin Yang 提交于
-
- 03 4月, 2023 4 次提交
-
-
由 cyber-pioneer 提交于
* polish prim arg None check * fix bug
-
由 HongyuJia 提交于
* [CustomOP Optional Inplace] Custom operator supports inplace optional vector Tensor input * uncomment unittest codes
-
由 cyber-pioneer 提交于
* simplify bn vjp code * simplify composite rule * polish name
-
由 chenxujun 提交于
-