- 09 May, 2023 5 commits
-
-
Committed by zhouweiwei2014
* [Zero-Dim] add 0D Tensor UT case for XPU * fix comment * remove some unnecessary UT
-
Committed by RuohengMa
* bind sparse_coo_tensor, reduce_max/max_int32, range/arange_int32, equal_bool, scatter_grad_float32, nearest_interp_int64 kernels * add more unit tests; modify compilation logic of xpu sparse kernels
-
Committed by Zhan Rongrui
-
Committed by Bo Zhang
-
Committed by Galaxy1458
* test,test=develop * test,test=develop
-
- 08 May, 2023 7 commits
-
-
Committed by GGBond8488
* add complex support for optest * add complex grad test * append one * move some debug info * move some debug info * move some debug info * move some debug info * add more complex test * Fix naming ambiguity * Revert "add more complex test" This reverts commit dbcb0516b8e53ba42e2d6089878a39b395345969. * change backward gradient, add TODO
-
Committed by Zhang Zheng
-
Committed by 张春乔
-
Committed by wz1qqx
-
Committed by JYChen
-
Committed by co63oc
-
Committed by umiswing
-
- 06 May, 2023 5 commits
-
-
Committed by Yiqun Liu
-
Committed by umiswing
kernels.
-
Committed by RuohengMa
-
Committed by zhangyuqin1998
* Rename randint_raw and move it to legacy * Update fetch_v2_op.cc * Update randint_kernel.cc * Update randint_kernel.cu * Empty Commit to setup deployments
-
Committed by zhangyuqin1998
* move UniformRawKernel to legacy * Update uniform_kernel.cc * Update uniform_kernel.cu * Update uniform_kernel.cc * Update uniform_kernel.cu * Update uniform_kernel.h * Update uniform_kernel.cc * Empty Commit to setup deployments
-
- 05 May, 2023 6 commits
-
-
Committed by iSerendipity
This reverts commit 9c406531.
-
Committed by Galaxy1458
* test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop
-
Committed by shentanyue
-
Committed by co63oc
-
Committed by haosicheng
-
Committed by co63oc
* Add addmm tests * Fix code
-
- 04 May, 2023 1 commit
-
-
Committed by co63oc
-
- 28 April, 2023 5 commits
-
-
Committed by Bo Zhang
* change judgement for DropoutGradGPUKernelDriver * add UnrollerWithoutVecSize and after this Loaddata to be refined * pass unittest * use same unroller with XPU * BroadcastWithInt64Index * BroadcastDataLoader template partial specialization * fix compile errs in ROCms * clean ElementwiseT and InT for BroadcastKernel * default axis and clean inT * remove redundant fast divmod computation * optimize drop_nd & drop_nd_grad * optimize BroadcastDataLoader bf16 fp16 * rm InT etc. after merge develop * delete constexpr for windows ci * fix conflict * fix conflic with develop * fix conflic * new clean * clean
-
Committed by GGBond8488
* add 0d support for dist, trace, paddle.linalg.cond test=allcase * add_0d_output_support_for_det * test=allcase * support_0d_output_for_linalg.norm * support linalg.norm 0d output, test=allcase * fix 0D test * fix zero dim test, test=allcase * fix 0D test * fix tets,test=allcase * fix error,test=allcase * fix errors ,test=allcase * add static backward , test=allcase * add static backwward test, test=allcase * fix pr-ci-build error;test=document_fix (#53060) * [Cherry-Pick] Unique support float16&bfloat16 (#53023) unique supports the float16 and bfloat16 data types, with the related unit tests improved. * slogdet_support_0D_output * add new case * fix tests, test=allcase * fix p_norm related test, test=allcase * fix some err, test=allcase * test=allcase * move out trace * open some case, test=allcase * fix norm all case, test=allcase * fix some test error, test=allcase * fix typro,test=allcase * fix test err, test=allcase * test=allcase * test * fix test error, test=allcase * fix test error, test=allcase * fallback norm, test=allcase --------- Co-authored-by: tianshuo78520a <707759223@qq.com> Co-authored-by: Zhang Zheng <32410583+ZzSean@users.noreply.github.com>
-
Committed by iSerendipity
-
Committed by co63oc
-
Committed by superwinner1
* 'fmin' * 'fix' * 'fix'
-
- 27 April, 2023 10 commits
-
-
Committed by Bo Zhang
* change judgement for DropoutGradGPUKernelDriver * add UnrollerWithoutVecSize and after this Loaddata to be refined * pass unittest * use same unroller with XPU * BroadcastWithInt64Index * BroadcastDataLoader template partial specialization * fix compile errs in ROCms * PR comment
-
Committed by gouzil
* [phi] move sequence_pool kernel to phi * mv kernels impl * fix parameter error * clean include * fix compat filename * [phi] move fluid sequence_pool_grad to phi * [phi][compat] sig rm GradVarName * [phi] fix sequence_pool out type * [phi] rm impl, add const string * [phi] fix const str * fix sequence_pooling cmake * [phi] mv sequence_pooling_test * [phi] fix grad sig * [phi] fix sequence_pool is_test error * [phi] fix sequence_pooling gpu include * [phi] mv to impl * [phi] fix SequencePoolFunctor cu include * [phi] modify out max_index int32_t * [phi] add pooltype mapping determine * [phi] fix sequence_pool_sig * [phi] fix sequence_pool_sig sum * [phi] try ci * [phi] fix max_index optional
-
Committed by NetPunk
* support fp16 for maxout op * format code * change api * add test for static float16 * format code * formatting code * atol alignment * experiment—1 * experiment-2 * experiment-3 * format code
-
Committed by Sonder
* trans fused_feedward Compute function to phi * add register info * remove maxfunctor * move fused feedward to phi * remove sig file * remove fliud include * add include * add include * add sig file * add output register info * fix sig file * Update fused_feedforward_sig.cc * fix grad kernel * update output register info * fix * open fused_feedforward static build * add optional and fix code style * fix output info for fused attention * add optional param * merge
-
Committed by JYChen
-
Committed by engineer1109
-
Committed by Galaxy1458
* test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop
-
Committed by houj04
* [XPU] remove scale_loss in parallel.py * [XPU] throw Unimplemented when using Reducer
-
Committed by superwinner1
-
Committed by cyberslack_lee
-
- 26 April, 2023 1 commit
-
-
Committed by Ruibiao Chen
* Fix fused_attention_op and fused_feedforward_op bugs in xpu * Fix d_x alloc errors for fused_feedforward_grad_kernel
-