- 25 5月, 2023 1 次提交
-
-
由 thunder95 提交于
-
- 24 5月, 2023 1 次提交
-
-
由 Yiqun Liu 提交于
* Try to increase the repeat of autotune and fix the setting of allow_tf32_cublas. * Change the repeat of cublaslt to 10. * Use FLAGS_cublaslt_exhaustive_search_times as repeats. * Fix compiling error on CI. * Polish the key and simplify codes.
-
- 23 5月, 2023 1 次提交
-
-
由 cyberslack_lee 提交于
-
- 22 5月, 2023 3 次提交
-
-
由 risemeup1 提交于
* update_c++14_to_c++17_on_windows * disable test_audio_logmel_feature and test_audio_mel_feature
-
由 risemeup1 提交于
-
由 zhoutianzi666 提交于
* fix transfer_layout when input size if too big * do not add TransferLayoutKernelGPU * add int64 and add check
-
- 19 5月, 2023 4 次提交
-
-
由 limingshu 提交于
-
由 Galaxy1458 提交于
-
由 Galaxy1458 提交于
-
由 Galaxy1458 提交于
-
- 18 5月, 2023 2 次提交
-
-
由 engineer1109 提交于
-
由 co63oc 提交于
-
- 16 5月, 2023 2 次提交
-
-
由 Galaxy1458 提交于
* test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop
-
由 cyberslack_lee 提交于
-
- 15 5月, 2023 2 次提交
-
-
由 Galaxy1458 提交于
* test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop
-
由 Galaxy1458 提交于
* test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop
-
- 12 5月, 2023 1 次提交
-
-
由 Wang Xin 提交于
* sequence_mask functionalization * fix sequence_mask test
-
- 11 5月, 2023 1 次提交
-
-
由 张春乔 提交于
-
- 10 5月, 2023 3 次提交
-
-
由 傅剑寒 提交于
* add index_put api * fix value broadcast in backward and add test case in static * add timeout=120s for index_put * add op_compat for index_put * add inplace index_put test * add test case when index tensor in indices is int32 when indices.size less than x.dims * add index_put api backward in cpu place * add backward test case * refactor code to delete some duplicated code * replace reshape with resize for decrease extra memcpy * add datatype flag in backward yaml * fix bug in documentation * Update python/paddle/tensor/manipulation.py --------- Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com>
-
由 Galaxy1458 提交于
* test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop
-
由 Winters Montagne 提交于
-
- 09 5月, 2023 2 次提交
-
-
由 Bo Zhang 提交于
-
由 Galaxy1458 提交于
* test,test=develop * test,test=develop
-
- 08 5月, 2023 4 次提交
-
-
由 Zhang Zheng 提交于
-
由 张春乔 提交于
-
由 JYChen 提交于
-
由 co63oc 提交于
-
- 05 5月, 2023 3 次提交
-
-
由 iSerendipity 提交于
This reverts commit 9c406531.
-
由 Galaxy1458 提交于
* test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop
-
由 co63oc 提交于
* Add addmm tests * Fix code
-
- 28 4月, 2023 3 次提交
-
-
由 Bo Zhang 提交于
* change judgement for DropoutGradGPUKernelDriver * add UnrollerWithoutVecSize and after this Loaddata to be refined * pass unittest * use same unroller with XPU * BroadcastWithInt64Index * BroadcastDataLoader template partial specialization * fix compile errs in ROCms * clean ElementwiseT and InT for BroadcastKernel * default axis and clean inT * remove redundant fast divmod computation * optimize drop_nd & drop_nd_grad * optimize BroadcastDataLoader bf16 fp16 * rm InT etc. after merge develop * delete constexpr for windows ci * fix conflict * fix conflic with develop * fix conflic * new clean * clean
-
由 iSerendipity 提交于
-
由 superwinner1 提交于
* 'fmin' * 'fix' * 'fix'
-
- 27 4月, 2023 7 次提交
-
-
由 Bo Zhang 提交于
* change judgement for DropoutGradGPUKernelDriver * add UnrollerWithoutVecSize and after this Loaddata to be refined * pass unittest * use same unroller with XPU * BroadcastWithInt64Index * BroadcastDataLoader template partial specialization * fix compile errs in ROCms * PR comment
-
由 NetPunk 提交于
* support fp16 for maxout op * format code * change api * add test for static float16 * format code * formatting code * atol alignment * experiment—1 * experiment-2 * experiment-3 * format code
-
由 Sonder 提交于
* trans fused_feedward Compute function to phi * add register info * remove maxfunctor * move fused feedward to phi * remove sig file * remove fliud include * add include * add include * add sig file * add output register info * fix sig file * Update fused_feedforward_sig.cc * fix grad kernel * update output register info * fix * open fused_feedforward static build * add optional and fix code style * fix output info for fused attention * add optional param * merge
-
由 JYChen 提交于
-
由 engineer1109 提交于
-
由 Galaxy1458 提交于
* test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop
-
由 superwinner1 提交于
-