“0b061960effdaa8dbdbcd9044c4100389deeb4a3”上不存在“develop/doc/tutorials/image_classification/index_en.html”
Combination of multiple paddle::memory::allocate operation into one for ops (#49126)
* A leap of try for cudaLaunchCooperativeKernel * fix bugs * Totally replace the lar cuda kernel * Fix bugs * fix code according to comments * fix codes according to review comments * adding some function overload * relocate the power operation. * add bf16 support for index select relevant ops * revert bf16 type change. * add changes for more op * fix code writting bugs
Showing
想要评论请 注册 或 登录