Try to increase the repeat of autotune and fix the setting of allow_tf32_cublas. (#53622)
* Try to increase the repeat of autotune and fix the setting of allow_tf32_cublas. * Change the repeat of cublaslt to 10. * Use FLAGS_cublaslt_exhaustive_search_times as repeats. * Fix compiling error on CI. * Polish the key and simplify codes.
Showing
想要评论请 注册 或 登录