Optimize topk's performance when k is small and input_width is large (#45312)
* Optimize topk's performance when k is small and input_width is large
* Modify the blockdim setting logic
* Update top_k_function_cuda.h