- 10 Nov, 2022: 1 commit
  Committed by PuQing
  * rm fluid gpu_launch_config
  * fix type
- 29 Sep, 2022: 1 commit
  Committed by carryyu
  * fix P40 topk: Make the optimized topk compatible with P40.
  * fix P40 topk: Make the optimized topk compatible with P40.
  * fix P40 topk: Make the optimized topk compatible with P40.
- 22 Sep, 2022: 1 commit
  Committed by carryyu
  * Optimize topk's performance when k is small and input_width is large
  * Modify the blockdim setting logic
  * Update top_k_function_cuda.h
- 01 Aug, 2022: 1 commit
  Committed by Leo Chen
  * remove cudaDeviceContext
  * remove more template
  * fix rocm compile
  * remove alias name CUDADeviceContext
  * fix compile
  * fix tests
  * revert changes
- 24 Jun, 2022: 1 commit
  Committed by YuanRisheng
  * perfect copy
  * deal with conflict
  * deal with conflict
  * fix compile bugs
  * fix unittest bugs
  * change code format
  * deal with conflict
  * modify code by review
  * fix ce bugs
  * fix ce bugs
  * add lo
  * perfect code format
  * deal with conflicts
- 21 Jun, 2022: 1 commit
  Committed by Sing_chan
  resort .cu headers, set clang-format not sort include block and consider .cu as main source file (#43633)
- 05 Jun, 2022: 1 commit
  Committed by Sing_chan
- 02 Apr, 2022: 1 commit
  Committed by Zhang Zheng
  Co-authored-by: Nroot <root@yq01-sys-hic-k8s-v100-box-a225-0186.yq01.baidu.com>
- 30 Mar, 2022: 1 commit
  Committed by Zhang Zheng
  * Optimize the perf of top_k when k is too large
  * fix rcom compile
  * fix
  * only compile in cuda
  * fix log info
- 14 Mar, 2022: 1 commit
  Committed by Zhang Zheng
- 10 Mar, 2022: 1 commit
  Committed by Zhang Zheng
  * first commit
  * cpu kernel
  * first version
  * fix compile error
  * fix compile error
  * delete v2
  * fix
  * fix
  * add alias
  * fix
  * fix
  * fix
  * fix error
  * fix
  * fix
  * fix
  * fix format