Created by: hutuxian
Reduce the time complexity of the AUC op from O(slide_step * num_bucket) to O(num_bucket), and add a CUDA kernel for it. Optimizing the AUC kernel further, from O(num_bucket) to O(log(num_bucket)), is left as future work.
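For illustration only, here is a minimal Python sketch of one way a sliding-window AUC update can cost O(num_bucket) per step instead of O(slide_step * num_bucket): keep a running sum of the per-batch histograms, subtract the batch that leaves the window, and add the new one. This is an assumption about the general technique, not the op's actual implementation; `SlidingAuc`, `bucket_auc`, and their parameters are hypothetical names.

```python
from collections import deque


def bucket_auc(pos, neg):
    """Trapezoidal AUC from per-bucket label counts in one O(num_bucket) pass."""
    tot_pos = tot_neg = 0
    area = 0.0
    # Walk from the highest-score bucket down, accumulating the ROC area.
    for p, n in zip(reversed(pos), reversed(neg)):
        prev_pos = tot_pos
        tot_pos += p
        tot_neg += n
        area += n * (tot_pos + prev_pos) / 2.0
    if tot_pos == 0 or tot_neg == 0:
        return 0.0  # AUC is undefined with a single class; return 0 by convention
    return area / (tot_pos * tot_neg)


class SlidingAuc:
    """Hypothetical helper: AUC over the last `slide_step` batches."""

    def __init__(self, num_bucket, slide_step):
        self.num_bucket = num_bucket
        self.slide_step = slide_step
        self.window = deque()            # per-batch histograms still in the window
        self.sum_pos = [0] * num_bucket  # running sum over the window
        self.sum_neg = [0] * num_bucket

    def update(self, preds, labels):
        # Histogram of the new batch (predictions assumed to lie in [0, 1]).
        pos = [0] * self.num_bucket
        neg = [0] * self.num_bucket
        for score, label in zip(preds, labels):
            b = min(int(score * self.num_bucket), self.num_bucket - 1)
            if label:
                pos[b] += 1
            else:
                neg[b] += 1
        # Evict the oldest batch from the running sum: O(num_bucket).
        if len(self.window) == self.slide_step:
            old_pos, old_neg = self.window.popleft()
            for i in range(self.num_bucket):
                self.sum_pos[i] -= old_pos[i]
                self.sum_neg[i] -= old_neg[i]
        # Add the new batch to the running sum: O(num_bucket).
        self.window.append((pos, neg))
        for i in range(self.num_bucket):
            self.sum_pos[i] += pos[i]
            self.sum_neg[i] += neg[i]
        return bucket_auc(self.sum_pos, self.sum_neg)
```

Re-summing every histogram in the window on each step would cost O(slide_step * num_bucket); the running sum trades that for one extra histogram of storage.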