• J
    Support reduce_sum_op float16 (#32966) · 606939de
    jiangcheng 提交于
    * add reduce_sum_op by add self-kernel
    
    * set all ReduceKernel MPType for accuracy
    
    * add float16 test script which input is integer number
    
    * solve reduce sum float16 check_grad problem
    
    * solve conflict and change test script for CI
    
    * change kernel register for CI
    
    * remove all useless template
    606939de
cub_reduce.h 17.1 KB