• W
    [Feature] support mix precision training for resnet (#14899) · fd854183
    Wu Yi 提交于
    * clip softmax for fp16
    
    * updates
    
    * fuse xent support fp16 test=develop
    
    * wip
    
    * wip
    
    * add simple row reduce
    
    * wip fp16 accurate softmax
    
    * add accurate softmax kernel for fp16 test=develop
    
    * update test=develop
    
    * fix cpu build test=develop
    
    * update api.spec test=develop
    
    * follow comments test=develop
    
    * fix build test=develop
    
    * fix trt build test=develop
    
    * fix inference build test=develop
    
    * fix merge test=develop
    
    * update test=develop
    
    * try fix build test=develop
    
    * fix build test=develop
    
    * rename real_exp test=develop
    
    * fortest
    
    * remove hacky kernels test=develop
    
    * clean up test=develop
    fd854183
elementwise_sub_op.cu 1.7 KB