    Fix the computation of dx (the gradient with respect to x) for the prelu operation. (#20949) · e249d9a3
    Committed by lilong12
    * set the default value of alpha for prelu to 0.25, test=develop
    
    * add the call to __syncthreads(), test=develop
    
    * fix the implementation of cpu prelu, test=develop
    
    * repair the implementation of element mode prelu, test=develop
    
    * modify test_prelu_op.py, test=develop
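
    For reference, the dx computation this commit touches can be sketched in a few lines. This is not the Paddle kernel: it assumes the standard PReLU definition with a single shared alpha (0.25, the default named above), and it assumes x = 0 is handled by the negative branch. The names prelu_forward and prelu_backward are hypothetical.

```python
def prelu_forward(x, alpha=0.25):
    """PReLU with one shared alpha: out = x if x > 0 else alpha * x."""
    return [v if v > 0 else alpha * v for v in x]


def prelu_backward(x, dout, alpha=0.25):
    """Return (dx, dalpha) for the forward pass above.

    dx     = dout          where x > 0
           = alpha * dout  where x <= 0   (assumed convention at x == 0)
    dalpha = sum of dout * x over the elements with x <= 0
    """
    dx = [g if v > 0 else alpha * g for v, g in zip(x, dout)]
    dalpha = sum(g * v for v, g in zip(x, dout) if v <= 0)
    return dx, dalpha


if __name__ == "__main__":
    x = [-2.0, 0.0, 3.0]
    dout = [1.0, 1.0, 1.0]
    print(prelu_forward(x))
    print(prelu_backward(x, dout))
```

    The channel and element modes mentioned above would keep one alpha per channel or per element instead of a single scalar; the per-element rule for dx is otherwise the same.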
cub_reduce.h 12.2 KB