• MarDino's avatar
    Optimize FusedBiasAddGelu Kernel (#47679) · b0e28540
    MarDino 提交于
    * Add quick gelu and fused bias add kernel
    
    * fix annotation
    
    * remove useless code
    
    * add fast gelu option and set it in multi transformer op
    
    * add flag to restrict if use fast gelu approximate
    
    * fix flags conflict
    
    * fix use tanh function instead
    
    * add cudart version limit
    
    * use phi fast tanh func
    
    * fix comment
    b0e28540
fused_dropout_act_bias.h 16.1 KB