• F
    convert multihead to oss (#45019) · f706d95d
    feng_shuai 提交于
    * convert multihead to oss
    
    * fix:bug
    
    * fix:delete const cast
    
    * fix:don't support bias_qk
    
    * add vit pass
    
    * fix:convert bug and add preln_residual_bias
    
    * support length=-1
    
    * add UT for convert
    
    * add no_bias_qk support for gpu_multihead_op
    
    * delete infer_shape depends on bias_qk
    
    * oss just can be used in T4 and A*
    
    * fix:change api for ROCM CI
    f706d95d
vit_attention_fuse_pass.cc 5.4 KB