• Z
    Cherrypick NV fixes to release/2.4 (#48263) · 7a0b8625
    zlsh80826 提交于
    * Reduce squeeze2_matmul_fuse_pass, flattent tests time (#47098)
    
    * Add missing fp32 config and reduce the testing combination
    
    * Reduce trt matmul pass test max examples
    
    * Loose TRT fp16 tests tolerance (#47100)
    
    * Loose TRT half test tolerance to 1e-3 (#47101)
    
    * Loose TRT half test tolerance to 1e-3 (#47106)
    
    * Update distributed_strategy.proto (#46531)
    
    * Close popen pipe after used (#47053)
    
    * Add launch_bounds (#47285)
    
    * Fix TRT UT failures (#47488)
    
    * Format cherry-picked commits
    
    * CudnnNormConvolution is no longer supported on NVIDIA Hopper GPUs (#48203)
    
    * Skip tests that use fused_ops on H100
    
    * Add error message to FusedOps on H100
    Co-authored-by: NShijie <505749828@qq.com>
    Co-authored-by: NLeo Chen <39020268+leo0519@users.noreply.github.com>
    Co-authored-by: NTian Zheng <tizheng@nvidia.com>
    7a0b8625
fused_dropout_act_bias.h 13.7 KB