Support FC (#1551)
* support fc in pruning * support matmul and matmul_v2 pruning * support ffn pruning with gelu activations in the middle * support reshape2, transpose and split in transformer block * support pattern: qkv gemm -> batched gemm -> out linear * prune all fc layer * support fc pruning * almost done with few hardcode * remove hardcode * fix UT * fix UT * avoid setting attributes for reshape in fc pruning * fix UT * fix UT
Showing
想要评论请 注册 或 登录