Created by: NHZlX
The TRT subgraph will copy all the intermediate results to cpu memory, this will greatly reduce the time of inference. So we should filter the useless output.