Created by: jianhang-liu
When tuning MM_DNN model, it's found that stack OP caused almost 20% of total time and it only occur in release mode. This huge cost is due to call DebugStringEx() in OperatorWithKernel class. This issue occur for each OP (e.g. stack) which hasn't it's own GetExpectedKernelType().