* Refine saving of output scales to the inference program
* Fix skip_quant in QAT
* Refine calculation of output scales in dygraph QAT, test=develop