[cherry-pick][PROFILE][BugFix] Precision profiler writes output tensor to files for each op; Fix dropout opencl kernel register (#4331) * cherry-pick from #4255, write output tensor to file. test=develop * cherry-pick from fix opencl dropout. test=develop (#4253)