“5c66338f4e9678d1a1254c6f1adb5d124a15512c”上不存在“paddle/phi/kernels/cpu/diagonal_grad_kernel.cc”
* Add Python Callstacks when Op::Run error * Skip op with sub-block * refactor: refine callstack info's format * Reshape only support matrix * Polish Python code * Fix UT * Fix Py3