Fork自 PaddlePaddle / Paddle
Fix diag OP bug on Windows Python3.8 ,remove the std::min
add paddle.tensor.linalg.diag API, diag_v2 OP and CUDA kernel.