Fork自 PaddlePaddle / Paddle
* DataParallel state dict don't include _layers.; test=develop * add unitest of data parallel; test=develop * add load state test; test=develop