“1006383b8485c3409a9a7e09d9623df7e03f7364”上不存在“paddle/phi/kernels/group_norm_kernel.h”
-
由 Yiqun Liu 提交于
Optimize the CUDA implementation of sequence_expand op by reduce the times of copying lod data from CPU to GPU. (#15493) * Optimize the CUDA implementation of sequence_expand op by reduce the times of copying lod data from CPU to GPU. test=develop * Refine the op benchmark to support setting lod in config. test=develop
f4634d76