- 07 3月, 2019 2 次提交
- 26 2月, 2019 1 次提交
-
-
由 Yiqun Liu 提交于
Optimize the CUDA implementation of sequence_expand op by reduce the times of copying lod data from CPU to GPU. (#15493) * Optimize the CUDA implementation of sequence_expand op by reduce the times of copying lod data from CPU to GPU. test=develop * Refine the op benchmark to support setting lod in config. test=develop
-
- 22 2月, 2019 1 次提交
-
-
由 Yiqun Liu 提交于
* Initialize the benchmark tester for operator. test=develop * Rearrange the codes. test=develop
-