    Committed by Yiqun Liu
    Optimize the CUDA implementation of the sequence_expand op by reducing the number of times lod data is copied from CPU to GPU. (#15493)
    
    * Optimize the CUDA implementation of the sequence_expand op by reducing the number of times lod data is copied from CPU to GPU.
    test=develop
    
    * Refine the op benchmark to support setting lod in the config.
    test=develop
    f4634d76
op_tester_config.cc 4.9 KB