1. 13 5月, 2019 1 次提交
    • Y
      Optimize the elementwise op using eigen (#15494) · dcda2023
      Yiqun Liu 提交于
      * Optimize the elementwise op with CUDA kernels.
      test=develop
      
      * Support setting of attr in op config file.
      test=develop
      
      * Add the support the setting dtype and initializer in config.
      test=develop
      
      * Save workspace.
      
      * Add initializer "zeros".
      test=develop
      
      * Fix compiling error.
      
      * Support the use of existed file to initailize tensor in op_tester.
      
      * Use eigen to optimize the elementwise_add/mul for the case that x and y have the same dims.
      test=develop
      dcda2023
  2. 07 3月, 2019 2 次提交
  3. 26 2月, 2019 1 次提交
    • Y
      Optimize the CUDA implementation of sequence_expand op by reduce the times of... · f4634d76
      Yiqun Liu 提交于
      Optimize the CUDA implementation of sequence_expand op by reduce the times of copying lod data from CPU to GPU. (#15493)
      
      * Optimize the CUDA implementation of sequence_expand op by reduce the times of copying lod data from CPU to GPU.
      test=develop
      
      * Refine the op benchmark to support setting lod in config.
      test=develop
      f4634d76
  4. 22 2月, 2019 1 次提交