1. 14 5月, 2020 1 次提交
  2. 11 9月, 2019 1 次提交
    • H
      Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989) · 12542320
      Huihuang Zheng 提交于
      TemporaryAllocator is a singleton used for allocating memory for Cudnn. Since it is a singleton, we can delete it for better performance in memory.
      
      We replace TemporaryAllocator by CUDADeviceContextAllocator and CUDADeviceContextAllocation, which uses stream callback to delete the memory allocated for the stream to avoid singleton.
      
      Also added data_feed_proto to operator to fix CI in CPU compilation
      12542320
  3. 04 9月, 2019 2 次提交
  4. 30 8月, 2019 1 次提交
  5. 12 6月, 2019 1 次提交
    • W
      Fix scatter and gather op when has duplicate index (#17952) · 8eb134c3
      wawltor 提交于
      * test=develop
      The scatter op has a calc bug when the indices has same index, the scatter op use overwrite mode to calculate the same index, fix this bug by using the accumulate mode to calculate the same index.At the same time, the gather op has the same bug when the op calc the grad. And we use the lib of open-blas and eigen to optimize the time cost in accumulate mode.
      
      * test=develop
      Fix some code format problem, and the same time add the test case in gather and scatter op
      8eb134c3
  6. 25 5月, 2019 1 次提交
  7. 12 11月, 2018 1 次提交
    • Y
      Fix gather & stack op (#14355) · bd294378
      Yibing Liu 提交于
      * Add int type support for stack_op
      
      * Improve gather op to support index with shape N x 1
      
      test=develop
      
      * Fix stack_op kernel's registry
      
      test=develop
      bd294378
  8. 12 2月, 2018 1 次提交
  9. 10 2月, 2018 2 次提交
  10. 26 12月, 2017 1 次提交
  11. 04 10月, 2017 1 次提交
  12. 03 10月, 2017 1 次提交
  13. 29 9月, 2017 2 次提交