1. 17 Dec, 2020 1 commit
    • [cherry-pick] Fix matmul_v2 bug, add rebuild group, and fix download bug (#29726) · df0430dc
      Committed by ShenLiang
      * Fix the download bug in the case of multiple machines (#29551)
      
      * fix the download bug
      * add sort for IPs
      
      * Fix bug of matmul_v2 for broadcast case (#29599)
      
      * fix bug of matmul_v2 for broadcast
      
      * Rebuild group automatically in dynamic graph distributed (#29255)
      
      * add tensor_indices in AssignGroupBySize
      
      * add rebuild group in reducer
      
      * fix error message of gather nd (#29521)
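
The "broadcast case" in the matmul_v2 fix above refers to batched matrix multiplication where the leading (batch) dimensions of the two operands differ. As a rough illustration only, the sketch below computes the output shape under NumPy-style broadcasting rules. The function name `broadcast_batch_shape` is hypothetical and is not taken from the Paddle source; the sketch also assumes both operands have at least two dimensions.

```python
from itertools import zip_longest


def broadcast_batch_shape(x_shape, y_shape):
    """Output shape of a batched matmul under NumPy-style broadcasting.

    Hypothetical helper for illustration; assumes ndim >= 2 for both inputs.
    """
    if len(x_shape) < 2 or len(y_shape) < 2:
        raise ValueError("this sketch only handles operands with ndim >= 2")

    m, k = x_shape[-2], x_shape[-1]
    k2, n = y_shape[-2], y_shape[-1]
    if k != k2:
        raise ValueError(f"inner dimensions differ: {k} vs {k2}")

    # Align batch dimensions from the right; a missing dimension counts as 1.
    x_batch = reversed(tuple(x_shape[:-2]))
    y_batch = reversed(tuple(y_shape[:-2]))
    out_batch = []
    for a, b in zip_longest(x_batch, y_batch, fillvalue=1):
        if a != b and a != 1 and b != 1:
            raise ValueError(f"batch dims {a} and {b} cannot be broadcast")
        # A dimension of size 1 stretches to match the other operand.
        out_batch.append(b if a == 1 else a)

    return tuple(reversed(out_batch)) + (m, n)
```

For example, shapes `(2, 1, 3, 4)` and `(5, 4, 6)` have batch dimensions `(2, 1)` and `(5,)`, which broadcast to `(2, 5)`, so the result shape is `(2, 5, 3, 6)`.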
  2. 09 Nov, 2020 1 commit
  3. 23 Aug, 2020 1 commit
  4. 11 Jul, 2020 1 commit
  5. 14 May, 2020 1 commit
  6. 11 May, 2020 1 commit
    • Add macro BOOST_GET to enrich the error information of boost::get (#24175) · aa0f254f
      Committed by Chen Weihang
      * add new macro BOOST_GET_SAFELY & unittests, test=develop
      
      * add different macro type, test=develop
      
      * fix get macro type in executor, test=develop
      
      * four macro part change backup
      
      * using one macro for all case, test=develop
      
      * revert attribute change, test=develop
      
      * change to three func to solve gcc4.8 bug, test=develop
      
      * polish some details, test=develop
  7. 11 Sep, 2019 1 commit
    • Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989) · 12542320
      Committed by Huihuang Zheng
      TemporaryAllocator is a singleton used to allocate memory for cuDNN. Removing the singleton gives better memory performance.
      
      We replace TemporaryAllocator with CUDADeviceContextAllocator and CUDADeviceContextAllocation, which use a stream callback to free the memory allocated for a stream, so no singleton is needed.
      
      Also adds data_feed_proto to operator to fix CI for CPU compilation.
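
The stream-callback idea described above can be modelled in a few lines. The sketch below is a toy simulation, not Paddle's implementation: `FakeStream` and `StreamAllocator` are hypothetical names, and the "stream" is just a FIFO queue of Python callables standing in for queued device work. It only illustrates why the free is deferred: work already queued on the stream may still read the buffer.

```python
class FakeStream:
    """Toy stand-in for a device stream: queued items run in FIFO order."""

    def __init__(self):
        self._pending = []

    def enqueue(self, fn):
        # Queue a kernel or a host callback; nothing runs until synchronize().
        self._pending.append(fn)

    def synchronize(self):
        # Drain the queue in submission order.
        while self._pending:
            self._pending.pop(0)()


class StreamAllocator:
    """Hypothetical allocator tied to one stream (illustrative assumption)."""

    def __init__(self, stream):
        self.stream = stream
        self.live = {}  # buffer id -> size in bytes
        self._next_id = 0

    def allocate(self, nbytes):
        buf_id = self._next_id
        self._next_id += 1
        self.live[buf_id] = nbytes
        return buf_id

    def release(self, buf_id):
        # Do not free immediately: enqueue the free as a callback so it runs
        # only after all work queued earlier on the stream has finished.
        self.stream.enqueue(lambda: self.live.pop(buf_id, None))
```

In this model, a buffer released right after a kernel is queued stays alive until the stream reaches the callback, and because each allocator belongs to its stream, no global singleton is involved.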
  8. 30 Aug, 2019 1 commit
  9. 14 Aug, 2019 1 commit
  10. 25 May, 2019 1 commit
  11. 26 Mar, 2019 1 commit
  12. 12 Nov, 2018 1 commit
    • Fix gather & stack op (#14355) · bd294378
      Committed by Yibing Liu
      * Add int type support for stack_op
      
      * Improve gather op to support index with shape N x 1
      
      test=develop
      
      * Fix stack_op kernel's registry
      
      test=develop
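
To illustrate what accepting an index of shape N x 1 means for a gather, here is a minimal pure-Python sketch. It is an assumption about the intended semantics, not the op's kernel: the hypothetical `gather` function treats an N x 1 index exactly like a flat index of length N.

```python
def gather(rows, index):
    """Select rows by index.

    `index` may be a flat list of length N or a nested list of shape N x 1.
    Hypothetical helper for illustration only.
    """
    flat = []
    for item in index:
        if isinstance(item, (list, tuple)):
            if len(item) != 1:
                raise ValueError("index must have shape [N] or [N, 1]")
            item = item[0]  # unwrap the single-element inner dimension
        flat.append(item)
    # Pick the requested rows in the order the indices were given.
    return [rows[i] for i in flat]
```

With `rows = [[1, 2], [3, 4], [5, 6]]`, both `gather(rows, [2, 0])` and `gather(rows, [[2], [0]])` select the third row followed by the first.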
  13. 12 Feb, 2018 1 commit
  14. 10 Feb, 2018 2 commits
  15. 26 Dec, 2017 1 commit
  16. 12 Dec, 2017 1 commit
    • Refine device context (#6433) · 61ec0b95
      Committed by QI JUN
      The main changes are:
      
      - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
      - remove `eigen_device` interface in base class `DeviceContext`
      - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
      - remove unused `platform::EigenDeviceConverter`
      - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
      - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
  17. 04 Oct, 2017 1 commit
  18. 03 Oct, 2017 1 commit
  19. 29 Sep, 2017 2 commits