1. 02 3月, 2022 1 次提交
  2. 01 3月, 2022 1 次提交
    • Z
      [bf16] add bf16 kernel: scale gather sum (#39683) · 6d26b332
      zhangbo9674 提交于
      * add scale gather sum
      
      * refine CUDA_ATOMIC_WRAPPER ADD for bf16
      
      * add gather unittest
      
      * solve conflict
      
      * add scale uinttest
      
      * add sum unittest
      
      * solve conflict
      
      * refine gather unittest
      
      * refine unittest
      6d26b332
  3. 15 2月, 2022 1 次提交
    • A
      [PTen]Migrate proto::VarType outside of Pten (#39411) · 7e7e9404
      Aurelius84 提交于
      * #1 migrate dist-related type()-> dtype()
      
      * move datatype function from pten -> fluid/framework
      
      * change type() in imperative into convert(dtype())
      
      * modify xx_tensor->type into xx_tensor->dtype
      
      * change the set_type interface and the caller
      
      * modify xx_tensor.type into xx_tensor.dtype
      
      * fix mutable_data(place, dtype())
      
      * change caller of mutable_data in pten and distributed
      
      * change the caller of mutable_data in fluid/framework
      
      * change the caller of mutable_data in imperative directory
      
      * mutable_data: inference
      
      * update the call of mutable_data
      
      * transfer MakePenScalarArray MakePtenScalar ResetHolderWithType
      
      * pass the compile. the next step is remove VarType in Pten
      
      * fix all and remove VarType from pten. success in linux. Next task is other platform
      
      * fix conflict with develop
      
      * fix compiled error
      
      * Fix reset conversion
      
      * fix conflict
      
      * fix compiled problem
      
      * fix typo
      
      * Fix << in tensor_utils.cc
      
      * fix type->dtype
      
      * fix unittest
      
      * fix tensor init constructor
      
      * fix DataTypeSize for BFloat16
      
      * fix code style
      
      * fix npu compiled error
      
      * fix npu
      
      * compile npu sucessfully
      
      * fix conflict
      
      * fix conflict
      Co-authored-by: Nxiongkun <xiongkun03@baidu.com>
      7e7e9404
  4. 11 6月, 2021 1 次提交
  5. 23 8月, 2020 1 次提交
  6. 14 5月, 2020 1 次提交
  7. 12 6月, 2019 1 次提交
    • W
      Fix scatter and gather op when has duplicate index (#17952) · 8eb134c3
      wawltor 提交于
      * test=develop
      The scatter op has a calc bug when the indices has same index, the scatter op use overwrite mode to calculate the same index, fix this bug by using the accumulate mode to calculate the same index.At the same time, the gather op has the same bug when the op calc the grad. And we use the lib of open-blas and eigen to optimize the time cost in accumulate mode.
      
      * test=develop
      Fix some code format problem, and the same time add the test case in gather and scatter op
      8eb134c3
  8. 25 5月, 2019 1 次提交
  9. 30 1月, 2019 2 次提交
  10. 31 10月, 2018 1 次提交
  11. 19 4月, 2018 1 次提交
  12. 12 2月, 2018 1 次提交
  13. 10 2月, 2018 2 次提交
  14. 26 12月, 2017 1 次提交
  15. 12 12月, 2017 1 次提交
    • Q
      Refine device context (#6433) · 61ec0b95
      QI JUN 提交于
      There are mainly following fixes:
      
      - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
      - remove `eigen_device` interface in base class  `DeviceContext`
      - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
      - remove unused `platform::EigenDeviceConverter`
      - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
      - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
      61ec0b95
  16. 04 10月, 2017 1 次提交
  17. 03 10月, 2017 1 次提交
  18. 29 9月, 2017 3 次提交
  19. 04 9月, 2017 1 次提交
  20. 16 8月, 2017 1 次提交
  21. 10 8月, 2017 3 次提交
  22. 09 8月, 2017 1 次提交
  23. 08 8月, 2017 1 次提交
  24. 07 8月, 2017 2 次提交
  25. 04 8月, 2017 1 次提交
  26. 03 8月, 2017 1 次提交
  27. 31 7月, 2017 1 次提交
  28. 25 7月, 2017 1 次提交
  29. 19 7月, 2017 1 次提交
  30. 18 7月, 2017 1 次提交
  31. 17 7月, 2017 3 次提交