1. 12 3月, 2022 1 次提交
  2. 01 3月, 2022 1 次提交
    • Z
      [bf16] add bf16 kernel: scale gather sum (#39683) · 6d26b332
      zhangbo9674 提交于
      * add scale gather sum
      
      * refine CUDA_ATOMIC_WRAPPER ADD for bf16
      
      * add gather unittest
      
      * solve conflict
      
      * add scale uinttest
      
      * add sum unittest
      
      * solve conflict
      
      * refine gather unittest
      
      * refine unittest
      6d26b332
  3. 22 2月, 2022 1 次提交
  4. 20 2月, 2022 2 次提交
  5. 11 2月, 2022 1 次提交
  6. 08 2月, 2022 1 次提交
  7. 28 1月, 2022 1 次提交
  8. 27 1月, 2022 1 次提交
  9. 24 1月, 2022 1 次提交
    • [Refactoring Tensor PR #5] replace storage with pten allocation (#39085) · a56e16a7
      石晓伟 提交于
      * updates callers, test=develop
      
      * updates tensor, test=develop
      
      * fixes errors, test=develop
      
      * remove some dtypes, test=develop
      
      * fix errors in the base storage modification, test=develop
      
      * fixes a bug, test=develop
      
      * fixes the bugs in push the whole, test=develop
      
      * updates, test=develop
      
      * update
      
      * update, test=develop
      
      * fixes the mac-py3 CI, test=develop
      
      * remove the storage impl, test=develop
      
      * updates some codes, test=develop
      
      * update, test=develop
      
      * updates pten allocation, test=develop
      a56e16a7
  10. 20 1月, 2022 1 次提交
  11. 18 1月, 2022 2 次提交
  12. 15 1月, 2022 1 次提交
  13. 13 1月, 2022 2 次提交
    • C
      [PTen] Rename kernel register marco (#38861) · 158bf13f
      Chen Weihang 提交于
      * rename register marco
      
      * fix error changing
      
      * fix format error
      158bf13f
    • C
      [pten]Remove pten/include dir files (#38878) · 7e0292ea
      chentianyu03 提交于
      * move dot_dev api into dot_kernel.h
      
      * add infermate header
      
      * modify to dotkerel in dot_op.h
      
      * mvoe conj dev api into complex_kernel.h
      
      * move sign dev api into  sign_kernel.h
      
      * move scale dev api into kernel.h and remove infermete.h
      
      * rm paddle/pten/include/math.h
      
      * rm paddle/pten/include/math.h
      
      * rm include dir
      
      * rm paddle/pten/include/math.h
      
      * fix conflict with develop branch
      
      * rm devContext in conj_op.h
      
      * add the missing complex_kernel header
      7e0292ea
  14. 12 1月, 2022 1 次提交
  15. 04 1月, 2022 1 次提交
  16. 21 12月, 2021 2 次提交
  17. 20 12月, 2021 1 次提交