1. 16 3月, 2022 1 次提交
  2. 02 3月, 2022 1 次提交
  3. 01 3月, 2022 1 次提交
    • Z
      [bf16] add bf16 kernel: layer_norm p_norm reduce_sum (#39843) · ce8ed978
      zhangbo9674 提交于
      * add layer norm
      
      * add p norm
      
      * add reduce sum
      
      * refine layer norm register bf16 for cudnn811
      
      * add bf16 cast for hip
      
      * add unittest
      
      * refine rocm
      
      * refine layer_norm unittest
      
      * refine reduce op
      
      * refine unittest
      
      * enhance atol for reduce unittest
      ce8ed978
  4. 25 2月, 2022 1 次提交
  5. 23 2月, 2022 1 次提交
  6. 22 2月, 2022 1 次提交
  7. 21 2月, 2022 1 次提交
  8. 20 2月, 2022 3 次提交
  9. 17 2月, 2022 1 次提交
  10. 29 1月, 2022 1 次提交
    • C
      [PTen] Tidy pten core headers (#39188) · dd990981
      Chen Weihang 提交于
      * open header for custom kernel
      
      * add core utils
      
      * tidy core code
      
      * tify header
      
      * tidy include
      
      * tidy namespace
      
      * resolve conflit
      
      * fix unittest and coverage
      
      * remove platform using
      
      * resolve conflict
      
      * resolve conflict
      
      * fix digamma namespace error
      
      * fix xpu full kernel error
      
      * fix xpu full kernel error
      
      * polish details
      
      * add place for lib storage
      dd990981
  11. 27 1月, 2022 1 次提交
  12. 25 1月, 2022 1 次提交
  13. 24 1月, 2022 1 次提交
    • [Refactoring Tensor PR #5] replace storage with pten allocation (#39085) · a56e16a7
      石晓伟 提交于
      * updates callers, test=develop
      
      * updates tensor, test=develop
      
      * fixes errors, test=develop
      
      * remove some dtypes, test=develop
      
      * fix errors in the base storage modification, test=develop
      
      * fixes a bug, test=develop
      
      * fixes the bugs in push the whole, test=develop
      
      * updates, test=develop
      
      * update
      
      * update, test=develop
      
      * fixes the mac-py3 CI, test=develop
      
      * remove the storage impl, test=develop
      
      * updates some codes, test=develop
      
      * update, test=develop
      
      * updates pten allocation, test=develop
      a56e16a7
  14. 21 1月, 2022 2 次提交
  15. 20 1月, 2022 1 次提交
  16. 13 1月, 2022 1 次提交
  17. 12 1月, 2022 1 次提交
  18. 06 1月, 2022 1 次提交
  19. 05 1月, 2022 1 次提交
  20. 04 1月, 2022 1 次提交
  21. 31 12月, 2021 1 次提交