1. 09 2月, 2023 2 次提交
  2. 26 1月, 2023 1 次提交
    • M
      Abstract accelerator (step 3) (#2677) · 98cc35b6
      Ma, Guokai 提交于
      * Integrate accelerator abstraction interface into deepspeed/
      
      * Fix error message in fp16/fused_optimizer
      
      * fix error message in fp16/unfused_optimizer.py
      
      * assign get_accelerator().pin_memory() result to input Tensor name
      
      * no need to check cuda and whether nvtx supported
      
      * move try-except into inner most block
      
      * call Event() and Stream() in get_accelerator() for data type
      
      * Make Stream and Event as properties of abstract interface so they can be used as data type in deepspeed
      
      * Apply op_builder backend api change from #2705 from @jeffra
      
      * fix tests where Builder NAME is used
      
      * keep original ...Builder.NAME interface instead of ...Builder().NAME interface
      
      * fix builder closure for installation
      
      * fix randomltd builder
      
      * add comments to clarify create_op_builder and get_op_builder
      
      * fix compatibility with pip install -e
      Co-authored-by: NCheng Li <pistasable@gmail.com>
      Co-authored-by: NOlatunji Ruwase <olruwase@microsoft.com>
      98cc35b6
  3. 14 10月, 2022 1 次提交
  4. 30 7月, 2022 1 次提交
  5. 26 7月, 2022 1 次提交
  6. 08 6月, 2022 1 次提交
  7. 07 6月, 2022 1 次提交
  8. 22 4月, 2022 1 次提交
  9. 28 1月, 2022 1 次提交
  10. 13 1月, 2022 1 次提交
  11. 30 11月, 2021 1 次提交
  12. 09 11月, 2021 1 次提交
  13. 10 9月, 2020 1 次提交