1. 18 1月, 2023 1 次提交
  2. 12 1月, 2023 1 次提交
    • J
      Fix reduce func bug in process_group_bkcl (#49749) · 8e291bf7
      jameszhang 提交于
      * Fix reduce func bug in process_group_bkcl
      
      Also catch up with a recent process_group PR that failed to add XPU branch.
      Note that reduce is still accomplished by allreduce for xpu. Fix this should
      xccl lib be updated.
      
      * fix compile issue for non-XPU
      8e291bf7
  3. 09 12月, 2022 1 次提交
  4. 29 11月, 2022 1 次提交
  5. 24 11月, 2022 1 次提交
  6. 17 11月, 2022 1 次提交
  7. 14 11月, 2022 1 次提交
  8. 10 11月, 2022 1 次提交
    • J
      XPU multi-card support eager mode (#47445) · 3b91f8f3
      james 提交于
      * XPU support eager mode
      
      * add unittest for XPU eager mode
      
      * minor bugfix
      
      * minor bugfix, test=kunlun
      
      * correct copyright info
      
      * 1. remove unsed vars/funcs
      2. ProcessGroupBKCL inherit from ProcessGroupStream
      
      * bugfix for fp16 in eager mode multi-card, test=kunlun
      
      * rebase & fix a few issues
      
      * use new processgroup interface, test=kunlun
      
      * fix compile issue, test=kunlun
      3b91f8f3