1. 23 10月, 2022 1 次提交
  2. 21 10月, 2022 1 次提交
  3. 20 10月, 2022 3 次提交
  4. 19 10月, 2022 1 次提交
  5. 17 10月, 2022 2 次提交
    • G
      Add enable_partial_send_recv switch in pipeline_configs (#46992) · b9a2f29c
      Ghost Screaming 提交于
      * Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result
      is wrong.
      
      * Support allow_partial switch, which can be configure in
      pipeline_configs. If sent tensor are not the same from
      different hosts, they shouldn't been sent partially and
      then concated as a whole tensor.
      
      * Change name allow_partial to enable_partial_send_recv.
      
      * Add global variable _enable_partial_send_recv
      b9a2f29c
    • G
      Support BF16 training for sharding (#46846) · 0b39b244
      Ghost Screaming 提交于
      * Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result
      is wrong.
      
      * support pure bfloat16
      
      * support bf16 linear
      
      * update PR to pass CI
      
      * tiny fix where_grad_kernel.cu
      
      * Support bfloat16 type for reducer and sharding.
      
      * Fix some bug.
      
      * Polish code.
      
      * Polise code.
      
      * Add bfloat16 datatype in fill_grad kernels.
      Co-authored-by: Nsneaxiy <sneaxiy@126.com>
      0b39b244
  6. 13 10月, 2022 1 次提交
  7. 12 10月, 2022 2 次提交
  8. 09 10月, 2022 1 次提交
  9. 08 10月, 2022 1 次提交
  10. 28 9月, 2022 2 次提交
  11. 22 9月, 2022 1 次提交
  12. 21 9月, 2022 1 次提交
  13. 20 9月, 2022 2 次提交
  14. 19 9月, 2022 1 次提交
  15. 16 9月, 2022 1 次提交
    • W
      refactor mp. (#45803) · fa97e5ba
      wuhuachaocoding 提交于
      * refactor mp.
      
      * update setup.py.
      
      * update mp_layers.py for compatibility.
      
      * add documents for mp_layers.py
      
      * update init.py
      
      * update collective.py.
      
      * update.
      
      * update mp_ops.py
      
      * update.
      
      * update code style.
      
      * update code style.
      fa97e5ba
  16. 14 9月, 2022 1 次提交
  17. 09 9月, 2022 2 次提交
  18. 07 9月, 2022 1 次提交
  19. 06 9月, 2022 1 次提交
  20. 02 9月, 2022 1 次提交
  21. 26 8月, 2022 3 次提交
  22. 16 8月, 2022 1 次提交
  23. 12 8月, 2022 1 次提交
  24. 10 8月, 2022 1 次提交
  25. 09 8月, 2022 1 次提交
  26. 22 7月, 2022 1 次提交
  27. 13 7月, 2022 2 次提交
  28. 27 6月, 2022 1 次提交
  29. 14 6月, 2022 1 次提交
    • Z
      Fix numpy 1.20+ deprecation warnings (#42929) · 90cf2299
      zlsh80826 提交于
      * Replace np.bool/np.bool8 with np.bool_
      
      * Replace np.object with np.object_
      
      * Replace np.complex with np.complex128
      
      * Replace np.float with np.float64
      
      * Replace np.int with np.int_
      
      * Rerun pre-commit for newer pre-commit configuration
      
      * Use builtin bool instead of np.bool_ based on the context
      90cf2299
  30. 05 6月, 2022 1 次提交
    • S
      【code format check upgrade】 step2:yapf (#42944) · a072fca8
      Sing_chan 提交于
      * use yapf to format all python file
      
      * yapf exclude two unittests file for they rely on writing and reading file, and format will break them
      
      * disable diff_py_file because too many diff files cause command following failed
      a072fca8