1. 24 10月, 2022 3 次提交
  2. 21 10月, 2022 1 次提交
  3. 19 10月, 2022 1 次提交
    • G
      Add enable_partial_send_recv switch in pipeline_configs (#46992) (#47083) · 1d015f12
      Ghost Screaming 提交于
      * Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result
      is wrong.
      
      * Support allow_partial switch, which can be configure in
      pipeline_configs. If sent tensor are not the same from
      different hosts, they shouldn't been sent partially and
      then concated as a whole tensor.
      
      * Change name allow_partial to enable_partial_send_recv.
      
      * Add global variable _enable_partial_send_recv
      1d015f12
  4. 18 10月, 2022 2 次提交
    • Y
      Cherry pick for sharding (#47061) · 5b642140
      Yuang Liu 提交于
      * [dygraph sharding] Overlap the reduce and the caculation for sharding stage 2. (#46495)
      
      * [dygraph sharding stage 2] sharding broadcast overlap (#46656)
      
      * Multi groups for broadcast of sharding stage 2 (#46894)
      5b642140
    • H
      [cherry-pick] Fix perf issues of mp/pp/fuse in eager mode (#47071) · b84edd90
      Haohongxiang 提交于
      * [Dygraph] Fix performance of pp+mp by using send/recv_calc_stream instead of send/recv (#46116)
      
      * [Dygraph] Fix Perf of FusedFeedForward and FusedAttention with AllReduce (#46780)
      
      * update
      b84edd90
  5. 11 10月, 2022 1 次提交
    • Y
      Cherry pick for dygraph pp (#46876) · 9cc3f69f
      Yuang Liu 提交于
      * bug fix for virtual pipeline parallel (#45922)
      
      * dont wait for send op under dygraph pp (#46209)
      
      * [interleave pp] sync recv for 1f1b (#46399)
      
      * [dygraph pp] all sync for allgather partial (#46483)
      9cc3f69f
  6. 27 9月, 2022 1 次提交
  7. 22 9月, 2022 1 次提交
    • R
      logger manager (#45909) (#46087) · 7eb046c7
      Roc 提交于
      uniform logger manager in FleetAPI.
      hidde API under distributed/utils which users don't need.
      7eb046c7
  8. 19 9月, 2022 2 次提交
  9. 09 9月, 2022 1 次提交
  10. 07 9月, 2022 1 次提交
  11. 06 9月, 2022 1 次提交
  12. 02 9月, 2022 1 次提交
  13. 26 8月, 2022 3 次提交
  14. 16 8月, 2022 1 次提交
  15. 12 8月, 2022 1 次提交
  16. 10 8月, 2022 1 次提交
  17. 09 8月, 2022 1 次提交
  18. 22 7月, 2022 1 次提交
  19. 13 7月, 2022 2 次提交
  20. 27 6月, 2022 1 次提交
  21. 14 6月, 2022 1 次提交
    • Z
      Fix numpy 1.20+ deprecation warnings (#42929) · 90cf2299
      zlsh80826 提交于
      * Replace np.bool/np.bool8 with np.bool_
      
      * Replace np.object with np.object_
      
      * Replace np.complex with np.complex128
      
      * Replace np.float with np.float64
      
      * Replace np.int with np.int_
      
      * Rerun pre-commit for newer pre-commit configuration
      
      * Use builtin bool instead of np.bool_ based on the context
      90cf2299
  22. 05 6月, 2022 1 次提交
    • S
      【code format check upgrade】 step2:yapf (#42944) · a072fca8
      Sing_chan 提交于
      * use yapf to format all python file
      
      * yapf exclude two unittests file for they rely on writing and reading file, and format will break them
      
      * disable diff_py_file because too many diff files cause command following failed
      a072fca8
  23. 26 5月, 2022 1 次提交
  24. 25 5月, 2022 1 次提交
  25. 16 5月, 2022 1 次提交
  26. 19 4月, 2022 1 次提交
  27. 16 4月, 2022 1 次提交
  28. 13 4月, 2022 1 次提交
  29. 25 3月, 2022 1 次提交
    • J
      Refactor Dygraph Flags (#40786) · 3085d5e4
      Jiabin Yang 提交于
      * refactor eager flags
      
      * fix flags error when we switch from eager to dygraph
      
      * fix ci problem
      
      * fix ci
      
      * fix ci
      
      * merge develop and fix code style
      
      * merge develop and fix code style
      
      * fix op test error
      
      * fix op test error
      
      * fix op test error
      
      * fix op test error
      
      * fix op test error
      
      * merge develop
      3085d5e4
  30. 18 3月, 2022 1 次提交
  31. 14 3月, 2022 1 次提交
  32. 11 3月, 2022 1 次提交
  33. 09 3月, 2022 1 次提交