1. 20 May, 2021 1 commit
  2. 19 May, 2021 1 commit
  3. 16 May, 2021 1 commit
  4. 14 May, 2021 5 commits
  5. 13 May, 2021 2 commits
  6. 11 May, 2021 1 commit
    • O
      Auto-tuning aio parameters (#1059) · 6124eb37
      Committed by Olatunji Ruwase
      * Validate env; remove write size from logs
      
      * Performance scripts for auto-tuning/auto-generating aio params of
      the DeepSpeed config.
      
      * Formatting fixes
      
      * Address feedback
      6124eb37
  7. 08 May, 2021 1 commit
  8. 07 May, 2021 1 commit
  9. 06 May, 2021 1 commit
  10. 05 May, 2021 1 commit
    • J
      bump 0.3.17 for next release · e4139a7d
      Committed by Jeff Rasley
      Previously we would bump the release version and then push the release, but then any unreleased commits in master were versioned as the previous release, which isn't correct. Instead we will now push to PyPI and then bump to the next version for unreleased commits.
      e4139a7d
  11. 04 May, 2021 2 commits
    • S
      fix assert message (#1040) · 47ec97eb
      Committed by Stas Bekman
      * fix assert
      
      The current assert "Model must initialized in fp16 mode for ZeRO Stage 3." needs TLC. I rewrote it completely to match its cousin assert, so now we have 2 consistent matching asserts:
      
      - f"fp16 is enabled but one or several model parameters have dtype that is not fp16"
      - f"fp16 is not enabled but one or several model parameters have dtype of fp16"
      
      * remove f
      47ec97eb
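The two messages quoted in this commit describe a symmetric check between the fp16 setting and the parameter dtypes. A minimal sketch of that idea follows; `check_fp16_consistency` is a hypothetical helper, and dtypes are represented as plain strings so the example runs without PyTorch. It is not DeepSpeed's actual code.

```python
def check_fp16_consistency(fp16_enabled, param_dtypes):
    """Hypothetical helper: fail when the fp16 flag and the parameter dtypes disagree.

    param_dtypes is a list of dtype names such as "float16" or "float32"
    (an assumption made so the sketch stays self-contained).
    """
    if fp16_enabled:
        # With fp16 enabled, every parameter must already be half precision.
        assert all(d == "float16" for d in param_dtypes), \
            "fp16 is enabled but one or several model parameters have dtype that is not fp16"
    else:
        # With fp16 disabled, no parameter may be half precision.
        assert not any(d == "float16" for d in param_dtypes), \
            "fp16 is not enabled but one or several model parameters have dtype of fp16"


# Consistent combinations pass silently.
check_fp16_consistency(True, ["float16", "float16"])
check_fp16_consistency(False, ["float32", "float32"])
```

Neither message contains a placeholder, so plain string literals are enough, which matches the "remove f" follow-up in the commit.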
    • J
      Change methods to be static (#1038) · 21047072
      Committed by janEbert
      Fix #1032
      21047072
  12. 03 May, 2021 1 commit
  13. 01 May, 2021 3 commits
  14. 30 April, 2021 1 commit
  15. 01 May, 2021 5 commits
  16. 30 April, 2021 3 commits
  17. 29 April, 2021 2 commits
  18. 28 April, 2021 1 commit
  19. 27 April, 2021 1 commit
  20. 25 April, 2021 1 commit
    • H
      Add find_unused_parameters option to DeepSpeedEngine (#945) · d0b61f18
      Committed by hamlet
      * Add find_unused_parameters option
      
      As unused parameters in modules may not be expected sometimes, 
      add an explicit error msg when it occurred and an option to avoid the error: https://github.com/microsoft/DeepSpeed/issues/707
      
      * Add find_unused_parameters option
      
      As unused parameters in modules may not be expected sometimes, 
      add an explicit error msg when it occurred and an option to avoid the error: https://github.com/microsoft/DeepSpeed/issues/707
      
      * Fix syntax error
      
      * Fix yapf error
      
      * Fix yapf error
      
      * Fix yapf error
      
      * Fix yapf error
      
      * Move stage2 find_unused_parameters to config file
      
      * Add stage2 find_unused_parameters
      
      * Add stage2 find_unused_parameters
      
      * Add stage2_find_unused_parameters option
      
      * Change error msg to reflect zero_optimization config change
      
      * Fix yapf error
      
      * Fix yapf errors
      
      * Change find_unused_parameters option name
      
      * Change find_unused_parameters option name
      
      * Change find_unused_parameters option name
      
      * Change find_unused_parameters option name
      
      * Change find_unused_parameters option name
      
      * Add UnusedParametersModel for test option find_unused_parameters
      
      * Add unit test for stage2 find_unused_parameters
      
      * Add cpu-adam compatible check
      
      * Remove dups import
      
      * Trim spaces
      
      * Fix yapf errors
      
      * Trim spaces
      
      * Add False Positive test check
      
      * Fix find_unused_parameters test
      
      * Trim spaces
      
      * Fix yapf error
      d0b61f18
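The commit above moves the unused-parameter option into the `zero_optimization` section of the config and renames it several times, but the log does not record the final name. The sketch below assumes the key is `ignore_unused_parameters`, which is the name I believe the released DeepSpeed config uses; treat that key as an assumption and confirm it against the configuration docs for your version.

```python
import json

# Sketch of a DeepSpeed config fragment for ZeRO stage 2.
# ASSUMPTION: the unused-parameter option is spelled "ignore_unused_parameters";
# the commit log only says the option was renamed, not what the final name is.
ds_config = {
    "zero_optimization": {
        "stage": 2,
        "ignore_unused_parameters": True,
    },
}

# Serialize to the JSON form that a config file would contain.
print(json.dumps(ds_config, indent=2))
```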
  21. 24 April, 2021 2 commits
  22. 23 April, 2021 3 commits
    • O
      Asynchronous I/O docs (#1000) · bff4bc72
      Committed by Olatunji Ruwase
      * Fix docstring
      
      * Make screenshots clickable for easier viewing
      
      * Navigation menu in alphabetical order; more clickable screenshots
      
      * Rename 1Cycle doc
      
      * Tweak naming
      
      * Remove no longer used flag
      
      * ZeRO3 Offload release
      
      * Single GPU results
      
      * Rearrange figures
      
      * Single GPU text
      
      * tweak intro
      
      * zero3-offload section
      
      * Add asynchronous i/o docs
      bff4bc72
    • S
      [doc] add missing pin_memory entry (#999) · ecf2e1bc
      Committed by Stas Bekman
      - `offload_param` was missing `pin_memory` 
      - also moved the entry in `offload_optimizer` to have it in the same place.
      ecf2e1bc
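For context, `pin_memory` is a key in both offload sections named in this doc fix. A minimal sketch of a ZeRO stage 3 fragment that sets it in each follows; the `"cpu"` device and the `True` values are illustrative choices, not recommendations taken from the commit.

```python
import json

# Sketch of a ZeRO stage 3 offload fragment with pin_memory in both sections.
# The device and boolean values are illustrative assumptions.
zero3_offload_config = {
    "zero_optimization": {
        "stage": 3,
        "offload_optimizer": {
            "device": "cpu",
            "pin_memory": True,
        },
        "offload_param": {
            "device": "cpu",
            "pin_memory": True,
        },
    },
}

print(json.dumps(zero3_offload_config, indent=2))
```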
    • W
      Fix issue where gradient_predivide_factor was called as a func. (#996) · a7118789
      Committed by William Buchwalter
      * Fix issue where gradient_predivide_factor was called as a func.
      
      `gradient_predivide_factor` is a `float`, hence shouldn't be called as func.
      This crashes when `reduce_scatter` flag is set to `False`.
      a7118789
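The failure mode described in this commit can be reproduced in isolation: calling a `float` raises `TypeError`. The sketch below is a standalone illustration, not the engine code. `scale_buggy` and `scale_fixed` are hypothetical names, and dividing by the factor is only an assumed use of the value.

```python
# A plain float, as the commit message states.
gradient_predivide_factor = 2.0


def scale_buggy(value):
    # Bug: the float is called like a function, which raises
    # TypeError: 'float' object is not callable.
    return value / gradient_predivide_factor()


def scale_fixed(value):
    # Fix: use the float directly.
    return value / gradient_predivide_factor


try:
    scale_buggy(8.0)
except TypeError as err:
    print("buggy version failed:", err)

print(scale_fixed(8.0))
```

Because the call only happens on the code path taken when `reduce_scatter` is `False`, a bug of this kind stays hidden until that path is exercised.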