1. 20 5月, 2021 1 次提交
  2. 16 5月, 2021 1 次提交
  3. 08 5月, 2021 1 次提交
  4. 30 4月, 2021 1 次提交
  5. 25 4月, 2021 1 次提交
    • H
      Add find_unused_parameters option to DeepSpeedEngine (#945) · d0b61f18
      hamlet 提交于
      * Add find_unused_parameters option
      
      As unused parameters in modules may not be expected sometimes, 
      add an explicit error msg when it occurred and an option to avoid the error: https://github.com/microsoft/DeepSpeed/issues/707
      
      * Add find_unused_parameters option
      
      As unused parameters in modules may not be expected sometimes, 
      add an explicit error msg when it occurred and an option to avoid the error: https://github.com/microsoft/DeepSpeed/issues/707
      
      * Fix syntax error
      
      * Fix yapf error
      
      * Fix yapf error
      
      * Fix yapf error
      
      * Fix yapf error
      
      * Move stage2 find_unused_parameters to config file
      
      * Add stage2 find_unused_parameters
      
      * Add stage2 find_unused_parameters
      
      * Add stage2_find_unused_parameters option
      
      * Change error msg to reflect zero_optimization config change
      
      * Fix yapf error
      
      * Fix yapf errors
      
      * Change find_unused_parameters option name
      
      * Change find_unused_parameters option name
      
      * Change find_unused_parameters option name
      
      * Change find_unused_parameters option name
      
      * Change find_unused_parameters option name
      
      * Add UnusedParametersModel for test option find_unused_parameters
      
      * Add unit test for stage2 find_unused_parameters
      
      * Add cpu-adam compatible check
      
      * Remove dups import
      
      * Trim spaces
      
      * Fix yapf errors
      
      * Trim spaces
      
      * Add False Positive test check
      
      * Fix find_unused_parameters test
      
      * Trim spaces
      
      * Fix yapf error
      d0b61f18
  6. 15 4月, 2021 1 次提交
  7. 27 3月, 2021 1 次提交
  8. 16 3月, 2021 1 次提交
  9. 12 3月, 2021 1 次提交
  10. 11 3月, 2021 1 次提交
  11. 17 2月, 2021 1 次提交
  12. 25 11月, 2020 1 次提交
  13. 24 11月, 2020 1 次提交
  14. 21 11月, 2020 1 次提交
  15. 13 11月, 2020 1 次提交
  16. 06 11月, 2020 1 次提交
  17. 31 10月, 2020 1 次提交
  18. 30 9月, 2020 1 次提交
  19. 28 9月, 2020 1 次提交
  20. 17 9月, 2020 1 次提交
  21. 10 9月, 2020 2 次提交
  22. 02 9月, 2020 1 次提交
  23. 01 9月, 2020 1 次提交
    • S
      Samyamr/grad acc stage2 (#338) · 7240abf3
      Samyam Rajbhandari 提交于
      * Adding gradient accumulation support for ZeRO Stage 2. Changing all Megatron-LM tests to also test gradient accumulation
      
      * Gradient Accumulation support for Stage 2. Model tests added to test the feature
      
      * formatting
      
      * Update deepspeed_light.py
      
      removing comment
      
      * Update ds_config_func_bs8_zero1.json
      
      reverting this file back. Its not needed for this PR
      
      * defining baseline prefix
      Co-authored-by: NJeff Rasley <jerasley@microsoft.com>
      7240abf3
  24. 14 7月, 2020 1 次提交
  25. 07 7月, 2020 1 次提交
  26. 24 6月, 2020 1 次提交
  27. 20 6月, 2020 1 次提交
    • S
      Update deepspeed_utils.py (#270) · 224494bd
      Samyam Rajbhandari 提交于
      * Removing handle_overflow debugging code in deepspeed_utils.py
      
      * Removing handle_overflow debugging code in deepspeed_zero_optimizer.py
      
      Removing unnecessary overflow handle code. Not sure why it was there in the first place.
      224494bd
  28. 05 6月, 2020 1 次提交
    • C
      Add log util (#230) · e1ad8803
      Chunyang Wen 提交于
      * Add log util
      
      * replace all occurrences of print and logging
      
      * address format
      
      * disable propagate to avoid duplicate log
      e1ad8803
  29. 04 6月, 2020 1 次提交
  30. 28 5月, 2020 2 次提交
    • J
      add support for predivide as a config option (#235) · bc36b91d
      Jeff Rasley 提交于
      * add support for predivide as a flag
      * add predivide json config, remove allgather_disable (as it's not currently used anymore)
      bc36b91d
    • S
      Samyamr/cpu memory bloat fix zero (#233) · d24d3de9
      Samyam Rajbhandari 提交于
      * Fix for CPU memory Bloating Issue caused by pyorch backward graph creation in allgather. Fixed by calling detach on tensors before calling all_gather
      
      * Fix for CPU memory Bloating Issue caused by pyorch backward graph creation in allgather. Fixed by calling detach on tensors before calling all_gather
      
      * Fix for CPU memory Bloating Issue caused by pyorch backward graph creation in allgather. Fixed by calling detach on tensors before calling all_gather
      d24d3de9
  31. 19 5月, 2020 1 次提交
  32. 25 4月, 2020 1 次提交
  33. 21 4月, 2020 1 次提交
  34. 03 4月, 2020 1 次提交
  35. 26 3月, 2020 1 次提交
  36. 11 3月, 2020 1 次提交
  37. 04 2月, 2020 1 次提交