1. 30 Jun, 2023 1 commit
  2. 24 Jun, 2023 1 commit
  3. 12 May, 2023 1 commit
  4. 14 Apr, 2023 1 commit
  5. 05 Apr, 2023 1 commit
  6. 22 Mar, 2023 1 commit
  7. 18 Jan, 2023 1 commit
  8. 21 Dec, 2022 1 commit
  9. 09 Nov, 2022 1 commit
      Fix CI issues related to cupy install (#2483) · 521d329b
      Committed by Michael Wyatt
      * remove any cupy install when setting up environments
      
      * revert previous changes to run on cu111 runners
      
      * fix for when no cupy is installed
      
      * remove cupy uninstall for workflows not using latest torch version
      
      * update to cu116 for inference tests
      
      * fix pip uninstall line
      
      * move python environment list to after DS install
      
      * remove cupy uninstall
      
      * re-add --forked
      
      * fix how we get cupy version (should be based on nvcc version)
  10. 02 Nov, 2022 1 commit
      Fixes for various CI problems (#2457) · 825f9d48
      Committed by Michael Wyatt
      * check only major CUDA version in CI
      
      * update expected torch latest version
      
      * pin torch latest to 1.12 until issues with 1.13 are resolved
      
      * wrong expected torch version
      
      * Update nv-torch18-v100.yml
      
      * remove forked from pytest option due to cuda re-initialization errors
      
      * removed expected torch version from inference tests, causing errors currently
      
      * fix various bugs that popped up
      
      * move all tests over to cu111 runners, cu113 runners having problems
  11. 14 Oct, 2022 1 commit
  12. 05 Oct, 2022 1 commit
      Refactor remaining distributed tests (#2216) · ff427438
      Committed by Michael Wyatt
      * batch of refactored tests
      
      * more test refactoring
      
      * fp16 test refactor
      
      * more refactors
      
      * added DistributedFixture class
      
      * applied DistributedFixture to first batch of tests as a trial
      
      * added DistributedFixture test and documentation
      
      * last tests
      
      * fixes for refactored tests
      
      * remove subdirs in workflow files
      
      * fix pytest syntax error
      
      * fix another syntax error
      
      * update imports
      
      * use DistFixture with elastic checkpoint test
      
      * missing import
      
      * update to shared class tmpdir for elastic test
      
      * moved test files
      
      * avoid duplicate test file name
      
      * last refactor and moving test files
      
      * formatting
      
      * fix broken import
      
      * testing forked AMD tests
      
      * update abstract method
      
      * use blob storage for accelerate and transformers tests
      
      * upgrade torch for accelerate CI
      Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
  13. 10 Sep, 2022 1 commit
  14. 11 Aug, 2022 1 commit
  15. 02 Aug, 2022 1 commit
  16. 16 Jun, 2022 1 commit
  17. 15 Jun, 2022 1 commit
  18. 07 Jun, 2022 1 commit
  19. 25 May, 2022 1 commit
  20. 19 Mar, 2022 1 commit
      Add concurrency policy to CI workflow (#1844) · 2e1847d6
      Committed by Michael Wyatt
      * added concurrency to github actions
      
      * fixed problem where one workflow can cancel another workflow
      
      * added tmp file to help test concurrency policy on CI
      
      * removed tmp file to finish testing concurrency policy
  21. 15 Mar, 2022 1 commit