Commit b2e118bf, authored by Logan Adams

Update H100 workflow to open an issue if nightly CI fails

Parent aa54dba0
---
name: CI failure report
about: Report a DeepSpeed CI failure
title: "{{ env.TESTNAME }} CI test failure"
labels: ci-failure
assignees: ''
---
The Nightly CI for {{ env.TESTNAME }} failed.
@@ -8,6 +8,10 @@ concurrency:
  group: ${{ github.workflow }}-${{ github.ref }}
  cancel-in-progress: true
permissions:
  contents: read
  issues: write
jobs:
  unit-tests:
    runs-on: [self-hosted, nvidia, h100]
@@ -49,3 +53,12 @@ jobs:
          cd tests
          python -m pytest $PYTEST_OPTS -n 4 unit/ --torch_ver="2.0" --cuda_ver="12"
          python -m pytest $PYTEST_OPTS -m 'sequential' unit/ --torch_ver="2.0" --cuda_ver="12"
      - name: Open GitHub issue if nightly CI fails
        if: failure()
        uses: JasonEtco/create-an-issue@v2
        env:
          GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}
          TESTNAME: nv-h100
        with:
          filename: .github/ISSUE_TEMPLATE/ci_failure_report.md