未验证 提交 85dc854b 编写于 作者: M mzl 提交者: GitHub

update ut/doc for glm/codegen (#4057)

* update ut/doc for glm/codegen

* formatting/spacing on docs

* re-order/alphabetize the models

---------
Co-authored-by: NLogan Adams <114770087+loadams@users.noreply.github.com>
Co-authored-by: NLogan Adams <loadams@microsoft.com>
上级 4cde5da8
......@@ -123,11 +123,14 @@ The following model families have been successfully tested with automatic tensor
- albert
- bert
- bigbird_pegasus
- bloom
- camembert
- codegen
- deberta_v2
- electra
- ernie
- esm
- glm
- gpt-j
- gpt-neo
- gpt-neox
......@@ -136,6 +139,7 @@ The following model families have been successfully tested with automatic tensor
- llama
- m2m_100
- marian
- mpt
- mvp
- nezha
- openai
......@@ -151,14 +155,11 @@ The following model families have been successfully tested with automatic tensor
- xglm
- xlm_roberta
- yoso
- bloom
- mpt
# Unsupported Models
The following models are not currently supported with automatic tensor parallelism. They may still be compatible with other DeepSpeed features (e.g., kernel injection for Bloom):
- codegen
- deberta
- flaubert
- fsmt
......
......@@ -478,12 +478,8 @@ class TestInjectionPolicy(DistributedTest):
@pytest.mark.seq_inference
@pytest.mark.parametrize(
"model_w_task",
[
("Helsinki-NLP/opus-mt-en-de", "translation"),
],
ids=[
"marian",
],
[("Helsinki-NLP/opus-mt-en-de", "translation"), ("Salesforce/codegen-350M-mono", "text-generation")],
ids=["marian", "codegen"], #codegen has fusedqkv weight.
)
@pytest.mark.parametrize("dtype", [torch.float16], ids=["fp16"])
class TestAutoTensorParallelism(DistributedTest):
......
Markdown is supported
0% .
You are about to add 0 people to the discussion. Proceed with caution.
先完成此消息的编辑!
想要评论请 注册