From ac2d2c6543bf9aefee18e77b23bf60c302cc012f Mon Sep 17 00:00:00 2001 From: Bin Lu Date: Wed, 9 Feb 2022 14:57:01 +0800 Subject: [PATCH] Update readme.md --- test_tipc/supplementary/readme.md | 7 +++++++ 1 file changed, 7 insertions(+) diff --git a/test_tipc/supplementary/readme.md b/test_tipc/supplementary/readme.md index b630b0f3..cc49f009 100644 --- a/test_tipc/supplementary/readme.md +++ b/test_tipc/supplementary/readme.md @@ -47,6 +47,13 @@ bash test_tipc/test_train_python.sh ./test_tipc/train_infer_python_PACT.txt 'lit bash test_tipc/test_train_python.sh ./test_tipc/train_infer_python_FPGM.txt 'lite_train_lite_infer' ``` +多机多卡的运行配置文件分别为'train_infer_python_fleet.txt', 'train_infer_python_FPGM_fleet.txt', 'train_infer_python_PACT_fleet.txt'. +运行时,需要修改配置文件中的`gpu_list:xx.xx.xx.xx,yy.yy.yy.yy;0,1`. 将`xx.xx.xx.xx` 和 `yy.yy.yy.yy`替换为具体的 `ip` 地址。 另外,和单机训练 +不同,多机多卡训练需要在多机的每个节点上分别运行命令。 以多机多卡量化训练为例, 指令如下: +``` +bash test_tipc/test_train_python.sh ./test_tipc/train_infer_python_PACT_fleet.txt 'lite_train_lite_infer' +``` + 运行相应指令后,在`test_tipc/output`文件夹下自动会保存运行日志。如'lite_train_lite_infer'模式运行后,在test_tipc/extra_output文件夹有以下文件: ``` -- GitLab