未验证 提交 77570874 编写于 作者: Z zhurou603 提交者: GitHub

support profiler Distributed Summary for CustomDevice (#53405)

上级 09f8e31d
......@@ -472,7 +472,11 @@ class DistributedSummary:
for runtimenode in hostnode.runtime_node:
for devicenode in runtimenode.device_node:
if devicenode.type == TracerEventType.Kernel:
if 'nccl' in devicenode.name.lower():
kernel_name = devicenode.name.lower()
if (
'nccl' in kernel_name
or 'xccl' in kernel_name
):
self.gpu_communication_range.append(
(devicenode.start_ns, devicenode.end_ns)
)
......
Markdown is supported
0% .
You are about to add 0 people to the discussion. Proceed with caution.
先完成此消息的编辑!
想要评论请 注册