未验证 提交 07514139 编写于 作者: Y Yuanle Liu 提交者: GitHub

add gpu_cpu_map_matmul_to_mul_pass to kGpuLowerPrecisionPasses (#49753)

* add gpu_cpu_map_matmul_to_mul_pass to kGpuLowerPrecisionPasses

* disable fc_elementwise_layernorm_fuse_pass in mixed precision
上级 4d5265b8
......@@ -193,8 +193,9 @@ const std::vector<std::string> kGpuLowerPrecisionPasses{
"fuse_multi_transformer_layer_pass",
"gpu_cpu_map_matmul_v2_to_mul_pass",
"gpu_cpu_map_matmul_v2_to_matmul_pass",
"gpu_cpu_map_matmul_to_mul_pass",
"fc_fuse_pass",
"fc_elementwise_layernorm_fuse_pass",
// "fc_elementwise_layernorm_fuse_pass",
"embedding_eltwise_layernorm_fuse_pass",
"inplace_op_var_pass"};
......
Markdown is supported
0% .
You are about to add 0 people to the discussion. Proceed with caution.
先完成此消息的编辑!
想要评论请 注册