Do we need to enhance matmul_op to support 4-D inputs?
Created by: lcy-seso
While checking the dot-product attention in ConvS2S and Transformer, I found that in multi-head (self-)attention, both inputs of the batched matrix multiplication can potentially be 4-D tensors.
It seems we could enhance the current matmul_op to accept 4-D tensors as inputs; however, I guess the right design depends on how the computation is batched to maximize speed.
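For reference, here is a minimal NumPy sketch (not Paddle code; the names and shapes are illustrative) of the semantics a 4-D-aware matmul_op would need: the two leading dimensions act as batch dimensions, and the multiplication applies to the trailing two.

```python
import numpy as np

batch, heads, seq_len, d_k = 2, 8, 10, 64

q = np.random.rand(batch, heads, seq_len, d_k)  # queries
k = np.random.rand(batch, heads, seq_len, d_k)  # keys

# np.matmul broadcasts over the leading (batch, heads) dims and
# multiplies the trailing (seq_len, d_k) x (d_k, seq_len) matrices.
scores = np.matmul(q, k.transpose(0, 1, 3, 2))
print(scores.shape)  # (2, 8, 10, 10)
```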
Alternatively, the multiple heads could simply be wrapped in a Python API using a for loop, as in the sketch below.
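A rough sketch of that alternative, again in NumPy for illustration only: each head is sliced out and handled by an ordinary 3-D batched matmul, so the op itself never sees a 4-D input.

```python
import numpy as np

batch, heads, seq_len, d_k = 2, 8, 10, 64
q = np.random.rand(batch, heads, seq_len, d_k)
k = np.random.rand(batch, heads, seq_len, d_k)

per_head = []
for h in range(heads):
    # q[:, h] and k[:, h] are 3-D: (batch, seq_len, d_k)
    per_head.append(np.matmul(q[:, h], k[:, h].transpose(0, 2, 1)))

# Re-stack the per-head results into (batch, heads, seq_len, seq_len).
scores = np.stack(per_head, axis=1)
print(scores.shape)  # (2, 8, 10, 10)
```

The trade-off, as I understand it, is that the loop issues one matmul per head, whereas a native 4-D matmul could batch all heads in a single call.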