diff --git a/fluid/PaddleCV/video/models/attention_cluster/README.md b/fluid/PaddleCV/video/models/attention_cluster/README.md index 9f538f86dc02be3a1f2795f36a1d55da2355dae6..6e27bde7adea3d5f08749f099f87e0632b02d62e 100644 --- a/fluid/PaddleCV/video/models/attention_cluster/README.md +++ b/fluid/PaddleCV/video/models/attention_cluster/README.md @@ -13,13 +13,15 @@ ## 模型简介 -Attention Cluster模型为ActivityNet Kinetics Challenge 2017中最佳序列模型。该模型通过带Shifting Opeation的Attention Clusters处理已抽取好的RGB、Flow、Audio数据,Attention Cluster结构如下图所示。 +Attention Cluster模型为ActivityNet Kinetics Challenge 2017中最佳序列模型。该模型通过带Shifting Opeation的Attention Clusters处理已抽取好的RGB、Flow、Audio特征数据,Attention Cluster结构如下图所示。


Multimodal Attention Cluster with Shifting Operation

+Shifting Operation通过对每一个attention单元的输出添加一个独立可学习的线性变换处理后进行L2-normalization,使得各attention单元倾向于学习特征的不同成分,从而让Attention Cluster能更好地学习不同分布的数据,提高整个网络的学习表征能力。 + 详细内容请参考[Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification](https://arxiv.org/abs/1711.09550) ## 数据准备