汪文翔, 查铖, 闵卫东, 卢卓群, 余光华. 多注意力机制下自愈人脸表情识别[J]. 微电子学与计算机, 2022, 39(9): 55-62. DOI: 10.19304/J.ISSN1000-7180.2022.0029
引用本文: 汪文翔, 查铖, 闵卫东, 卢卓群, 余光华. 多注意力机制下自愈人脸表情识别[J]. 微电子学与计算机, 2022, 39(9): 55-62. DOI: 10.19304/J.ISSN1000-7180.2022.0029
WANG Wenxiang, ZHA Cheng, MIN Weidong, LU Zhuoqun, YU Guanghua. Self curing facial expression recognition based on multi-attention mechanism[J]. Microelectronics & Computer, 2022, 39(9): 55-62. DOI: 10.19304/J.ISSN1000-7180.2022.0029
Citation: WANG Wenxiang, ZHA Cheng, MIN Weidong, LU Zhuoqun, YU Guanghua. Self curing facial expression recognition based on multi-attention mechanism[J]. Microelectronics & Computer, 2022, 39(9): 55-62. DOI: 10.19304/J.ISSN1000-7180.2022.0029

多注意力机制下自愈人脸表情识别

Self curing facial expression recognition based on multi-attention mechanism

  • 摘要: 人脸表情识别技术在社会生活、刑事侦探等领域中具有重要应用价值和广阔应用前景.现有方法对表情特征提取不充分,使得高维特征易丢失局部关键信息;同时在复杂背景下表情的二义性导致网络泛化能力弱.为解决这些问题,本文提出一种多注意力机制下自愈网络(Multiple Attention Self Curing Network, MASCNet).该网络生成带有注意力权重的多尺度特征,通过融合不同尺度特征,提高网络模型在细粒度下对局部关键信息的表征能力.自注意力机制模块为融合后的特征分配重要性权重,约束不确定性样本在网络训练中所占比重,提高网络的泛化能力.本文方法在FER2013和RAF-DB数据集上的最高的识别正确率分别为74.21%和88.74%.实验结果表明该方法能有效识别人脸表情,优于现有MHBP、AHBRPN等主流方法.

     

    Abstract: Facial expression recognition technology has important application value and broad application prospects in social life, criminal detectives and other fields. Aiming at the problem of insufficient expression feature extraction in the existing methods, which makes high-dimensional features easy to lose local key information; And the ambiguity of expressions in complex backgrounds leads to weak network generalization. In order to solve these problems, a self curing network under multi-attention mechanism (MASCNet) is proposed. The network will generate multi-scale features with attention weights, and by fusing features of different scales, the ability of the network model to represent local key information at a fine-grained level is improved. The self-attention mechanism module can assign importance weights to the fused features, constrain the proportion of uncertain samples in network training, and improve the generalization ability of the network. The highest recognition accuracy rates of this method on the FER2013 and RAF-DB datasets are 74.21% and 88.74% respectively. Experimental results show that this method can effectively recognize facial expressions and is superior to the existing mainstream methods such as MHBP and AHBRPN.

     

/

返回文章
返回