计算机科学
稳健性(进化)
人工智能
模式识别(心理学)
特征提取
分类器(UML)
模式
噪音(视频)
机器学习
情绪识别
噪声测量
语音识别
降噪
图像(数学)
社会学
基因
生物化学
化学
社会科学
作者
Sunan Li,Hailun Lian,Cheng Lu,Yan Zhao,Chuangao Tang,Yuan Zong,Wenming Zheng
标识
DOI:10.1145/3581783.3612867
摘要
The multimodal emotion recognition has attracted more attention in recent decades. Though remarkable progress has been achieved with the rapid development of deep learning, existing methods are still hard to tackle noise problems that occurred commonly in emotion recognition's practical application. To improve the robustness of the multimodal emotion recognition algorithm, we propose an MLP-based label revision algorithm. The framework consists of three complementary feature extraction networks that were verified in MER2023. After that, an MLP-based attention network with specially designed loss functions was used to fuse features from different modalities. Finally, the scheme that used the output probability of each emotion to revise the sample's output category was employed to revise the test set's label obtained by classifier. The samples that are most likely to be affected by noise and misclassified have a chance to get correct classification. The best experimental result shows that the F1-score of our algorithm on the test dataset of the MER 2023 Noise subchallenge is 86.35 and combined metric is 0.6694, which ranks 2nd at the MER 2023 NOISE subchallenge.
科研通智能强力驱动
Strongly Powered by AbleSci AI