计算机科学
卷积神经网络
建筑
人工智能
变压器
机器学习
数据科学
物理
量子力学
艺术
电压
视觉艺术
摘要
According to a publication by the World Health Organization, depression is projected to emerge as the leading mental health issue. In the domain of affective computing, deep learning techniques are frequently employed to represent facial dynamics using both local and global perspectives for the purpose of automatic depression detection (ADD). Yet, current models overlook the crucial interplay between local and global dynamics in discerning the significant features essential for ADD. Addressing this oversight, a novel hybrid computational architecture, named LOGLFormer, has been introduced. This architecture integrates CNN-derived local attributes and transformer-sourced global patterns tailored for ADD. Within LOGLFormer, the design philosophies of ResNet and ViT inspire the CNN and transformer branches, respectively. The synergy of these branches encompasses local convolutional mechanisms, self-attention strategies, and multilayer perceptron entities. Furthermore, the intricacies arising from disparities in CNN and transformer feature sets are reconciled through the specially devised feature alignment module. Rigorous comparative analysis underscores the distinctive efficacy of the LOGLFormer in recognizing depression, notably outperforming several state-of-the-art techniques on two dedicated depression databases: AVEC2013 and AVEC2014. Code will be available at https://github.com/helang818/LOGLFormer.
科研通智能强力驱动
Strongly Powered by AbleSci AI