计算机科学
图像融合
人工智能
传感器融合
融合
上下文图像分类
图像(数学)
模式识别(心理学)
计算机视觉
哲学
语言学
作者
Toufik Datsi,Khalid Aznag,Brahim Ait Ben Ali,Karim Karbout,Ahmed El Oirrak,El Kharrachi Khayya
标识
DOI:10.1109/gast60528.2024.10520797
摘要
Advancements in multimodal learning have experienced rapid growth over the past decade, particularly within various domains, with a significant emphasis on developments in computer vision. Multimodal data fusion has become increasingly prominent in the realm of image classification, where the integration of diverse data sources enhances the overall understanding and performance of classification models. This survey delves into the recent strides made in multimodal learning over the past decade, particularly within the field of image classification. Additionally, the paper undertakes a comparative study, critically evaluating the effectiveness and performance of different multimodal fusion approaches. The aim is to provide a comprehensive overview of the current state-of-the-art in multimodal data fusion for image classification and to identify key trends, challenges, and opportunities in this evolving field.
科研通智能强力驱动
Strongly Powered by AbleSci AI