计算机科学
分类学(生物学)
数据科学
人气
人工智能
生物识别
介绍(产科)
情报检索
心理学
植物
医学
社会心理学
生物
放射科
作者
William C. Sleeman,Rishabh Kapoor,Preetam Ghosh
摘要
Multimodal classification research has been gaining popularity with new datasets in domains such as satellite imagery, biometrics, and medicine. Prior research has shown the benefits of combining data from multiple sources compared to traditional unimodal data that has led to the development of many novel multimodal architectures. However, the lack of consistent terminologies and architectural descriptions makes it difficult to compare different solutions. We address these challenges by proposing a new taxonomy for describing multimodal classification models based on trends found in recent publications. Examples of how this taxonomy could be applied to existing models are presented as well as a checklist to aid in the clear and complete presentation of future models. Many of the most difficult aspects of unimodal classification have not yet been fully addressed for multimodal datasets, including big data, class imbalance, and instance-level difficulty. We also provide a discussion of these challenges and future directions of research.
科研通智能强力驱动
Strongly Powered by AbleSci AI