计算机科学
模式
深度学习
多模式学习
领域(数学)
机器学习
人工智能
数据科学
数学
社会科学
社会学
纯数学
作者
Dhanesh Ramachandram,Graham W. Taylor
出处
期刊:IEEE Signal Processing Magazine
[Institute of Electrical and Electronics Engineers]
日期:2017-11-01
卷期号:34 (6): 96-108
被引量:725
标识
DOI:10.1109/msp.2017.2738401
摘要
The success of deep learning has been a catalyst to solving increasingly complex machine-learning problems, which often involve multiple data modalities. We review recent advances in deep multimodal learning and highlight the state-of the art, as well as gaps and challenges in this active research field. We first classify deep multimodal learning architectures and then discuss methods to fuse learned multimodal representations in deep-learning architectures. We highlight two areas of research-regularization strategies and methods that learn or optimize multimodal fusion structures-as exciting areas for future work.
科研通智能强力驱动
Strongly Powered by AbleSci AI