计算机科学
人工智能
视觉对象识别的认知神经科学
感知
任务(项目管理)
目标检测
对象(语法)
人机交互
视觉感受
机器学习
模式识别(心理学)
工程类
心理学
系统工程
神经科学
标识
DOI:10.1145/3581783.3613435
摘要
Human beings possess the remarkable ability to recognize unseen concepts by integrating their visual perception of known concepts with some high-level descriptions. However, the best-performing deep learning frameworks today are supervised learners that struggle to recognize concepts without training on their labeled visual samples. Zero-shot learning (ZSL) has recently emerged as a solution that mimics humans and leverages multimodal information to transfer knowledge from seen to unseen concepts. This study aims to emphasize the practicality of ZSL, unlocking its potential across four different applications in computer vision, namely -- object recognition, object detection, action recognition, and human-object interaction detection. Several task-specific challenges are identified and addressed in the presented research hypotheses. Zero-shot frameworks are proposed to attain state-of-the-art performance, elucidating some future research directions as well.
科研通智能强力驱动
Strongly Powered by AbleSci AI