要价
答疑
任务(项目管理)
计算机科学
芯(光纤)
考试(生物学)
对比度(视觉)
特征(语言学)
数据科学
人工智能
工程类
电信
生物
语言学
经济
哲学
古生物学
经济
系统工程
作者
Zhibin Liao,Anton van den Hengel,Johan Verjans
出处
期刊:Elsevier eBooks
[Elsevier]
日期:2024-01-01
卷期号:: 157-162
标识
DOI:10.1016/b978-0-323-90534-3.00002-0
摘要
Visual question answering (VQA) enables computers to answer new, open-ended questions in real-time given input imaging. It represents a particularly challenging task within machine learning (ML), not least because it requires answering previously unseen questions about previously unseen images. In this sense it addresses one of the core difficulties in designing ML systems to assist medical professionals, which is that it is impossible to predict the problems they will need to solve tomorrow. Traditional ML assumes that the problem to be solved is indeed predictable years in advance, to allow for the collection and labeling of the appropriate training data, and the training of a suitable model. VQA, in contrast, allows the question to be specified live, at test time. It might thus be used to enable a clinician to ask questions about the literature based on the specifics of the patient in front of them. It might enable a radiologist to ask whether a specified feature of a recent scan is comparable to anything in the database for patients of a particular demographic, or a researcher to use the latest public health records to seek immediate insight into an emerging global health challenge. In each case there is an opportunity to help a medical professional in real time to achieve a better outcome using ML to answer a question that wasn't foreseeable.
科研通智能强力驱动
Strongly Powered by AbleSci AI