答疑
计算机科学
集合(抽象数据类型)
人工智能
理解力
任务(项目管理)
阅读(过程)
鹌鹑
注释
自然语言处理
常识
情报检索
知识抽取
语言学
程序设计语言
经济
管理
哲学
内分泌学
医学
作者
Anna Rogers,Olga Kovaleva,Matthew T. Downey,Anna Rumshisky
出处
期刊:Proceedings of the ... AAAI Conference on Artificial Intelligence
[Association for the Advancement of Artificial Intelligence (AAAI)]
日期:2020-04-03
卷期号:34 (05): 8722-8731
被引量:68
标识
DOI:10.1609/aaai.v34i05.6398
摘要
The recent explosion in question answering research produced a wealth of both factoid reading comprehension (RC) and commonsense reasoning datasets. Combining them presents a different kind of task: deciding not simply whether information is present in the text, but also whether a confident guess could be made for the missing information. We present QuAIL, the first RC dataset to combine text-based, world knowledge and unanswerable questions, and to provide question type annotation that would enable diagnostics of the reasoning strategies by a given QA system. QuAIL contains 15K multi-choice questions for 800 texts in 4 domains. Crucially, it offers both general and text-specific questions, unlikely to be found in pretraining data. We show that QuAIL poses substantial challenges to the current state-of-the-art systems, with a 30% drop in accuracy compared to the most similar existing dataset.
科研通智能强力驱动
Strongly Powered by AbleSci AI