Learning to Reuse Distractors to Support Multiple-Choice Question Generation in Education

计算机科学背景（考古学）重新使用水准点（测量）词汇任务（项目管理）特征（语言学）人工智能集合（抽象数据类型）多项选择自然语言处理人机交互过程（计算）考试（生物学）机器学习法学政治学生态学语言学生物阅读（过程）地理哲学程序设计语言管理操作系统大地测量学经济古生物学

作者

Semere Kiros Bitew,Amir Hadifar,Lucas Sterckx,Johannes Deleu,Chris Develder,Thomas Demeester

出处

期刊：IEEE Transactions on Learning Technologies [Institute of Electrical and Electronics Engineers]
日期：2022-12-05 卷期号：17: 375-390 被引量：3

链接

arxiv.org arxiv.org handle.net ugent.bedoi.org

标识

DOI：10.1109/tlt.2022.3226523

摘要

Multiple choice questions (MCQs) are widely used in digital learning systems, as they allow for automating the assessment process. However, due to the increased digital literacy of students and the advent of social media platforms, MCQ tests are widely shared online, and teachers are continuously challenged to create new questions, which is an expensive and time-consuming task. A particularly sensitive aspect of MCQ creation is to devise relevant distractors, i.e., wrong answers that are not easily identifiable as being wrong. This paper studies how a large existing set of manually created answers and distractors for questions over a variety of domains, subjects, and languages can be leveraged to help teachers in creating new MCQs, by the smart reuse of existing distractors. We built several data-driven models based on context-aware question and distractor representations, and compared them with static feature-based models. The proposed models are evaluated with automated metrics and in a realistic user test with teachers. Both automatic and human evaluations indicate that context-aware models consistently outperform a static feature-based approach. For our best-performing context-aware model, on average 3 distractors out of the 10 shown to teachers were rated as high-quality distractors. We create a performance benchmark, and make it public, to enable comparison between different approaches and to introduce a more standardized evaluation of the task. The benchmark contains a test of 298 educational questions covering multiple subjects & languages and a 77k multilingual pool of distractor vocabulary for future research.

求助该文献

最长约 10秒，即可获得该文献文件

Learning to Reuse Distractors to Support Multiple-Choice Question Generation in Education

今日热心研友