计算机科学
背景(考古学)
重新使用
水准点(测量)
词汇
任务(项目管理)
特征(语言学)
人工智能
集合(抽象数据类型)
多项选择
自然语言处理
人机交互
过程(计算)
考试(生物学)
机器学习
法学
政治学
生态学
语言学
生物
阅读(过程)
地理
哲学
程序设计语言
管理
操作系统
大地测量学
经济
古生物学
作者
Semere Kiros Bitew,Amir Hadifar,Lucas Sterckx,Johannes Deleu,Chris Develder,Thomas Demeester
出处
期刊:IEEE Transactions on Learning Technologies
[Institute of Electrical and Electronics Engineers]
日期:2022-12-05
卷期号:17: 375-390
被引量:3
标识
DOI:10.1109/tlt.2022.3226523
摘要
Multiple choice questions (MCQs) are widely used in digital learning systems, as they allow for automating the assessment process. However, due to the increased digital literacy of students and the advent of social media platforms, MCQ tests are widely shared online, and teachers are continuously challenged to create new questions, which is an expensive and time-consuming task. A particularly sensitive aspect of MCQ creation is to devise relevant distractors, i.e., wrong answers that are not easily identifiable as being wrong. This paper studies how a large existing set of manually created answers and distractors for questions over a variety of domains, subjects, and languages can be leveraged to help teachers in creating new MCQs, by the smart reuse of existing distractors. We built several data-driven models based on context-aware question and distractor representations, and compared them with static feature-based models. The proposed models are evaluated with automated metrics and in a realistic user test with teachers. Both automatic and human evaluations indicate that context-aware models consistently outperform a static feature-based approach. For our best-performing context-aware model, on average 3 distractors out of the 10 shown to teachers were rated as high-quality distractors. We create a performance benchmark, and make it public, to enable comparison between different approaches and to introduce a more standardized evaluation of the task. The benchmark contains a test of 298 educational questions covering multiple subjects & languages and a 77k multilingual pool of distractor vocabulary for future research.
科研通智能强力驱动
Strongly Powered by AbleSci AI