计算机科学
考试(生物学)
自然语言处理
语言能力
语言评估
人工智能
机器学习
数学教育
心理学
生物
古生物学
作者
Burr Settles,Geoffrey T. LaFlair,Masato Hagiwara
摘要
We describe a method for rapidly creating language proficiency assessments, and provide experimental evidence that such tests can be valid, reliable, and secure. Our approach is the first to use machine learning and natural language processing to induce proficiency scales based on a given standard, and then use linguistic models to estimate item difficulty directly for computer-adaptive testing. This alleviates the need for expensive pilot testing with human subjects. We used these methods to develop an online proficiency exam called the Duolingo English Test, and demonstrate that its scores align significantly with other high-stakes English assessments. Furthermore, our approach produces test scores that are highly reliable, while generating item banks large enough to satisfy security requirements.
科研通智能强力驱动
Strongly Powered by AbleSci AI