Validity Arguments for Automated Essay Scoring of Young Students’ Writing Traits

写作评估一致性（知识库）背景（考古学）词汇心理学推论论证（复杂分析）形成性评价特质人工智能自然语言处理计算机科学数学教育语言学化学程序设计语言古生物学哲学生物生物化学

作者

L. Hannah,Eunice Eunhee Jang,Maitree Shah,Vaibhav Gupta

出处

期刊：Language Assessment Quarterly [Informa]
日期：2023-10-20 卷期号：20 (4-5): 399-420 被引量：3

标识

DOI：10.1080/15434303.2023.2288253

摘要

ABSTRACTMachines have a long-demonstrated ability to find statistical relationships between qualities of texts and surface-level linguistic indicators of writing. More recently, unlocked by artificial intelligence, the potential of using machines to identify content-related writing trait criteria has been uncovered. This development is significant, especially in formative assessment contexts where feedback is key. Yet the extent to which writing traits can be validly scored by machines remains under-researched, especially in the K-12 context. The present study investigated the validity of machine learning (ML) models designed for students in grades 3–6 to score three writing traits: task fulfillment, organization and coherence, and vocabulary and expression. The study utilized an argument-based approach, focusing on two primary inferences: evaluation and explanation. The evaluation inference investigated human-machine score alignment, the ability for the models to detect off-topic and gibberish responses, and the consistency of human-machine score alignment across grades and language backgrounds. The explanation inference investigated the relevance of features used in the models. Results indicated that human-machine score alignment was sufficient for all writing traits; however, validity concerns were raised regarding the models' performances detecting off-topic and gibberish responses and the consistency across sub-groups. Implications for language assessment professionals and other educators were discussed. Disclosure statementNo potential conflict of interest was reported by the author(s).EthicsThis research was approved by the social sciences, humanities, and education research ethics board of the University of Toronto, reference number 34,203.Additional informationFundingThe work was supported by the Social Sciences and Humanities Research Council of Canada [486987].

求助该文献

最长约 10秒，即可获得该文献文件

Validity Arguments for Automated Essay Scoring of Young Students’ Writing Traits

今日热心研友