Improving Automated Evaluation of Student Text Responses Using GPT-3.5 for Text Data Augmentation

计算机科学人工智能分类标记数据集合（抽象数据类型）训练集稀缺机器学习程序设计语言经济微观经济学

作者

Keith Cochran,Clayton Cohn,Jean-François Rouet,Peter Hastings

出处

期刊：Lecture Notes in Computer Science 日期：2023-01-01 卷期号：: 217-228

链接

hal.science hal.sciencedoi.org

标识

DOI：10.1007/978-3-031-36272-9_18

摘要

In education, intelligent learning environments allow students to choose how to tackle open-ended tasks while monitoring performance and behavior, allowing for the creation of adaptive support to help students overcome challenges. Timely feedback is critical to aid students’ progression toward learning and improved problem-solving. Feedback on text-based student responses can be delayed when teachers are overloaded with work. Automated evaluation can provide quick student feedback while easing the manual evaluation burden for teachers in areas with a high teacher-to-student ratio. Current methods of evaluating student essay responses to questions have included transformer-based natural language processing models with varying degrees of success. One main challenge in training these models is the scarcity of data for student-generated data. Larger volumes of training data are needed to create models that perform at a sufficient level of accuracy. Some studies have vast data, but large quantities are difficult to obtain when educational studies involve student-generated text. To overcome this data scarcity issue, text augmentation techniques have been employed to balance and expand the data set so that models can be trained with higher accuracy, leading to more reliable evaluation and categorization of student answers to aid teachers in the student’s learning progression. This paper examines the text-generating AI model, GPT-3.5, to determine if prompt-based text-generation methods are viable for generating additional text to supplement small sets of student responses for machine learning model training. We augmented student responses across two domains using GPT-3.5 completions and used that data to train a multilingual BERT model. Our results show that text generation can improve model performance on small data sets over simple self-augmentation.

求助该文献

最长约 10秒，即可获得该文献文件

Improving Automated Evaluation of Student Text Responses Using GPT-3.5 for Text Data Augmentation

今日热心研友