计算机科学
任务(项目管理)
粒度
多项式分布
人工智能
机器学习
度量(数据仓库)
自然语言处理
数据挖掘
程序设计语言
统计
数学
经济
管理
作者
Jia Tracy Shen,Michiharu Yamashita,Ethan Prihar,Neil T. Heffernan,Xintao Wu,Sean McGrew,Dongwon Lee
标识
DOI:10.1007/978-3-030-78292-4_33
摘要
Educational content labeled with proper knowledge components (KCs) are particularly useful to teachers or content organizers. However, manually labeling educational content is labor intensive and error-prone. To address this challenge, prior research proposed machine learning based solutions to auto-label educational content with limited success. In this work, we significantly improve prior research by (1) expanding the input types to include KC descriptions, instructional video titles, and problem descriptions (i.e., three types of prediction task), (2) doubling the granularity of the prediction from 198 to 385 KC labels (i.e., more practical setting but much harder multinomial classification problem), (3) improving the prediction accuracies by 0.5–2.3% using Task-adaptive Pre-trained BERT, outperforming six baselines, and (4) proposing a simple evaluation measure by which we can recover 56–73% of mispredicted KC labels. All codes and data sets in the experiments are available at: https://github.com/tbs17/TAPT-BERT
科研通智能强力驱动
Strongly Powered by AbleSci AI