计算机科学
数据科学
知识图
万维网
可用性
构造(python库)
透视图(图形)
人机交互
情报检索
人工智能
程序设计语言
作者
Jifan Yu,Yuquan Wang,Qingyang Zhong,Gan Luo,Y. Mao,Kai Sun,Wenzheng Feng,Wei Xu,Shulin Cao,Kai Zeng,Zijun Yao,Lei Hou,Yankai Lin,Peng Li,Jie Zhou,Bin Xu,Juanzi Li,Jie Tang,Maosong Sun
出处
期刊:Conference on Information and Knowledge Management
日期:2021-10-26
被引量:10
标识
DOI:10.1145/3459637.3482010
摘要
The prosperity of massive open online courses provides fodder for plentiful research efforts on adaptive learning. However, current open-access educational datasets are still far from sufficient to meet the need for various topics of adaptive learning. Existing released datasets often cover only small-scale data, lack fine-grained knowledge concepts. They are even difficult to curate and supplement due to platform limitations. In this work, we construct MOOCCubeX, a large, knowledge-centered repository consisting of 4,216 courses, 230,263 videos, 358,265 exercises, 637,572 fine-grained concepts and over 296 million behavioral data of 3,330,294 students, for supporting the research topics on adaptive learning in MOOCs. Licensed by XuetangX, one of the largest MOOC websites in China, we obtain abundant and diverse course resources and student behavioral data and are permitted to make subsequent periodic updates. We propose a framework to accomplish data processing, weakly supervised fine-grained concept graph mining, and data curation to improve usability and richness. Based on the fine-grained concepts, we re-organize the data from the knowledge perspective and acquire more external learning resources from the web. Our repository is now available at https://github.com/THU-KEG/MOOCCubeX.
科研通智能强力驱动
Strongly Powered by AbleSci AI