Research on learning behavior patterns from the perspective of educational data mining: Evaluation, prediction and visualization

计算机科学 数据挖掘 教育数据挖掘 朴素贝叶斯分类器 主成分分析 聚类分析 C4.5算法 机器学习 随机森林 分类器(UML) 人工智能 统计的 透视图(图形) 数据集 支持向量机 数学 统计
作者
Guiyun Feng,Muwei Fan
出处
期刊:Expert Systems With Applications [Elsevier BV]
卷期号:237: 121555-121555 被引量:10
标识
DOI:10.1016/j.eswa.2023.121555
摘要

The rapid growth of educational data creates the requirement to mine useful information from learning behavior patterns. The development of data mining technology makes educational data mining possible. The paper intends to use a public educational data set to study learning behavior patterns from the perspective of educational data mining, so as to promote the innovation of educational management. Firstly, in order to reduce the dimension of data analysis that facilitates the improvement in efficiency, principal component analysis is carried out to reduce the number of attributes in the data set. The significant attributes in the rotating principal component matrix rather than principal components which are not closely related to learning behavior patterns are extracted as the research variables. Then, a pseudo statistic is proposed to determine the number of clusters and the preprocessed data set is clustered according to the extracted attributes. The clustering results are applied to add class labels to the data, which is convenient for the later data training. Finally, six classification algorithms J48, K-Nearest Neighbor, Bayes Net, Random Forest, Support Vector Machine and Logit Boost are used to train the data with labels and build prediction models. At the same time, the performance and applicable conditions of six classifiers in terms of accuracy, efficiency, error, and so on are discussed and compared. It is found that the performance of the integrated algorithm is better than that of a single classifier. In the integrated algorithm, compared with Random Forest, the running time of Logit Boost is shorter.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
可可西里完成签到 ,获得积分10
2秒前
4秒前
4秒前
只争朝夕应助科研通管家采纳,获得10
5秒前
只争朝夕应助科研通管家采纳,获得10
5秒前
开胃咖喱完成签到,获得积分10
10秒前
科研你好科研再见完成签到,获得积分10
11秒前
Xx完成签到 ,获得积分10
11秒前
脑洞疼应助arniu2008采纳,获得30
12秒前
风之旅完成签到,获得积分10
14秒前
陌殇完成签到,获得积分10
14秒前
15秒前
星辉的斑斓完成签到 ,获得积分10
16秒前
皇帝的床帘完成签到,获得积分10
17秒前
17秒前
yiding完成签到 ,获得积分10
18秒前
一个美女完成签到,获得积分10
20秒前
毛毛余发布了新的文献求助10
22秒前
无极微光应助Bananana采纳,获得20
23秒前
23秒前
喜喜不嘻嘻发布了新的文献求助100
23秒前
过时的冬易完成签到,获得积分10
23秒前
Eric完成签到,获得积分10
25秒前
南猫喵完成签到,获得积分10
27秒前
牧青发布了新的文献求助10
27秒前
27秒前
28秒前
爱吃草莓的玉米完成签到 ,获得积分10
29秒前
小马甲应助arniu2008采纳,获得30
30秒前
45275357完成签到 ,获得积分10
32秒前
Buduan发布了新的文献求助10
33秒前
36秒前
37秒前
善良的碧灵完成签到,获得积分10
38秒前
123456完成签到,获得积分10
38秒前
shiyi0709完成签到,获得积分10
39秒前
Arya发布了新的文献求助10
40秒前
李嘉图完成签到,获得积分10
40秒前
willa发布了新的文献求助10
40秒前
42秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
PowerCascade: A Synthetic Dataset for Cascading Failure Analysis in Power Systems 2000
Picture this! Including first nations fiction picture books in school library collections 1000
Signals, Systems, and Signal Processing 610
Unlocking Chemical Thinking: Reimagining Chemistry Teaching and Learning 555
Photodetectors: From Ultraviolet to Infrared 500
Cancer Targets: Novel Therapies and Emerging Research Directions (Part 1) 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6359032
求助须知:如何正确求助?哪些是违规求助? 8173021
关于积分的说明 17212158
捐赠科研通 5414033
什么是DOI,文献DOI怎么找? 2865350
邀请新用户注册赠送积分活动 1842737
关于科研通互助平台的介绍 1690871