Computer Science ›› 2018, Vol. 45 ›› Issue (6A): 437-441.

• Big Date & Date Mining • Previous Articles     Next Articles

Research on Data Mining Algorithm Based on Examination Process and Knowledge Structure

DAI Ming-zhu,GAO Song-feng   

  1. School of Mechanical-electronic and Vehicle Engineering,Beijing University of Civil Engineering and Architecture,Beijing 100044,China
  • Online:2018-06-20 Published:2018-08-03

Abstract: In order to study the mastery of knowledge points at different stages of student,based on the theory of data mining,knowledge structure was combined with examination results to study data.Based on the theory of educational measurement and the decision tree algorithm of data mining,an improved algorithm was proposed according to the original C4.5 algorithm,applying the difficulty level of the knowledge points involved in the test papers and the knowledge structure to refine the knowledge structure in order to determine the degree of knowledge of individual students or groups of students and the relationship between the knowledge points.The experimental results show that the efficiency of the improved algorithm is improved,whose formula is simple and practical compared with the original formula.According to the decision tree model,the remaining data is used to verify the improved formula,and it is faster to draw the conclusion that the effect of knowledge points on programming is relatively important.Test data is used to verify the decision tree,and the accuracy rate is 90%.Finally,a visual display of the decision tree can give an effective reference for students to learn the arrangements,teachers to develop teaching programs and arrangements.

Key words: C4.5, Data mining, Decision tree, Knowledge structure, Paper analysis

CLC Number: 

  • TP391
[1]白彦辉.关联规则挖掘在试卷分析系统中的应用[J].内蒙古民族大学学报(自然科学版),2012,27(2):159-161.
[2]牛瑞敏.数据挖掘在国内教育领域应用的研究综述[J].中山大学研究生学刊(人文社会科学版),2016,37(2):193-200.
[3]刘志妩.基于决策树算法的学生成绩的预测分析[J].计算机应用与软件,2012,29(11):312-314,330.
[4]王黎黎,刘学军.决策树C4.5算法在成绩分析中的应用[J].河南工程学院学报( 自然科学版),2014,26(4):69-73.
[5]胡庆.基于决策树的试卷知识点掌握程度分析研究[D].南昌:江西财经大学,2014.
[6]段薇,马丽,路向阳.基于信息增益和最小距离分类的决策树改进算法[J].科学技术与工程,2013,13(6):1643-16552.
[7]NOH C H,CHO K C,MA Y B,et al.Grid resource selection system using decision tree method [J].Korea Soc Comput Inf,2009,13(1):1-10.
[8]阮晓宏,黄小猛,袁鼎荣,等.基于异构代价敏感决策树的分类器算法[J].计算机科学,2013,40(11):140-142,146.
[9]毛国君,段立娟.数据挖掘原理与算法[M].北京:清华大学出版社,2016:128-137.
[10]于孝美,陈贞翔,彭立志.基于决策树的网络流量分类方法[J].济南大学学报(自然科学版),2012,26(3):291-295.
[11]王领,胡扬.基于C4.5决策树的股票数据挖掘[J].计算机与现代化,2015(10):20-24.
[12]JIANG W L.Research and Application of Credit Score Based on Decision Tree Model∥Applied Informatics and Communication.Springer Berlin Heidelberg,2011:493-501.
[13]黄秀霞,孙力.C4.5算法的优化[J].计算机工程与设计,2016,67(5):1267-1271.
[14]宋万洋,李国和,洪云峰,等.基于平衡准确率和规模的决策树剪枝算法[J].科学技术与工程,2016,16(16):79-82.
[15]KANTARDZIE M.Data mining:concepts models methods and algorithms[M].John Wiley & Sons,Inc.,2004.
[1] LI Rong-fan, ZHONG Ting, WU Jin, ZHOU Fan, KUANG Ping. Spatio-Temporal Attention-based Kriging for Land Deformation Data Interpolation [J]. Computer Science, 2022, 49(8): 33-39.
[2] YAO Xiao-ming, DING Shi-chang, ZHAO Tao, HUANG Hong, LUO Jar-der, FU Xiao-ming. Big Data-driven Based Socioeconomic Status Analysis:A Survey [J]. Computer Science, 2022, 49(4): 80-87.
[3] KONG Yu-ting, TAN Fu-xiang, ZHAO Xin, ZHANG Zheng-hang, BAI Lu, QIAN Yu-rong. Review of K-means Algorithm Optimization Based on Differential Privacy [J]. Computer Science, 2022, 49(2): 162-173.
[4] REN Shou-peng, LI Jin, WANG Jing-ru, YUE Kun. Ensemble Regression Decision Trees-based lncRNA-disease Association Prediction [J]. Computer Science, 2022, 49(2): 265-271.
[5] LIU Zhen-yu, SONG Xiao-ying. Multivariate Regression Forest for Categorical Attribute Data [J]. Computer Science, 2022, 49(1): 108-114.
[6] ZHANG Ya-di, SUN Yue, LIU Feng, ZHU Er-zhou. Study on Density Parameter and Center-Replacement Combined K-means and New Clustering Validity Index [J]. Computer Science, 2022, 49(1): 121-132.
[7] MA Dong, LI Xin-yuan, CHEN Hong-mei, XIAO Qing. Mining Spatial co-location Patterns with Star High Influence [J]. Computer Science, 2022, 49(1): 166-174.
[8] CAO Yang-chen, ZHU Guo-sheng, QI Xiao-yun, ZOU Jie. Research on Intrusion Detection Classification Based on Random Forest [J]. Computer Science, 2021, 48(6A): 459-463.
[9] TANG Liang, LI Fei. Research on Forecasting Model of Internet of Vehicles Security Situation Based on Decision Tree [J]. Computer Science, 2021, 48(6A): 514-517.
[10] XU Hui-hui, YAN Hua. Relative Risk Degree Based Risk Factor Analysis Algorithm for Congenital Heart Disease in Children [J]. Computer Science, 2021, 48(6): 210-214.
[11] DING Si-fan, WANG Feng, WEI Wei. Relief Feature Selection Algorithm Based on Label Correlation [J]. Computer Science, 2021, 48(4): 91-96.
[12] ZHANG Yan-jin, BAI Liang. Fast Symbolic Data Clustering Algorithm Based on Symbolic Relation Graph [J]. Computer Science, 2021, 48(4): 111-116.
[13] ZHANG Han-shuo, YANG Dong-ju. Technology Data Analysis Algorithm Based on Relational Graph [J]. Computer Science, 2021, 48(3): 174-179.
[14] ZOU Cheng-ming, CHEN De. Unsupervised Anomaly Detection Method for High-dimensional Big Data Analysis [J]. Computer Science, 2021, 48(2): 121-127.
[15] LIU Xin-bin, WANG Li-zhen, ZHOU Li-hua. MLCPM-UC:A Multi-level Co-location Pattern Mining Algorithm Based on Uniform Coefficient of Pattern Instance Distribution [J]. Computer Science, 2021, 48(11): 208-218.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!