A Survey On Educational Data Mining Techniques
Abstract - Educational data mining (EDM) has a high impact in the academic domain. Its methods play a major role in increasing knowledge among students. EDM explores students' behavioral patterns and offers ideas for understanding them, which helps students choose the correct path for their career. This survey focuses on that area and discusses the various techniques that apply educational data mining to improve students' knowledge. It also discusses different types of EDM tools and techniques. Among these tools and techniques, the best categories are suggested for real-world usage.

Key words: Educational Data Mining, Web Mining, E-Learning, Data Mining Techniques

I. INTRODUCTION

Collecting relevant student records and analyzing them within a huge record set remains a difficult task for researchers. Data mining, the process of extracting hidden information from large databases, provides a meaningful solution for educational data. Researchers also face many problems when implementing a system developed for educational data mining on different platforms. The large number of developments in educational courses makes choosing the best course a difficult task for students. Current web-based course applications do not provide learning materials that take the student's mentality into account. A user-friendly environment for a web-based educational system remains a good route to a richer learning environment. In the traditional education system, students share their learning experiences through one-to-one interaction and a continual evaluation process [1]. Classroom evaluation proceeds by observing students' attitudes, analyzing records, and using student appraisal in teaching strategies. Such supervision is not possible when students are working in the IT field, so the pedagogue must choose other techniques to obtain classroom data.

Institutions that run websites for distance learning collect huge amounts of data, gathered automatically from server access logs and the web server. Web-based learning analysis tools available online increase the interaction data between academicians and students [2]. A more effective learning environment can be achieved by applying data mining techniques. The stages of these techniques run from pre-processing to post-processing, following the KDD (knowledge discovery in databases) process to identify the necessary educational data. E-commerce, another web-based domain, also uses data mining techniques, and these help advance educational data mining. The e-learning process offers an effective means of improving the educational data mining process. Some differences between e-learning and e-commerce systems are discussed below [3].

An e-commerce site stores transaction data for money transfers. The commercial website passes control of the transaction to the bank's website application, and the purchase takes place. E-learning technology is used for learning from websites. Users can gather the knowledge they need through the web-based learning process. The web applications used on such websites are very useful in providing knowledge to learners. Web-based applications store information in web access log files, which in turn record what users do on the websites.

In the educational system, knowledge assessment techniques are applied to improve students' learning process. Formative assessment evaluates the continuous improvement of students' learning capacity. It also helps the educator improve instructional materials. Data mining techniques help the educator make academic decisions when designing or editing the teaching methodology. Educational data mining follows the common data mining methods. Extracted information should re-enter the loop of the system and guide the fine-tuning and refinement of learning [4]. This data not only becomes knowledge but also improves the mined knowledge used for decision making. The rest of this survey is organized as follows. Section 2 discusses the basic concepts of educational data mining. Section 3 discusses the tools used for educational data mining. Section 4 illustrates the applications and techniques of educational data mining. Finally, Section 5 concludes the survey.

II. EDUCATIONAL DATA MINING

Educational data mining plays a major role in society and in the educational area. The data mining sequence applied in an educational system can be represented by the diagram shown in Figure 1.
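
As a rough illustration of the sequence described above, in which web access logs are collected, pre-processed, mined, and post-processed, the following Python sketch may help. It is not taken from any of the surveyed systems. It assumes that the server writes logs in the Common Log Format and that the authenticated-user field holds a student identifier. The function names, the list of static file extensions, the activity threshold, and the sample log lines are all hypothetical and chosen only for illustration.

```python
import re
from collections import Counter, defaultdict

# Common Log Format: host ident user [time] "method path protocol" status size
LOG_PATTERN = re.compile(
    r'(?P<host>\S+) \S+ (?P<user>\S+) \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) [^"]*" (?P<status>\d{3}) (?P<size>\S+)'
)

# Assumption: requests for these file types are page furniture, not learning activity.
STATIC_SUFFIXES = (".css", ".js", ".png")


def preprocess(lines):
    """Pre-processing: parse raw lines and drop malformed, anonymous,
    unsuccessful, and static-asset requests."""
    records = []
    for line in lines:
        match = LOG_PATTERN.match(line)
        if match is None:
            continue  # malformed line
        rec = match.groupdict()
        if rec["user"] == "-" or not rec["status"].startswith("2"):
            continue  # anonymous visitor or failed request
        if rec["path"].endswith(STATIC_SUFFIXES):
            continue  # static asset
        records.append(rec)
    return records


def mine(records):
    """Mining step (deliberately simple): count page views per student."""
    views = defaultdict(Counter)
    for rec in records:
        views[rec["user"]][rec["path"]] += 1
    return views


def postprocess(views, min_views=2):
    """Post-processing: summarise each student and flag low activity.
    The threshold min_views is an arbitrary illustrative value."""
    summary = {}
    for user, pages in views.items():
        total = sum(pages.values())
        summary[user] = {
            "total_views": total,
            "distinct_pages": len(pages),
            "low_activity": total < min_views,
        }
    return summary


# Made-up sample lines, for illustration only.
SAMPLE_LOG = [
    '10.0.0.1 - s001 [01/Dec/2016:10:00:00 +0000] "GET /course/unit1 HTTP/1.1" 200 5120',
    '10.0.0.1 - s001 [01/Dec/2016:10:05:00 +0000] "GET /course/unit2 HTTP/1.1" 200 4096',
    '10.0.0.1 - s001 [01/Dec/2016:10:05:01 +0000] "GET /static/site.css HTTP/1.1" 200 900',
    '10.0.0.2 - s002 [01/Dec/2016:11:00:00 +0000] "GET /course/unit1 HTTP/1.1" 200 5120',
    '10.0.0.3 - - [01/Dec/2016:11:30:00 +0000] "GET /course/unit1 HTTP/1.1" 200 5120',
    '10.0.0.2 - s002 [01/Dec/2016:11:40:00 +0000] "GET /course/quiz1 HTTP/1.1" 404 300',
    'not a log line',
]

summary = postprocess(mine(preprocess(SAMPLE_LOG)))
for student, stats in sorted(summary.items()):
    print(student, stats)
```

In a real study the mining step would be replaced by one of the classification or clustering techniques surveyed in this paper. The per-student counts here only stand in for that step, so that the pre-processing and post-processing stages around it are visible.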
Integrated Intelligent Research (IIR) International Journal of Data Mining Techniques and Applications
Volume: 05 Issue: 02 December 2016, Page No.167-171
ISSN: 2278-2419
Table 2: Techniques for Educational Data Mining
Dorina Kabakchieva followed the CRISP-DM (Cross-Industry Standard Process for Data Mining) approach, a non-proprietary, freely available and application-neutral standard for data mining projects, to build the data mining model. The author also compares the J48 decision tree classifier with the NaiveBayes and BayesNet classifiers under both 10-fold cross-validation and percentage split, and reports the weighted averages. The same 10-fold cross-validation and percentage-split comparison was carried out for the k-NN classifier (with k=100 and k=250) and for the OneR and JRip classifiers [25]. Abeer Badr El Din Ahmed uses the decision tree method to predict students' performance with the help of the ID3 algorithm [26]. Xing Wanli introduces student prediction measures based on different rules, and uses genetic operators for classification and the evaluation of offspring to analyse student participation [27]. Ashwin Satyanarayana uses multiple classifiers, such as J48, NaiveBayes, and Random Forest, to classify student predictions. The author also uses the K-means clustering algorithm to calculate the average cluster centroids of similar student clusters [28].

V. CONCLUSION

Educational data mining is a valuable research area that benefits society by giving academicians, teachers and students useful prediction techniques. The papers discussed in this survey give a detailed picture of educational data mining and the core paths of EDM. The techniques and tools discussed here will give young educational data mining researchers a clear idea of how to carry out their work in this field. This work also covers the areas in which data mining processes can be applied to educational data in a better way. Finally, the surveyed work suggests that most classification algorithms perform well in helping students and academicians understand the current trends of EDM.

References

[1] Sheard, J., Ceddia, J., Hurst, J. & Tuovinen, J., “Inferring student learning behaviour from website interactions: A usage analysis”, Journal of Education and Information Technologies, 2003, Vol. 8(3), pp. 245-266.
[2] Sheard, J., Ceddia, J., Hurst, J. & Tuovinen, J., “Determining website usage time from interactions: Data preparation and analysis”, Journal of Educational Technology Systems, 2003, Vol. 32(1), pp. 101-121.
[3] Muehlenbrock, Martin, “Automatic action analysis in an interactive learning environment”, Proceedings of the 12th International Conference on Artificial Intelligence in Education, 2005, pp. 452-455.
[4] Anuradha, C. and Velmurugan, T., “A Data Mining based Survey on Student Performance Evaluation System”, IEEE
Int. Conference on Computational Intelligence and Computing Research, 2014, pp. 452-455.
[5] Zaiane, O. & Luo, J., “Web usage mining for a better web-based learning environment”, In Proceedings of conference on advanced technology for education, Banff, Alberta, 2001, pp. 60-64.
[6] Silva, D. & Vieira, M., “Using data warehouse and data mining resources for ongoing assessment in distance learning”, In IEEE international conference on advanced learning technologies, Kazan, Russia, 2002, pp. 40-45.
[7] Shen, Ruimin, Fan Yang, and Peng Han, “Data analysis center based on e-learning platform”, The Internet Challenge: Technology and Applications, Springer Netherlands, 2002, pp. 19-28.
[8] Cristobal Romero, Sebastian Ventura, Paul de Bra & Carlos de Castro, “Discovering prediction rules in AHA! Courses”, International Conference on User Modeling, Springer Berlin Heidelberg, 2003, pp. 25-34.
[9] Tane, Julien, Christoph Schmitz, and Gerd Stumme, “Semantic resource management for the web: an e-learning application”, Proceedings of the 13th international World Wide Web conference on Alternate track papers & posters, ACM, 2004, pp. 1-10.
[10] Merceron, Agathe, and Kalina Yacef, “Tada-ed for educational data mining”, Interactive multimedia electronic journal of computer-enhanced learning, 2005, Vol. 7(1), pp. 267-287.
[11] Vanzin, Mariangela, Karin Becker, and Duncan Dubugras Alcoba Ruiz, “Ontology-based filtering mechanisms for web usage patterns retrieval”, International Conference on Electronic Commerce and Web Technologies, Springer Berlin Heidelberg, 2005, pp. 267-277.
[12] Avouris, N., Komis, V., Fiotakis, G., Margaritis, M. and Voyiatzaki, E., “Logging of fingertip actions is not enough for analysis of learning activities”, In 12th International Conference on Artificial Intelligence in Education, AIED 05 Workshop 1: Usage analysis in learning systems, 2005, pp. 1-8.
[13] Mazza, R. and Milani, C., “Exploring usage analysis in learning systems: Gaining insights from visualizations”, Workshop on usage analysis in learning systems at 12th international conference on artificial intelligence in education, 2005, pp. 65-72.
[14] Mostow, J., Beck, J., Cen, H., Cuneo, A., Gouvea, E. & Heiner, C., “An educational data mining tool to browse tutor-student interactions: Time will tell”, Proceedings of the Workshop on Educational Data Mining, National Conference on Artificial Intelligence, AAAI Press, 2005, pp. 15-22.
[15] Damez, M., Marsala, C., Dang, T. & Bouchon-Meunier, B., “Fuzzy decision tree for user modeling from human-computer interactions”, In International conference on human system learning: Who is in control?, 2005, pp. 287-302.
[16] Bari, M. & Benzater, B., “Retrieving data from pdf interactive multimedia productions”, In International conference on human system learning: Who is in control?, 2005, pp. 321-330.
[17] Qasem A. Al-Radaideh, Emad M. Al-Shawakfa, Mustafa I. Al-Najjar, “Mining Student Data Using Decision Trees”, The 2006 International Arab Conference on Information Technology, Jordan, 2006, pp. 1-5.
[18] Romero, C., Alcala-Fdez, J., Sanchez, L., Garcia, S., del Jesus, M. J., Ventura, S., Garrell, J. M. & Fernandez, J., “KEEL: a software tool to assess evolutionary algorithms for data mining problems”, Soft Computing, 2009, Vol. 13(3), pp. 307-318.
[19] Marquez-Vera, C., Romero, C., “Predicting School Failure Using Data Mining”, Educational Data Mining, 2010, 2011.
[20] Dragan Gasevic, “Let’s not forget: Learning analytics are about learning”, Association for Educational Communications and Technology, 2015, Vol. 59(1), pp. 64-71.
[21] Romero, C., Ventura, S., Espejo, P. G. & Hervas, C., “Data mining algorithms to classify students”, Educational Data Mining 2007, 2007, pp. 1-10.
[22] Ramaswami, M. and Bhaskaran, R., “A CHAID based performance prediction model in educational data mining”, 2010, arXiv preprint arXiv:1002.1144.
[23] Edin Osmanbegovic & Mirza Suljic, “Data Mining Approach for Predicting Student Performance”, Journal of Economics and Business, 2012, Vol. 10(1), pp. 3-12.
[24] Surjeet Kumar Yadav, Saurabh Pal, “Data Mining: A Prediction for Performance Improvement of Engineering Students using Classification”, World of Computer Science and Information Technology Journal (WCSIT), 2012, Vol. 2(2), pp. 51-56.
[25] Dorina Kabakchieva, “Predicting Student Performance by Using Data Mining Methods for Classification”, Cybernetics and Information Technologies, 2013, Vol. 13(1), pp. 61-72.
[26] Abeer Badr El Din Ahmed and Ibrahim Sayed Elaraby, “Data Mining: A prediction for Student's Performance Using Classification Method”, World Journal of Computer Application and Technology, 2014, Vol. 2(2), pp. 43-47.
[27] Xing Wanli, Guo Rui, Petakovic Eva & Goggins Sean, “Participation-based student final performance prediction model through interpretable Genetic Programming: Integrating learning analytics, Educational data mining and theory”, Computers in Human Behavior, 2015, Vol. 47, pp. 168-181.
[28] Ashwin Satyanarayana, Gayathri Ravichandran, “Mining Student data by Ensemble Classification and Clustering for Profiling and Prediction of Student Academic Performance”, 2016 ASEE Mid-Atlantic Section Conference, 2016.