Abstract

The increasing number of students dropping out is a major concern of higher educational institutions as it gives a great impact not only cost to the students but also a waste of public funds. Thus, it is imperative to understand which students are at risk of dropping out and what are the factors that contribute to higher dropout rates. This can be done using educational data mining. In this paper, we described the uses of data mining techniques to predict student dropout of Computer Science undergraduate students after 3 years of enrolment in Universiti Teknologi MARA. The experimental results showed an achievable reliable classification accuracy from the selected algorithm in predicting dropouts. Decision tree, logistic regression, random forest, K-nearest neighbour and neural network algorithm were compared to propose the best model. The results showed that some of the machines learning algorithms are able to establish effective predictive models from student retention data. The Logistic Regression model was found to be the best learners to predict the dropout students with identified potential subject causes. In addition, we also presented some findings related to data exploration.

Details

Title
Predicting Student Drop-Out in Higher Institution Using Data Mining Techniques
Author
Wan Yaacob, W F 1 ; N Mohd Sobri 2 ; Md Nasir, S A 2 ; Norshahidi, N D 2 ; Wan Husin, W Z 2 

 Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA Cawangan Kelantan, Bukit Ilmu, 18500 Machang, Kelantan, Malaysia.; Business Datalytics Research Group, Universiti Teknologi MARA Cawangan Kelantan, Kampus Kota Bharu, Lembah Sireh, 15050 Kota Bharu, Kelantan, Malaysia 
 Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA Cawangan Kelantan, Bukit Ilmu, 18500 Machang, Kelantan, Malaysia 
Publication year
2020
Publication date
Mar 2020
Publisher
IOP Publishing
ISSN
17426588
e-ISSN
17426596
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2569765810
Copyright
© 2020. This work is published under http://creativecommons.org/licenses/by/3.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.