Abstract

Educational Data Mining (EDM) is a branch of data mining that focuses on extraction of useful knowledge from data generated through academic activities at school, college or at university level. The extracted knowledge can help to perform the academic activities in a better way, so it is useful for students, parents and institutions themselves. One common activity in EDM is students grade prediction with an aim to identify weak or at-risk students. An early identification of such students helps to take supportive measures that may help students to improve. Among a vast number of approaches available in this field, this study mainly focuses on generating a smarter dataset through reduced feature set without compromising the number of records in it and then producing an approach which combines the strengths of classification and clustering for better prediction results. In this study it has been identified that individual features have distinct effect and that removing misclassified data can affect the overall results. Backward selection is adopted using Pearson correlation as a metric to produce smarter dataset with lesser attributes and better accuracy in prediction. After feature set selection, we have applied EMT (Ensemble Meta-Based Tree Model) classification on it to identify best performing classifiers from five families of classifiers. In hybrid approach, first the ensemble clustering is applied on smart dataset and then EMT classification is applied to reevaluate the un-clustered data, which gives a boost in performance and provides us an accuracy of 93%.

Details

Title
A Novel Hybrid Ensemble Clustering Technique for Student Performance Prediction
Author
Sanam Fida  VIAFID ORCID Logo  ; Nayyer Masood  VIAFID ORCID Logo  ; Tariq, Nirmal  VIAFID ORCID Logo  ; Qayyum, Faiza  VIAFID ORCID Logo 
Pages
777-798
Section
Research Article
Publication year
2022
Publication date
2022
Publisher
Pensoft Publishers
ISSN
0948695X
e-ISSN
09486968
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2830893127
Copyright
© 2022. This work is licensed under https://creativecommons.org/licenses/by-nd/4.0/ (the “License”). Notwithstanding the ProQuest Terms and conditions, you may use this content in accordance with the terms of the License.