Back to Search Start Over

Predicting students' academic performance by mining the educational data through machine learning-based classification model.

Authors :
Nayak, Padmalaya
Vaheed, Sk.
Gupta, Surbhi
Mohan, Neeraj
Source :
Education & Information Technologies; Nov2023, Vol. 28 Issue 11, p14611-14637, 27p
Publication Year :
2023

Abstract

Students' academic performance prediction is one of the most important applications of Educational Data Mining (EDM) that helps to improve the quality of the education process. The attainment of student outcomes in an Outcome-based Education (OBE) system adds invaluable rewards to facilitate corrective measures to the learning processes. Furthermore, the explosive increase of e-learning platforms generates a large volume of data that demands the extraction of useful information using up-to-date techniques. Keeping this view in mind and to check the impact of various features on student outcomes during online classes, we have analyzed two sets of datasets; the Kalboard 360 dataset (a larger dataset) that contains academic, demographic as well as behavioral features which have been observed and recorded during the classes held and a local Institute dataset that does not acquire behavioral features. To achieve this, we have selected a few machine learning algorithms such as Decision Tree (J48), Naïve Bayes (NB), Random Forest (RF), and Multilayer Perceptron (MLP) to classify the students, along with a few filter-based feature selection methods like Info gain, gain ratio, and correlation features have been applied to select the key attributes. Finally, we have fine-tuned the learning parameters of MLP called "Opt-MLP" to get an optimized output and compared it with other classification models. Our experimental results conclude that Opt-MLP proves its superiority over other classification models by predicting an accuracy of 87.14% without the feature selection (WOFS) and 90.74% accuracy with the feature selection (WFS) method for data set 1 and an accuracy of 79.37% without feature selection and 97.08% with feature selection for dataset 2. But, when the students' behavioral feature is considered along with other features, the RF model provides 100% accuracy justifying that students' behavior during class hours has a great impact on attaining the students' outcomes. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
13602357
Volume :
28
Issue :
11
Database :
Complementary Index
Journal :
Education & Information Technologies
Publication Type :
Academic Journal
Accession number :
173766188
Full Text :
https://doi.org/10.1007/s10639-023-11706-8