TY - GEN
T1 - Enhancing Student Performance Prediction Through Ensembles of Machine Learning Models and Explainable Artificial Intelligence
AU - Ray, Santosh
AU - Nawaz, Ali
AU - Ahmad, Amir
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Predicting student performance accurately is essential for personalizing education, allocating resources, and helping at-risk students. This study explores application of Machine Learning (ML) models and Explainable Artificial Intelligence (XAI) methods techniques such as Local Interpretable Modelagnostic Explanations (LIME) and SHapley Additive exPlanations (SHAP), to enhance the interpretability of predictions in educational settings. By utilizing student demographic and behavior data, we built ML models, specifically BaggingClassifier ensemble models by achieving an Accuracy of 0.95, ROC AUC of 0.97, and F1 Score of 0.95, to predict student performance and also compare their performance with other single models. Additionally, LIME and SHAP were applied on the best performing model to determine the most influencer features on the prediction. Specifically, our analysis revealed that StudentAbsenceDays and VisITedResources are most influencial features. The applied bagging ensemble method not only improved the accuracy of the predictions but also highlight the features contributed to the prediction, which can ultimately aid academicians to make better decisions for the students.
AB - Predicting student performance accurately is essential for personalizing education, allocating resources, and helping at-risk students. This study explores application of Machine Learning (ML) models and Explainable Artificial Intelligence (XAI) methods techniques such as Local Interpretable Modelagnostic Explanations (LIME) and SHapley Additive exPlanations (SHAP), to enhance the interpretability of predictions in educational settings. By utilizing student demographic and behavior data, we built ML models, specifically BaggingClassifier ensemble models by achieving an Accuracy of 0.95, ROC AUC of 0.97, and F1 Score of 0.95, to predict student performance and also compare their performance with other single models. Additionally, LIME and SHAP were applied on the best performing model to determine the most influencer features on the prediction. Specifically, our analysis revealed that StudentAbsenceDays and VisITedResources are most influencial features. The applied bagging ensemble method not only improved the accuracy of the predictions but also highlight the features contributed to the prediction, which can ultimately aid academicians to make better decisions for the students.
KW - E-Learning
KW - Ensemble learning
KW - Explainable Artificial
KW - Intelligence
KW - Machine learning
KW - Student performance prediction
UR - https://www.scopus.com/pages/publications/105010681429
UR - https://www.scopus.com/pages/publications/105010681429#tab=citedBy
U2 - 10.1109/ICIET66371.2025.11046309
DO - 10.1109/ICIET66371.2025.11046309
M3 - Conference contribution
AN - SCOPUS:105010681429
T3 - 2025 13th International Conference on Information and Education Technology, ICIET 2025
SP - 160
EP - 164
BT - 2025 13th International Conference on Information and Education Technology, ICIET 2025
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 13th International Conference on Information and Education Technology, ICIET 2025
Y2 - 18 April 2025 through 20 April 2025
ER -