Abstract
Student dropout prediction is a critical challenge in higher education that requires accurate identification of at-risk students to enable timely interventions. This study presents EASE-Predict (Ensemble-SHAP Explainable Student Prediction), a comprehensive ensemble learning framework with SHAP-based explainable AI to predict student academic outcomes. We evaluated five machine learning algorithms (Random Forest, Gradient Boosting, Extra Trees, Logistic Regression, and SVM) and developed voting and stacking ensemble models on a dataset of 4,424 students with 36 features encompassing academic performance, socioeconomic factors, and demographic information. EASE-Predict achieved superior performance with 77.4% accuracy, representing a statistically significant improvement of 4.3 percentage points over the best individual model (Random Forest: 77.3%). The framework demonstrated exceptional class-specific discriminative performance with AUC scores of 0.930 for Graduate prediction (vs. 0.927 for best individual model), 0.821 for Enrolled students (vs. 0.794 for SVM), and 0.913 for Dropout identification (vs. 0.904 for individual models). Cross-validation results showed superior stability with the lowest performance variance (σ = 0.014 vs. σ = 0.0189 for Random Forest). SHAP explainability analysis quantified feature importance, revealing that second semester curricular units completion accounts for 60% of prediction influence, followed by tuition payment status (35%) and scholarship availability (12%). McNemar’s statistical tests confirmed that EASE-Predict’s performance improvements are statistically significant (p < 0.05) across all evaluation metrics. The framework maintains interpretability while achieving state-of-the-art accuracy, providing educational institutions with actionable insights for implementing evidence-based intervention strategies.
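For readers unfamiliar with the McNemar comparison mentioned above, the following is a minimal sketch of a two-sided exact McNemar test on paired predictions from two classifiers. It is not the authors' implementation: the function name `mcnemar_exact` and the toy labels are hypothetical, and the abstract does not say whether an exact or chi-square variant was used.

```python
from math import comb


def mcnemar_exact(y_true, pred_a, pred_b):
    """Two-sided exact McNemar test on paired predictions (illustrative helper).

    Returns (b, c, p_value), where b counts samples only model A classified
    correctly and c counts samples only model B classified correctly.
    """
    if not (len(y_true) == len(pred_a) == len(pred_b)):
        raise ValueError("all inputs must have the same length")

    # Count discordant pairs: exactly one of the two models is correct.
    b = sum(1 for t, a, m in zip(y_true, pred_a, pred_b) if a == t and m != t)
    c = sum(1 for t, a, m in zip(y_true, pred_a, pred_b) if a != t and m == t)

    n = b + c
    if n == 0:
        # The models never disagree in correctness, so there is no evidence
        # of a difference.
        return b, c, 1.0

    # Under the null hypothesis the discordant pairs split as Binomial(n, 0.5).
    k = min(b, c)
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return b, c, min(1.0, 2 * tail)


if __name__ == "__main__":
    # Toy data for illustration only; these are NOT results from the study.
    y_true = [1] * 10
    ensemble_pred = [1] * 10                        # all correct
    single_pred = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]    # 8 errors
    b, c, p = mcnemar_exact(y_true, ensemble_pred, single_pred)
    print(f"b={b}, c={c}, p={p:.4f}")
```

The test uses only the samples on which the two models disagree in correctness, which is why it suits comparing classifiers evaluated on the same held-out students.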
Keywords
student dropout prediction
ensemble learning
explainable AI
SHAP analysis
educational data mining
machine learning
Data Availability Statement
Data will be made available on request.
Funding
This work received no funding.
Conflicts of Interest
The authors declare no conflicts of interest.
Ethical Approval and Consent to Participate
Not applicable.
Cite This Article
APA Style
Liu, Z., Zhou, X., & Liu, Y. (2025). Student Dropout Prediction Using Ensemble Learning with SHAP-Based Explainable AI Analysis. Journal of Social Systems and Policy Analysis, 2(3), 111–132. https://doi.org/10.62762/JSSPA.2025.321501
Publisher's Note
ICCK stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and Permissions
Institute of Central Computation and Knowledge (ICCK) or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.