ICCK Transactions on Educational Data Mining | Volume 1, Issue 1: 36-43, 2025 | DOI: 10.62762/TEDM.2025.732573
Abstract
Predicting student dropout and academic success is important for higher education institutions for enhancing retention and deliver timely interventions. However, educational datasets often exhibit severe class imbalance, particularly when multiple academic outcomes (i.e., dropout, enrolled, and graduate) are considered simultaneously. Thus this study examines the effectiveness of three widely used over-sampling techniques (i.e., RandomOverSampler, synthetic minority oversampling technique, and adaptive synthetic sampling) for mitigating class imbalance and enhancing prediction performance. These sampling strategies are evaluated in combination with several machine learning classifiers to ass... More >