From Prediction to Targeting: Comparative ML Models and Threshold-Based Re-Enrollment Segmentation in Higher Education Marketing
DOI:
10.33395/sinkron.v10i1.15703Keywords:
Student Re-enrollment, Machine Learning, Logistic Regression, Random Forest, XGBoostAbstract
Student re-enrollment is a critical strategic concern for higher education institutions, directly impacting financial stability, capacity planning, and the effectiveness of retention programs. This study develops a decision-support approach to predict non–re-enrollment using institutional records from a private university in Central Java. The dataset includes 2,673 student records across three academic years, which were expanded into 1,099 engineered features through a unified preprocessing pipeline involving missing-value imputation, scaling, and one-hot encoding. Model development utilized a fixed train–test split with 5-fold cross-validation. Results demonstrate that tuned tree ensembles significantly outperform the linear baseline. While Logistic Regression yielded limited discrimination (test ROC-AUC = 0.5602, F1 = 0.7010), tuned Random Forest improved classification quality (test ROC-AUC = 0.7571, F1 = 0.8052). Tuned XGBoost achieved the strongest ranking performance (test ROC-AUC = 0.7606) and was selected for deployment due to its superior risk-ordering capability. SHAP-based interpretation identifies parental income as the dominant driver of non–re-enrollment risk, followed by program-choice indicators and demographic variables. Finally, threshold analysis supports risk-tier segmentation, translating predicted probabilities into practical outreach policies aligned with institutional capacity constraints—addressing two underexplored gaps in applied re-enrollment prediction: rigorous cross-validated ensemble modeling and the integration of predictive scores into actionable marketing segmentation, and highlighting that ranking quality—not classification alone—is essential for operational targeting.
Downloads
References
Alhazmi, E., & Sheneamer, A. (2023). Early Predicting of Students Performance in Higher Education. Ieee Access. https://doi.org/10.1109/access.2023.3250702
Alwarthan, S., Aslam, N., & Khan, I. U. (2022). An Explainable Model for Identifying at-Risk Student at Higher Education. Ieee Access. https://doi.org/10.1109/access.2022.3211070
Amare, M. Y., & Šimonová, S. (2021). Global Challenges of Students Dropout: A Prediction Model Development Using Machine Learning Algorithms on Higher Education Datasets. SHS Web of Conferences, 129, 09001. https://doi.org/10.1051/shsconf/202112909001
Batool, S., & Liu, Z. (2021). Exploring the Relationships Between Socio-Economic Indicators and Student Enrollment in Higher Education Institutions of Pakistan. Plos One, 16(12), e0261577. https://doi.org/10.1371/journal.pone.0261577
Belostecinic, G. (2023). Online Educational Marketing as a Means to Increase the Attractiveness of the University and Its Image. Economica, 3(125), 7–27. https://doi.org/10.53486/econ.2023.125.007
Berges, A., Bravo Ramirez, P. H., Pau, I., Tejero, A., & García-Crespo, Á. (2021). A Framework for Strategic Intelligence Systems Applied to Education Management: A Pilot Study in the Community of Madrid. Ieee Access. https://doi.org/10.1109/access.2021.3081734
Bieganek, C., Aliferis, C. F., & Ma, S. (2022). Prediction of Clinical Trial Enrollment Rates. Plos One, 17(2), e0263193. https://doi.org/10.1371/journal.pone.0263193
Boumi, S., & Vela, A. (2021). Quantifying the Impact of Student Enrollment Patterns on Academic Success Using a Hidden Markov Model. Applied Sciences, 11(14), 6453. https://doi.org/10.3390/app11146453
Bowen, J. C. (2024). Market Orientation and Performance: A Comparative Study of Private and Public Universities in Kenya. Journal of Business and Entrepreneurship, 6(1), 1–12. https://doi.org/10.51317/ecjbms.v6i1.466
Budnikevych, I. (2023). Communication Component in the Formation of the Image of Higher Education Institutions Based on a Marketing Approach. Proceedings of Scientific Works of Cherkasy State Technological University Series Economic Sciences, 24(4), 5–16. https://doi.org/10.62660/ebcstu/4.2023.05
Charte, F. (2020). A Comprehensive and Didactic Review on Multilabel Learning Software Tools. Ieee Access. https://doi.org/10.1109/access.2020.2979787
Cheglakova, L., Devetyarova, I. P., Agalakova, O. S., & Kolesova, Y. A. (2020). Marketing Strategy of Quality Management During Reorganization of Regional Universities in the Process of Modernization of Education in the Conditions of Region’s Transition to Industry 4.0. International Journal for Quality Research, 14(1), 33–50. https://doi.org/10.24874/ijqr14.01-03
Chen, J. M. (2017). Three Levels of Push-Pull Dynamics Among Chinese International Students’ Decision to Study Abroad in the Canadian Context. Journal of International Students, 7(1), 113–135. https://doi.org/10.32674/jis.v7i1.248
Chen, Z., Cen, G., Wei, Y., & Li, Z. (2023). Student Performance Prediction Approach Based on Educational Data Mining. Ieee Access. https://doi.org/10.1109/access.2023.3335985
Couronné, R., Probst, P., & Boulesteix, A. (2018). Random Forest Versus Logistic Regression: A Large-Scale Benchmark Experiment. BMC Bioinformatics, 19(1). https://doi.org/10.1186/s12859-018-2264-5
Czibula, G., Ciubotariu, G., MAIER, M.-I., & Lisei, H. (2022). IntelliDaM: A Machine Learning-Based Framework for Enhancing the Performance of Decision-Making Processes. A Case Study for Educational Data Mining. Ieee Access. https://doi.org/10.1109/access.2022.3195531
Durkin, M., McKenna, S., & Cummins, D. (2012). Emotional Connections in Higher Education Marketing. International Journal of Educational Management, 26(2), 153–161. https://doi.org/10.1108/09513541211201960
Erpurini, W. (2024). The IMPACT OF TARGET MARKETS, UNIQUE RESOURCES &Amp; MARKET ATTRACTION ON DIGITAL MARKETING: THEIR SIGNIFICANCE FOR SELECTING POSTSECONDARY EDUCATION. Jurnal Riset Bisnis Dan Manajemen, 17(2), 29–38. https://doi.org/10.23969/jrbm.v17i2.12093
Esquivel, J. A., & Esquivel, J. A. (2021). A Machine Learning Based DSS in Predicting Undergraduate Freshmen Enrolment in a Philippine University. International Journal of Computer Trends and Technology, 69(5), 50–54. https://doi.org/10.14445/22312803/ijctt-v69i5p107
Fahim, A., Addae, B. A., Ofosu-Adarkwa, J., Tan, Q., & Bhatti, U. A. (2021). Industry 4.0 and Higher Education: An Evaluation of Barriers Affecting Master’s in Business Administration Enrolments Using a Grey Incidence Analysis. Ieee Access. https://doi.org/10.1109/access.2021.3082144
Farhood, H. (2024). Evaluating and Enhancing Artificial Intelligence Models for Predicting Student Learning Outcomes. Informatics, 11(3), 46. https://doi.org/10.3390/informatics11030046
Fernández‐García, A. J., Rodríguez-Echeverría, R., Preciado, J. C., Conejero, J. M., & Sánchez-Figueroa, F. (2020). Creating a Recommender System to Support Higher Education Students in the Subject Enrollment Decision. Ieee Access. https://doi.org/10.1109/access.2020.3031572
Fu, L. D., & Aliferis, C. F. (2010). Using Content-Based and Bibliometric Features for Machine Learning Models to Predict Citation Counts in the Biomedical Literature. Scientometrics, 85(1), 257–270. https://doi.org/10.1007/s11192-010-0160-5
Ghorbani, R., & Ghousi, R. (2020). Comparing Different Resampling Methods in Predicting Students’ Performance Using Machine Learning Techniques. Ieee Access. https://doi.org/10.1109/access.2020.2986809
Gnoh, H. Q. (2024). Enhancing Business Sustainability Through Technology-Enabled AI: Forecasting Student Data and Comparing Prediction Models for Higher Education Institutions (HEIs). PaperASIA, 40(2b), 48–58. https://doi.org/10.59953/paperasia.v40i2b.86
Jahir, A., Wahid, A. M., & Sufranto, T. T. (2024). Optimizing Higher Education Performance Through Data Integration Using the Zachman Framework: A Case Study on LAM Infokom Accreditation Criteria. Jurnal Nasional Teknologi Dan Sistem Informasi, 10(3), Article 3. https://doi.org/10.25077/TEKNOSI.v10i3.2024.201-215
Kevin Mario Laura-De La Cruz. (2023). A Study on Higher Education Student Satisfaction and Educational Marketing in Peru. Human Review International Humanities Review / Revista Internacional De Humanidades, 21(1), 1–10. https://doi.org/10.37467/revhuman.v21.5025
Khusnuliawati, H., & Putri, D. R. (2021). Hybrid Clustering Based on Multi-Criteria Segmentation for Higher Education Marketing. Telkomnika (Telecommunication Computing Electronics and Control), 19(5), 1498. https://doi.org/10.12928/telkomnika.v19i5.18965
Lan, Y. C., Tang, G., & Heitjan, D. F. (2018). Statistical Modeling and Prediction of Clinical Trial Recruitment. Statistics in Medicine, 38(6), 945–955. https://doi.org/10.1002/sim.8036
Li, X. (2022). Sequence Model and Prediction for Sustainable Enrollments in Chinese Universities. Sustainability, 15(1), 214. https://doi.org/10.3390/su15010214
Maphosa, M., Doorsamy, W., & Paul, B. (2023). Student Performance Patterns in Engineering at the University of Johannesburg: An Exploratory Data Analysis. Ieee Access. https://doi.org/10.1109/access.2023.3277225
Mbanga, S. (2023). Enhancing a Quality Teaching and Learning Environment in Large Classes in South African Universities: A Theoretical Exposition. Annals of Social Sciences & Management Studies, 9(5). https://doi.org/10.19080/asm.2023.09.555773
Melak, A., & Singh, S. (2021). Factors Affecting Women’s Choice of Learning Engineering and Technology Education in Ethiopia. Ieee Access. https://doi.org/10.1109/access.2021.3087548
Mengash, H. A. (2020). Using Data Mining Techniques to Predict Student Performance to Support Decision Making in University Admission Systems. Ieee Access. https://doi.org/10.1109/access.2020.2981905
Moogan, Y. J. (2011). Can a Higher Education Institution’s Marketing Strategy Improve the Student‐institution Match? International Journal of Educational Management, 25(6), 570–589. https://doi.org/10.1108/09513541111159068
Mushketova, N. S., Bydanova, E., & Rouet, G. (2018). National Strategy for Promotion of Russian Universities in the World Market of Education Services. International Journal of Educational Management, 32(1), 46–56. https://doi.org/10.1108/ijem-10-2016-0207
Pelima, L. R., Sukmana, Y., & Rosmansyah, Y. (2024). Predicting University Student Graduation Using Academic Performance and Machine Learning: A Systematic Literature Review. Ieee Access. https://doi.org/10.1109/access.2024.3361479
Plak, S., Cornelisz, I., Meeter, M., & Klaveren, C. v. (2021). Early Warning Systems for More Effective Student Counselling in Higher Education: Evidence From a Dutch Field Experiment. Higher Education Quarterly, 76(1), 131–152. https://doi.org/10.1111/hequ.12298
Qin, L., Shanks, K., Phillips, G. A., & Bernard, D. (2019). The Impact of Lengths of Time Series on the Accuracy of the ARIMA Forecasting. International Research in Higher Education, 4(3), 58. https://doi.org/10.5430/irhe.v4n3p58
Rafique, A., Khan, M. S., Jamal, M. H., Tasadduq, M., Rustam, F., Lee, E., Washington, P. B., & Ashraf, I. (2021). Integrating Learning Analytics and Collaborative Learning for Improving Student’s Academic Performance. Ieee Access. https://doi.org/10.1109/access.2021.3135309
Raghavendran, Ch. V., Vamsi, Ch. P. V., Veerraju, T., & Veluri, R. K. (2021). Predicting Student Admissions Rate Into University Using Machine Learning Models. 151–162. https://doi.org/10.1007/978-981-15-9516-5_13
Rajasekar, D., & Aithal, P. S. (2022). Study on Cadet Satisfaction Towards E-Marketing Promotion in Private Maritime University (Study With Reference to Rural Cadets, Chennai). International Journal of Case Studies in Business It and Education, 489–495. https://doi.org/10.47992/ijcsbe.2581.6942.0212
Rasool, A., Tao, R., Kamyab, M., & Hayat, S. (2020). GAWA–A Feature Selection Method for Hybrid Sentiment Classification. Ieee Access. https://doi.org/10.1109/access.2020.3030642
Roy, K., & Farid, D. Md. (2024). An Adaptive Feature Selection Algorithm for Student Performance Prediction. Ieee Access. https://doi.org/10.1109/access.2024.3406252
Sabino Parmezan, A. R., A. Souza, V. M., & A. Batista, G. E. (2022). Time Series Prediction via Similarity Search: Exploring Invariances, Distance Measures and Ensemble Functions. Ieee Access. https://doi.org/10.1109/access.2022.3192849
Setiawan, R., Putranto, A., Wihendro, Princes, E., Geraldina, I., Julianti, E., Safitri, J., & Pannen, P. (2024). Tech-Driven Transformation: Innovative Pricing Strategies for E-Learning. Ieee Access. https://doi.org/10.1109/access.2024.3392489
Singh, H. P., & Alhulail, H. (2022). Predicting Student-Teachers Dropout Risk and Early Identification: A Four-Step Logistic Regression Approach. Ieee Access. https://doi.org/10.1109/access.2022.3141992
Suleiman, J. (2021). Credibility, Ethics and Sustainable Marketing in Higher Education Institutions. International Journal of Innovative Research and Development, 10(12). https://doi.org/10.24940/ijird/2021/v10/i12/dec21016
Thomas, S. (2011). What Drives Student Loyalty in Universities: An Empirical Model From India. International Business Research, 4(2). https://doi.org/10.5539/ibr.v4n2p183
Trần, T. T. (2016). Enhancing Graduate Employability and the Need for University-Enterprise Collaboration. Journal of Teaching and Learning for Graduate Employability, 7(1), 58–71. https://doi.org/10.21153/jtlge2016vol7no1art598
Vaitsis, C., Hervatis, V., & Zary, N. (2016). Introduction to Big Data in Education and Its Contribution to the Quality Improvement Processes. https://doi.org/10.5772/63896
Wahid, A. M., Afuan, L., & Utomo, F. S. (2024). ENHANCING COLLABORATION DATA MANAGEMENT THROUGH DATA WAREHOUSE DESIGN: MEETING BAN-PT ACCREDITATION AND KERMA REPORTING REQUIREMENTS IN HIGHER EDUCATION. Jurnal Teknik Informatika (Jutif), 5(6), Article 6. https://doi.org/10.52436/1.jutif.2024.5.6.1747
Wahid, A. M., Hariguna, T., & Karyono, G. (2025). Optimization of Recommender Systems for Image-Based Website Themes Using Transfer Learning. Journal of Applied Data Sciences, 6(2), Article 2. https://doi.org/10.47738/jads.v6i2.671
Wan, S. J. (2024). A Denoising Time Window Algorithm for Optimizing LSTM Prediction. Ieee Access. https://doi.org/10.1109/access.2024.3404456
Wanjau, S. K., Okeyo, G., & Rimiru, R. (2016). Data Mining Model for Predicting Student Enrolment in STEM Courses in Higher Education Institutions. International Journal of Computer Applications Technology and Research, 5(11), 698–704. https://doi.org/10.7753/ijcatr0511.1004
Watkins, A., & Kaplan, A. (2018). Modeling in R and Weka for Course Enrollment Prediction. International Journal of Institutional Research and Management, 2(1), 1–17. https://doi.org/10.52731/ijirm.v2.i1.212
Wu, J., Lin, M.-S., & Tsai, C. (2023). A Predictive Model That Aligns Admission Offers With Student Enrollment Probability. Education Sciences, 13(5), 440. https://doi.org/10.3390/educsci13050440
Yang, Y., & Yang, Y. (2020). Hybrid Method for Short-Term Time Series Forecasting Based on EEMD. Ieee Access. https://doi.org/10.1109/access.2020.2983588
Yi, J. C., Kang-Yi, C. D., Burton, F., & Chen, H. D. (2018). Predictive Analytics Approach to Improve and Sustain College Students’ Non-Cognitive Skills and Their Educational Outcome. Sustainability, 10(11), 4012. https://doi.org/10.3390/su10114012
Yi, X., Si, W., Zhu, J., Sun, Z., Zhao, J., Xu, M., & Xu, S. (2021). Multi-Model Fusion Short-Term Load Forecasting Based on Random Forest Feature Selection and Hybrid Neural Network. Ieee Access. https://doi.org/10.1109/access.2021.3051337
Zhang, K., Li, Z., Wang, H., & Wang, H. (2015). Fuzzy Time Series Prediction Model and Application Based on Fuzzy Inverse. International Journal of Signal Processing Image Processing and Pattern Recognition, 8(10), 121–128. https://doi.org/10.14257/ijsip.2015.8.10.14
Downloads
How to Cite
Issue
Section
License
Copyright (c) 2026 Akto Hariawan, Arif Mu'amar Wahid, Sultan Ananda Haikal, Prayoga Pribadi

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.






















Moraref
PKP Index
Indonesia OneSearch
OCLC Worldcat
Index Copernicus
Scilit
