From Prediction to Targeting: Comparative ML Models and Threshold-Based Re-Enrollment Segmentation in Higher Education Marketing

Authors

  • Akto Hariawan Informatics Department, Computer Sciences Faculty, Universitas Amikom Purwokerto, Indonesia
  • Arif Mu'amar Wahid Graduate School of Natural and Science Technology, Kanazawa University, Japan
  • Sultan Ananda Haikal Informatics Department, Computer Sciences Faculty, Universitas Amikom Purwokerto, Indonesia
  • Prayoga Pribadi Digital Business Department, Business and Social Sciences Faculty, Universitas Amikom Purwokerto, Indonesia

DOI:

10.33395/sinkron.v10i1.15703

Keywords:

Student Re-enrollment, Machine Learning, Logistic Regression, Random Forest, XGBoost

Abstract

Student re-enrollment is a critical strategic concern for higher education institutions, directly impacting financial stability, capacity planning, and the effectiveness of retention programs. This study develops a decision-support approach to predict non–re-enrollment using institutional records from a private university in Central Java. The dataset includes 2,673 student records across three academic years, which were expanded into 1,099 engineered features through a unified preprocessing pipeline involving missing-value imputation, scaling, and one-hot encoding. Model development utilized a fixed train–test split with 5-fold cross-validation. Results demonstrate that tuned tree ensembles significantly outperform the linear baseline. While Logistic Regression yielded limited discrimination (test ROC-AUC = 0.5602, F1 = 0.7010), tuned Random Forest improved classification quality (test ROC-AUC = 0.7571, F1 = 0.8052). Tuned XGBoost achieved the strongest ranking performance (test ROC-AUC = 0.7606) and was selected for deployment due to its superior risk-ordering capability. SHAP-based interpretation identifies parental income as the dominant driver of non–re-enrollment risk, followed by program-choice indicators and demographic variables. Finally, threshold analysis supports risk-tier segmentation, translating predicted probabilities into practical outreach policies aligned with institutional capacity constraints—addressing two underexplored gaps in applied re-enrollment prediction: rigorous cross-validated ensemble modeling and the integration of predictive scores into actionable marketing segmentation, and highlighting that ranking quality—not classification alone—is essential for operational targeting.

GS Cited Analysis

Downloads

Download data is not yet available.

References

Alhazmi, E., & Sheneamer, A. (2023). Early Predicting of Students Performance in Higher Education. Ieee Access. https://doi.org/10.1109/access.2023.3250702

Alwarthan, S., Aslam, N., & Khan, I. U. (2022). An Explainable Model for Identifying at-Risk Student at Higher Education. Ieee Access. https://doi.org/10.1109/access.2022.3211070

Amare, M. Y., & Šimonová, S. (2021). Global Challenges of Students Dropout: A Prediction Model Development Using Machine Learning Algorithms on Higher Education Datasets. SHS Web of Conferences, 129, 09001. https://doi.org/10.1051/shsconf/202112909001

Batool, S., & Liu, Z. (2021). Exploring the Relationships Between Socio-Economic Indicators and Student Enrollment in Higher Education Institutions of Pakistan. Plos One, 16(12), e0261577. https://doi.org/10.1371/journal.pone.0261577

Belostecinic, G. (2023). Online Educational Marketing as a Means to Increase the Attractiveness of the University and Its Image. Economica, 3(125), 7–27. https://doi.org/10.53486/econ.2023.125.007

Berges, A., Bravo Ramirez, P. H., Pau, I., Tejero, A., & García-Crespo, Á. (2021). A Framework for Strategic Intelligence Systems Applied to Education Management: A Pilot Study in the Community of Madrid. Ieee Access. https://doi.org/10.1109/access.2021.3081734

Bieganek, C., Aliferis, C. F., & Ma, S. (2022). Prediction of Clinical Trial Enrollment Rates. Plos One, 17(2), e0263193. https://doi.org/10.1371/journal.pone.0263193

Boumi, S., & Vela, A. (2021). Quantifying the Impact of Student Enrollment Patterns on Academic Success Using a Hidden Markov Model. Applied Sciences, 11(14), 6453. https://doi.org/10.3390/app11146453

Bowen, J. C. (2024). Market Orientation and Performance: A Comparative Study of Private and Public Universities in Kenya. Journal of Business and Entrepreneurship, 6(1), 1–12. https://doi.org/10.51317/ecjbms.v6i1.466

Budnikevych, I. (2023). Communication Component in the Formation of the Image of Higher Education Institutions Based on a Marketing Approach. Proceedings of Scientific Works of Cherkasy State Technological University Series Economic Sciences, 24(4), 5–16. https://doi.org/10.62660/ebcstu/4.2023.05

Charte, F. (2020). A Comprehensive and Didactic Review on Multilabel Learning Software Tools. Ieee Access. https://doi.org/10.1109/access.2020.2979787

Cheglakova, L., Devetyarova, I. P., Agalakova, O. S., & Kolesova, Y. A. (2020). Marketing Strategy of Quality Management During Reorganization of Regional Universities in the Process of Modernization of Education in the Conditions of Region’s Transition to Industry 4.0. International Journal for Quality Research, 14(1), 33–50. https://doi.org/10.24874/ijqr14.01-03

Chen, J. M. (2017). Three Levels of Push-Pull Dynamics Among Chinese International Students’ Decision to Study Abroad in the Canadian Context. Journal of International Students, 7(1), 113–135. https://doi.org/10.32674/jis.v7i1.248

Chen, Z., Cen, G., Wei, Y., & Li, Z. (2023). Student Performance Prediction Approach Based on Educational Data Mining. Ieee Access. https://doi.org/10.1109/access.2023.3335985

Couronné, R., Probst, P., & Boulesteix, A. (2018). Random Forest Versus Logistic Regression: A Large-Scale Benchmark Experiment. BMC Bioinformatics, 19(1). https://doi.org/10.1186/s12859-018-2264-5

Czibula, G., Ciubotariu, G., MAIER, M.-I., & Lisei, H. (2022). IntelliDaM: A Machine Learning-Based Framework for Enhancing the Performance of Decision-Making Processes. A Case Study for Educational Data Mining. Ieee Access. https://doi.org/10.1109/access.2022.3195531

Durkin, M., McKenna, S., & Cummins, D. (2012). Emotional Connections in Higher Education Marketing. International Journal of Educational Management, 26(2), 153–161. https://doi.org/10.1108/09513541211201960

Erpurini, W. (2024). The IMPACT OF TARGET MARKETS, UNIQUE RESOURCES &Amp; MARKET ATTRACTION ON DIGITAL MARKETING: THEIR SIGNIFICANCE FOR SELECTING POSTSECONDARY EDUCATION. Jurnal Riset Bisnis Dan Manajemen, 17(2), 29–38. https://doi.org/10.23969/jrbm.v17i2.12093

Esquivel, J. A., & Esquivel, J. A. (2021). A Machine Learning Based DSS in Predicting Undergraduate Freshmen Enrolment in a Philippine University. International Journal of Computer Trends and Technology, 69(5), 50–54. https://doi.org/10.14445/22312803/ijctt-v69i5p107

Fahim, A., Addae, B. A., Ofosu-Adarkwa, J., Tan, Q., & Bhatti, U. A. (2021). Industry 4.0 and Higher Education: An Evaluation of Barriers Affecting Master’s in Business Administration Enrolments Using a Grey Incidence Analysis. Ieee Access. https://doi.org/10.1109/access.2021.3082144

Farhood, H. (2024). Evaluating and Enhancing Artificial Intelligence Models for Predicting Student Learning Outcomes. Informatics, 11(3), 46. https://doi.org/10.3390/informatics11030046

Fernández‐García, A. J., Rodríguez-Echeverría, R., Preciado, J. C., Conejero, J. M., & Sánchez-Figueroa, F. (2020). Creating a Recommender System to Support Higher Education Students in the Subject Enrollment Decision. Ieee Access. https://doi.org/10.1109/access.2020.3031572

Fu, L. D., & Aliferis, C. F. (2010). Using Content-Based and Bibliometric Features for Machine Learning Models to Predict Citation Counts in the Biomedical Literature. Scientometrics, 85(1), 257–270. https://doi.org/10.1007/s11192-010-0160-5

Ghorbani, R., & Ghousi, R. (2020). Comparing Different Resampling Methods in Predicting Students’ Performance Using Machine Learning Techniques. Ieee Access. https://doi.org/10.1109/access.2020.2986809

Gnoh, H. Q. (2024). Enhancing Business Sustainability Through Technology-Enabled AI: Forecasting Student Data and Comparing Prediction Models for Higher Education Institutions (HEIs). PaperASIA, 40(2b), 48–58. https://doi.org/10.59953/paperasia.v40i2b.86

Jahir, A., Wahid, A. M., & Sufranto, T. T. (2024). Optimizing Higher Education Performance Through Data Integration Using the Zachman Framework: A Case Study on LAM Infokom Accreditation Criteria. Jurnal Nasional Teknologi Dan Sistem Informasi, 10(3), Article 3. https://doi.org/10.25077/TEKNOSI.v10i3.2024.201-215

Kevin Mario Laura-De La Cruz. (2023). A Study on Higher Education Student Satisfaction and Educational Marketing in Peru. Human Review International Humanities Review / Revista Internacional De Humanidades, 21(1), 1–10. https://doi.org/10.37467/revhuman.v21.5025

Khusnuliawati, H., & Putri, D. R. (2021). Hybrid Clustering Based on Multi-Criteria Segmentation for Higher Education Marketing. Telkomnika (Telecommunication Computing Electronics and Control), 19(5), 1498. https://doi.org/10.12928/telkomnika.v19i5.18965

Lan, Y. C., Tang, G., & Heitjan, D. F. (2018). Statistical Modeling and Prediction of Clinical Trial Recruitment. Statistics in Medicine, 38(6), 945–955. https://doi.org/10.1002/sim.8036

Li, X. (2022). Sequence Model and Prediction for Sustainable Enrollments in Chinese Universities. Sustainability, 15(1), 214. https://doi.org/10.3390/su15010214

Maphosa, M., Doorsamy, W., & Paul, B. (2023). Student Performance Patterns in Engineering at the University of Johannesburg: An Exploratory Data Analysis. Ieee Access. https://doi.org/10.1109/access.2023.3277225

Mbanga, S. (2023). Enhancing a Quality Teaching and Learning Environment in Large Classes in South African Universities: A Theoretical Exposition. Annals of Social Sciences & Management Studies, 9(5). https://doi.org/10.19080/asm.2023.09.555773

Melak, A., & Singh, S. (2021). Factors Affecting Women’s Choice of Learning Engineering and Technology Education in Ethiopia. Ieee Access. https://doi.org/10.1109/access.2021.3087548

Mengash, H. A. (2020). Using Data Mining Techniques to Predict Student Performance to Support Decision Making in University Admission Systems. Ieee Access. https://doi.org/10.1109/access.2020.2981905

Moogan, Y. J. (2011). Can a Higher Education Institution’s Marketing Strategy Improve the Student‐institution Match? International Journal of Educational Management, 25(6), 570–589. https://doi.org/10.1108/09513541111159068

Mushketova, N. S., Bydanova, E., & Rouet, G. (2018). National Strategy for Promotion of Russian Universities in the World Market of Education Services. International Journal of Educational Management, 32(1), 46–56. https://doi.org/10.1108/ijem-10-2016-0207

Pelima, L. R., Sukmana, Y., & Rosmansyah, Y. (2024). Predicting University Student Graduation Using Academic Performance and Machine Learning: A Systematic Literature Review. Ieee Access. https://doi.org/10.1109/access.2024.3361479

Plak, S., Cornelisz, I., Meeter, M., & Klaveren, C. v. (2021). Early Warning Systems for More Effective Student Counselling in Higher Education: Evidence From a Dutch Field Experiment. Higher Education Quarterly, 76(1), 131–152. https://doi.org/10.1111/hequ.12298

Qin, L., Shanks, K., Phillips, G. A., & Bernard, D. (2019). The Impact of Lengths of Time Series on the Accuracy of the ARIMA Forecasting. International Research in Higher Education, 4(3), 58. https://doi.org/10.5430/irhe.v4n3p58

Rafique, A., Khan, M. S., Jamal, M. H., Tasadduq, M., Rustam, F., Lee, E., Washington, P. B., & Ashraf, I. (2021). Integrating Learning Analytics and Collaborative Learning for Improving Student’s Academic Performance. Ieee Access. https://doi.org/10.1109/access.2021.3135309

Raghavendran, Ch. V., Vamsi, Ch. P. V., Veerraju, T., & Veluri, R. K. (2021). Predicting Student Admissions Rate Into University Using Machine Learning Models. 151–162. https://doi.org/10.1007/978-981-15-9516-5_13

Rajasekar, D., & Aithal, P. S. (2022). Study on Cadet Satisfaction Towards E-Marketing Promotion in Private Maritime University (Study With Reference to Rural Cadets, Chennai). International Journal of Case Studies in Business It and Education, 489–495. https://doi.org/10.47992/ijcsbe.2581.6942.0212

Rasool, A., Tao, R., Kamyab, M., & Hayat, S. (2020). GAWA–A Feature Selection Method for Hybrid Sentiment Classification. Ieee Access. https://doi.org/10.1109/access.2020.3030642

Roy, K., & Farid, D. Md. (2024). An Adaptive Feature Selection Algorithm for Student Performance Prediction. Ieee Access. https://doi.org/10.1109/access.2024.3406252

Sabino Parmezan, A. R., A. Souza, V. M., & A. Batista, G. E. (2022). Time Series Prediction via Similarity Search: Exploring Invariances, Distance Measures and Ensemble Functions. Ieee Access. https://doi.org/10.1109/access.2022.3192849

Setiawan, R., Putranto, A., Wihendro, Princes, E., Geraldina, I., Julianti, E., Safitri, J., & Pannen, P. (2024). Tech-Driven Transformation: Innovative Pricing Strategies for E-Learning. Ieee Access. https://doi.org/10.1109/access.2024.3392489

Singh, H. P., & Alhulail, H. (2022). Predicting Student-Teachers Dropout Risk and Early Identification: A Four-Step Logistic Regression Approach. Ieee Access. https://doi.org/10.1109/access.2022.3141992

Suleiman, J. (2021). Credibility, Ethics and Sustainable Marketing in Higher Education Institutions. International Journal of Innovative Research and Development, 10(12). https://doi.org/10.24940/ijird/2021/v10/i12/dec21016

Thomas, S. (2011). What Drives Student Loyalty in Universities: An Empirical Model From India. International Business Research, 4(2). https://doi.org/10.5539/ibr.v4n2p183

Trần, T. T. (2016). Enhancing Graduate Employability and the Need for University-Enterprise Collaboration. Journal of Teaching and Learning for Graduate Employability, 7(1), 58–71. https://doi.org/10.21153/jtlge2016vol7no1art598

Vaitsis, C., Hervatis, V., & Zary, N. (2016). Introduction to Big Data in Education and Its Contribution to the Quality Improvement Processes. https://doi.org/10.5772/63896

Wahid, A. M., Afuan, L., & Utomo, F. S. (2024). ENHANCING COLLABORATION DATA MANAGEMENT THROUGH DATA WAREHOUSE DESIGN: MEETING BAN-PT ACCREDITATION AND KERMA REPORTING REQUIREMENTS IN HIGHER EDUCATION. Jurnal Teknik Informatika (Jutif), 5(6), Article 6. https://doi.org/10.52436/1.jutif.2024.5.6.1747

Wahid, A. M., Hariguna, T., & Karyono, G. (2025). Optimization of Recommender Systems for Image-Based Website Themes Using Transfer Learning. Journal of Applied Data Sciences, 6(2), Article 2. https://doi.org/10.47738/jads.v6i2.671

Wan, S. J. (2024). A Denoising Time Window Algorithm for Optimizing LSTM Prediction. Ieee Access. https://doi.org/10.1109/access.2024.3404456

Wanjau, S. K., Okeyo, G., & Rimiru, R. (2016). Data Mining Model for Predicting Student Enrolment in STEM Courses in Higher Education Institutions. International Journal of Computer Applications Technology and Research, 5(11), 698–704. https://doi.org/10.7753/ijcatr0511.1004

Watkins, A., & Kaplan, A. (2018). Modeling in R and Weka for Course Enrollment Prediction. International Journal of Institutional Research and Management, 2(1), 1–17. https://doi.org/10.52731/ijirm.v2.i1.212

Wu, J., Lin, M.-S., & Tsai, C. (2023). A Predictive Model That Aligns Admission Offers With Student Enrollment Probability. Education Sciences, 13(5), 440. https://doi.org/10.3390/educsci13050440

Yang, Y., & Yang, Y. (2020). Hybrid Method for Short-Term Time Series Forecasting Based on EEMD. Ieee Access. https://doi.org/10.1109/access.2020.2983588

Yi, J. C., Kang-Yi, C. D., Burton, F., & Chen, H. D. (2018). Predictive Analytics Approach to Improve and Sustain College Students’ Non-Cognitive Skills and Their Educational Outcome. Sustainability, 10(11), 4012. https://doi.org/10.3390/su10114012

Yi, X., Si, W., Zhu, J., Sun, Z., Zhao, J., Xu, M., & Xu, S. (2021). Multi-Model Fusion Short-Term Load Forecasting Based on Random Forest Feature Selection and Hybrid Neural Network. Ieee Access. https://doi.org/10.1109/access.2021.3051337

Zhang, K., Li, Z., Wang, H., & Wang, H. (2015). Fuzzy Time Series Prediction Model and Application Based on Fuzzy Inverse. International Journal of Signal Processing Image Processing and Pattern Recognition, 8(10), 121–128. https://doi.org/10.14257/ijsip.2015.8.10.14

Downloads


Crossmark Updates

How to Cite

Hariawan, A., Wahid, A. M., Haikal, S. A., & Pribadi, P. (2026). From Prediction to Targeting: Comparative ML Models and Threshold-Based Re-Enrollment Segmentation in Higher Education Marketing. Sinkron : Jurnal Dan Penelitian Teknik Informatika, 10(1). https://doi.org/10.33395/sinkron.v10i1.15703