Designing a Stunting Prediction Model Using Machine Learning to Support SDGs Achievement in Indonesia
DOI:
10.33395/sinkron.v9i4.15296Keywords:
Stunting, Machine Learning, Decision Tree, Random Forest, Sustainable Development GoalsAbstract
Stunting remains a major public health challenge in Indonesia, with national prevalence among children under five reaching 21.6% in 2022, according to the Ministry of Health. This condition, defined by the World Health Organization as a height-for-age less than -2 SD, is associated with long-term consequences including impaired cognitive development, reduced educational attainment, and diminished economic productivity. Addressing stunting is therefore critical to achieving Sustainable Development Goals (SDGs) related to hunger, health, and education. Despite multiple national initiatives, early identification of stunting risk is still limited by reliance on conventional, reactive surveillance methods. Recent advances in machine learning (ML) provide promising alternatives for proactive stunting prediction, with several studies reporting high predictive accuracy using ensemble methods, hybrid frameworks, and geographically weighted models. Building upon this evidence, the present study develops and evaluates ML models for stunting risk prediction using a large dataset of 10,000 records from North Sumatra, Indonesia. The dataset included three predictor variables—age, height, and weight—and a target variable, nutritional status (Normal, Stunted, Severely Stunted, Tall). Four algorithms were compared: K-Nearest Neighbors (KNN), Naïve Bayes, Decision Tree, and Random Forest. Performance was assessed using accuracy, precision, recall, F1-score, and ROC area, with 10-fold cross-validation ensuring robust estimation. Results demonstrated that Decision Tree (88.6% accuracy) and Random Forest (88.3% accuracy) outperformed KNN (84.7%) and Naïve Bayes (72%). ROC areas further confirmed the superiority of ensemble-based approaches, particularly Random Forest (0.979). Statistical significance was tested using McNemar’s test, revealing that Decision Tree and Random Forest achieved comparable performance (p = 0.651), both significantly outperforming KNN and Naïve Bayes (p < 0.05). This study contributes a context-specific evaluation of ML methods for stunting prediction in North Sumatra, emphasizing not only predictive accuracy but also interpretability to support health policy and program implementation. By bridging data-driven insights with actionable decision support, the proposed framework advances progress toward SDG-aligned strategies and provides a foundation for more targeted and preventive interventions in child nutrition and growth monitoring.
Downloads
References
Berti, C., & La Vecchia, A. (2023). Temporal trend of child stunting prevalence and Food and Nutritional Surveillance System. Jornal de Pediatria, 99(2), 99–100. https://doi.org/10.1016/j.jped.2022.10.001
Dewi, Y. S., Hastuti, S., & Fatekurohman, M. (2024). Analysis of stunting in East Java, Indonesia using random forest and geographically weighted random forest regression. Brazilian Journal of Biometrics, 42(3), 213–224. https://doi.org/10.28951/bjb.v42i3.679
Gustriansyah, R., Suhandi, N., Puspasari, S., & Sanmorino, A. (2024). Machine Learning Method to Predict the Toddlers’ Nutritional Status. JURNAL INFOTEL, 16(1), 32–43.
Hanieh, S., Braat, S., Simpson, J. A., Ha, T. T. T., Tran, T. D., Tuan, T., Fisher, J., & Biggs, B.-A. (2019). The Stunting Tool for Early Prevention: development and external validation of a novel tool to predict risk of stunting in children at 3 years of age. BMJ Global Health, 4, 1–12. https://doi.org/10.1136/bmjgh-2019-001801
Haque, A., Choudhury, N., Wahid, B. Z., Ahmed, S. M. T., Farzana, F. D., Ali, M., Naz, F., Siddiqua, T. J., Rahman, S. S., Faruque, A. S. G., & Ahmed, T. (2023). A predictive modelling approach to illustrate factors correlating with stunting among children aged 12 – 23 months : a cluster randomised pre- post study. BMJ Open, 13, 1–15. https://doi.org/10.1136/bmjopen-2022-067961
Hasdyna, N., Dinata, R. K., Rahmi, & Fajri, T. I. (2024). Hybrid Machine Learning for Stunting Prevalence: A Novel Comprehensive Approach to Its Classification, Prediction, and Clustering Optimization in Aceh, Indonesia. Informatics, 11(4). https://doi.org/10.3390/informatics11040089
Mulyani, A. T., Khairinisa, M. A., & Khatib, A. (2025). Understanding Stunting : Impact, Causes , and Strategy to Accelerate Stunting Reduction — A Narrative Review. Nutrients, 17.
Ndagijimana, S., Kabano, I. H., Masabo, E., & Ntaganda, J. M. (2023). Prediction of Stunting Among Under-5 Children in Rwanda Using Machine Learning Techniques. Journal of Preventive Medicine & Public Health, 56, 41–49.
Novalina, N., Tarigan, I. A. A., Kameela, F. K., & Rizkinia, M. (2025). Benchmarking machine learning algorithm for stunting risk prediction in Indonesia. Bulletin of Electrical Engineering and Informatics, 14(3), 2252–2263. https://doi.org/10.11591/eei.v14i3.8997
Prasetyo, E., & Nugroho, K. (2024). Optimasi Klasifikasi Data Stunting Melalui Ensemble Learning pada Label Multiclass dengan Imbalance Data. Techno.Com, 23(1), 1–10. https://doi.org/10.62411/tc.v23i1.9779
Purnomo, A. joko, & Rozaq, A. (2022). CLASSIFICATION OF STUNTING STATUS IN TODDLERS USING NAIVE BAYES METHOD IN THE CITY OF MADIUN BASED ON WEBSITE. Techno Nusa Mandiri: Journal of Computing and Information Technology, 19(2), 69–76.
Raiten, D. J., & Bremer, A. A. (2020). Exploring the Nutritional Ecology of Stunting : New Approaches to an Old Problem. Nutrients, 12(371).
Rao, B., Rashid, M., Hasan, M. G., & Thunga, G. (2025). Machine Learning in Predicting Child Malnutrition: A Meta-Analysis of Demographic and Health Surveys Data. International Journal of Environmental Research and Public Health, 22(3), 1–15. https://doi.org/10.3390/ijerph22030449
Shen, H., Zhao, H., & Jian, Y. (2023). Machine Learning Algorithms for Predicting Stunting among Under-Five Children in Papua New Guinea. Children, 10(1638).
Siramaneerat, I., Astutik, E., Agushybana, F., Bhumkittipich, P., & Lamprom, W. (2024). Examining determinants of stunting in Urban and Rural Indonesian: a multilevel analysis using the population-based Indonesian family life survey (IFLS). BMC Public Health, 24(1), 1–13. https://doi.org/10.1186/s12889-024-18824-z
Swastina, L., Rahmatullah, B., Saad, A., & Khan, H. (2024). A systematic review on research trends, datasets, algorithms, and frameworks of children’s nutritional status prediction. IAES International Journal of Artificial Intelligence (IJ-AI), 13(2), 1868–1877. https://doi.org/10.11591/ijai.v13.i2.pp1868-1877
Syahrial, S., Ilham, R., Asikin, Z. F., & Nurdin, S. S. I. (2022). Stunting Classification in Children’s Measurement Data Using Machine Learning Models. JOURNAL LA MULTIAPP, 03(02), 52–60. https://doi.org/10.37899/journallamultiapp.v3i2.614
Turjo, E. A., & Rahman, H. (2024). Assessing risk factors for malnutrition among women in Bangladesh and forecasting malnutrition using machine learning approaches. BMC Nutrition, 10(22), 1–25. https://doi.org/10.1186/s40795-023-00808-8
Widyawati, F., Suhito, H. P., Yassin, W., & Agus Santoso, H. (2025). Classification of Toddler Nutritional Status Using Support Vector Machine and Random Forest Techniques With Optimal Feature Selection. Jurnal Teknik Informatika (Jutif), 5(6), 1893–1904. https://doi.org/10.52436/1.jutif.2024.5.6.4162
Downloads
How to Cite
Issue
Section
License
Copyright (c) 2025 Mikha Sinaga, Fujiati, Darma Halawa

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.