Analytical Study Forecasting Students Using Random Forest and Linear Regression Algorithms
DOI:
10.33395/sinkron.v8i4.13886Keywords:
Student Admission Forecasting, Random Forest Algorithm, Linear Regression, Historical Admission Data, Prediction AccuracyAbstract
Forecasting new student admissions essential for higher education institutions as it helps them plan for staffing and budgetary needs. Accurate predictions are difficult due to factors like economic conditions, government policies, and University competition. This study aims to analysis forecasting at Nasional university using Random Forest and Linear Regression algorithms. By examining historical admission data, the research seeks to identify key factors influencing the number of accepted students. Methodology involves collecting data from past admissions and applying both Random Forest and Linear Regression to compare their performance. Preliminary results, based on parameters such as application form purchases from 2015 to 2023, form prices, accreditation, and leading study programs, suggest that Random Forest offers more stable and realistic predictions. Analysis for MAE, MSE, RMSE, MAPE, MAD suggests that Linear Regression is more accurate for this data. predicts closer to actual values with lower overall errors. This makes Linear Regression preferable as it provides more reliable predictions with less deviation compared to Random Forest. Looking at admissions forecasts for the next 5 years, Random Forest predicts a steady decrease from 4224 in 2024 to 4129 in 2028. In contrast, Linear Regression suggests a stable trend with slight annual dips, going from 4954 in 2024 to 4941 in 2028. Therefore, Linear Regression is a more stable and realistic choice compared to Random Forest for this forecasting task in this research.
Downloads
References
Alamri, L. H., Almuslim, R. S., Alotibi, M. S., Alkadi, D. K., Ullah Khan, I., & Aslam, N. (2020). Predicting Student Academic Performance using Support Vector Machine and Random Forest. ACM International Conference Proceeding Series, PartF16898(July), 100–107. https://doi.org/10.1145/3446590.3446607
Armstrong, J., & Franke, G. (2001). Principles of forecasting. In Journal of Marketing Research. http://elibrary.ru/item.asp?id=4643952%5Cnpapers2://publication/uuid/2EF07596-D9F9-463E-8773-080AE5DD8ADC
Chen, M., & Liu, Z. (2024). Heliyon Predicting performance of students by optimizing tree components of random forest using genetic algorithm. Heliyon, 10(12), e32570. https://doi.org/10.1016/j.heliyon.2024.e32570
Du, Q., & Zhai, J. (2024). Measurement : Sensors Application of artificial intelligence Sensors based on random forest algorithm in financial recognition models. 33(June).
Gatera, A., Kuradusenge, M., Bajpai, G., Mikeka, C., & Shrivastava, S. (2023). Comparison of random forest and support vector machine regression models for forecasting road accidents. Scientific African, 21, e01739. https://doi.org/10.1016/j.sciaf.2023.e01739
KOC, T., & AKIN, P. (2022). Estimation of High School Entrance Examination Success Rates Using Machine Learning and Beta Regression Models. Journal of Intelligent Systems: Theory and Applications, 5(1), 9–15. https://doi.org/10.38016/jista.922663
Maulud, D., & Abdulazeez, A. M. (2020). A Review on Linear Regression Comprehensive in Machine Learning. Journal of Applied Science and Technology Trends, 1(2), 140–147. https://doi.org/10.38094/jastt1457
Meher, B. K., Singh, M., Birau, R., & Anand, A. (2024). Forecasting stock prices of fintech companies of India using random forest with high-frequency data. Journal of Open Innovation: Technology, Market, and Complexity, 10(1), 100180. https://doi.org/10.1016/j.joitmc.2023.100180
Narnaware, M. (2023). College Admission Prediction using Machine Learning. International Journal for Research in Applied Science and Engineering Technology, 11(11), 520–523. https://doi.org/10.22214/ijraset.2023.56539
Nirmala, I., Wijayanto, H., & Notodiputro, K. A. (2022). Prediction of Undergraduate Student’s Study Completion Status Using MissForest Imputation in Random Forest and XGBoost Models. ComTech: Computer, Mathematics and Engineering Applications, 13(1), 53–62. https://doi.org/10.21512/comtech.v13i1.7388
Petropoulos, F., Apiletti, D., Assimakopoulos, V., Babai, M. Z., Barrow, D. K., Ben Taieb, S., Bergmeir, C., Bessa, R. J., Bijak, J., Boylan, J. E., Browell, J., Carnevale, C., Castle, J. L., Cirillo, P., Clements, M. P., Cordeiro, C., Cyrino Oliveira, F. L., De Baets, S., Dokumentov, A., … Ziel, F. (2022). Forecasting: theory and practice. International Journal of Forecasting, 38(3), 705–871. https://doi.org/10.1016/j.ijforecast.2021.11.001
Savargiv, M., Masoumi, B., & Keyvanpour, M. R. (2021). A new random forest algorithm based on learning automata. Computational Intelligence and Neuroscience, 2021. https://doi.org/10.1155/2021/5572781
Tang, Q., Chen, Y., Yang, H., Liu, M., Xiao, H., Wang, S., Chen, H., & Raza Naqvi, S. (2021). Machine learning prediction of pyrolytic gas yield and compositions with feature reduction methods: Effects of pyrolysis conditions and biomass characteristics. Bioresource Technology, 339(July), 125581. https://doi.org/10.1016/j.biortech.2021.125581
Xu, W., Tu, J., Xu, N., & Liu, Z. (2024). Predicting daily heating energy consumption in residential buildings through integration of random forest model and meta-heuristic algorithms. Energy, 301(May), 131726. https://doi.org/10.1016/j.energy.2024.131726
Downloads
How to Cite
Issue
Section
License
Copyright (c) 2024 Muhammad Nurdin, Fauziah
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.