Analytical Study Forecasting Students Using Random Forest and Linear Regression Algorithms

Authors

  • Muhammad Nurdin Nasional University, Indonesia
  • Fauziah Nasional University, Indonesia

DOI:

10.33395/sinkron.v8i4.13886

Keywords:

Student Admission Forecasting, Random Forest Algorithm, Linear Regression, Historical Admission Data, Prediction Accuracy

Abstract

Forecasting new student admissions essential for higher education institutions as it helps them plan for staffing and budgetary needs. Accurate predictions are difficult due to factors like economic conditions, government policies, and University competition. This study aims to analysis forecasting at Nasional university using Random Forest and Linear Regression algorithms. By examining historical admission data, the research seeks to identify key factors influencing the number of accepted students. Methodology involves collecting data from past admissions and applying both Random Forest and Linear Regression to compare their performance. Preliminary results, based on parameters such as application form purchases from 2015 to 2023, form prices, accreditation, and leading study programs, suggest that Random Forest offers more stable and realistic predictions. Analysis for MAE, MSE, RMSE, MAPE, MAD suggests that Linear Regression is more accurate for this data. predicts closer to actual values with lower overall errors. This makes Linear Regression preferable as it provides more reliable predictions with less deviation compared to Random Forest. Looking at admissions forecasts for the next 5 years, Random Forest predicts a steady decrease from 4224 in 2024 to 4129 in 2028. In contrast, Linear Regression suggests a stable trend with slight annual dips, going from 4954 in 2024 to 4941 in 2028. Therefore, Linear Regression is a more stable and realistic choice compared to Random Forest for this forecasting task in this research.

GS Cited Analysis

Downloads

Download data is not yet available.

References

Alamri, L. H., Almuslim, R. S., Alotibi, M. S., Alkadi, D. K., Ullah Khan, I., & Aslam, N. (2020). Predicting Student Academic Performance using Support Vector Machine and Random Forest. ACM International Conference Proceeding Series, PartF16898(July), 100–107. https://doi.org/10.1145/3446590.3446607

Armstrong, J., & Franke, G. (2001). Principles of forecasting. In Journal of Marketing Research. http://elibrary.ru/item.asp?id=4643952%5Cnpapers2://publication/uuid/2EF07596-D9F9-463E-8773-080AE5DD8ADC

Chen, M., & Liu, Z. (2024). Heliyon Predicting performance of students by optimizing tree components of random forest using genetic algorithm. Heliyon, 10(12), e32570. https://doi.org/10.1016/j.heliyon.2024.e32570

Du, Q., & Zhai, J. (2024). Measurement : Sensors Application of artificial intelligence Sensors based on random forest algorithm in financial recognition models. 33(June).

Gatera, A., Kuradusenge, M., Bajpai, G., Mikeka, C., & Shrivastava, S. (2023). Comparison of random forest and support vector machine regression models for forecasting road accidents. Scientific African, 21, e01739. https://doi.org/10.1016/j.sciaf.2023.e01739

KOC, T., & AKIN, P. (2022). Estimation of High School Entrance Examination Success Rates Using Machine Learning and Beta Regression Models. Journal of Intelligent Systems: Theory and Applications, 5(1), 9–15. https://doi.org/10.38016/jista.922663

Maulud, D., & Abdulazeez, A. M. (2020). A Review on Linear Regression Comprehensive in Machine Learning. Journal of Applied Science and Technology Trends, 1(2), 140–147. https://doi.org/10.38094/jastt1457

Meher, B. K., Singh, M., Birau, R., & Anand, A. (2024). Forecasting stock prices of fintech companies of India using random forest with high-frequency data. Journal of Open Innovation: Technology, Market, and Complexity, 10(1), 100180. https://doi.org/10.1016/j.joitmc.2023.100180

Narnaware, M. (2023). College Admission Prediction using Machine Learning. International Journal for Research in Applied Science and Engineering Technology, 11(11), 520–523. https://doi.org/10.22214/ijraset.2023.56539

Nirmala, I., Wijayanto, H., & Notodiputro, K. A. (2022). Prediction of Undergraduate Student’s Study Completion Status Using MissForest Imputation in Random Forest and XGBoost Models. ComTech: Computer, Mathematics and Engineering Applications, 13(1), 53–62. https://doi.org/10.21512/comtech.v13i1.7388

Petropoulos, F., Apiletti, D., Assimakopoulos, V., Babai, M. Z., Barrow, D. K., Ben Taieb, S., Bergmeir, C., Bessa, R. J., Bijak, J., Boylan, J. E., Browell, J., Carnevale, C., Castle, J. L., Cirillo, P., Clements, M. P., Cordeiro, C., Cyrino Oliveira, F. L., De Baets, S., Dokumentov, A., … Ziel, F. (2022). Forecasting: theory and practice. International Journal of Forecasting, 38(3), 705–871. https://doi.org/10.1016/j.ijforecast.2021.11.001

Savargiv, M., Masoumi, B., & Keyvanpour, M. R. (2021). A new random forest algorithm based on learning automata. Computational Intelligence and Neuroscience, 2021. https://doi.org/10.1155/2021/5572781

Tang, Q., Chen, Y., Yang, H., Liu, M., Xiao, H., Wang, S., Chen, H., & Raza Naqvi, S. (2021). Machine learning prediction of pyrolytic gas yield and compositions with feature reduction methods: Effects of pyrolysis conditions and biomass characteristics. Bioresource Technology, 339(July), 125581. https://doi.org/10.1016/j.biortech.2021.125581

Xu, W., Tu, J., Xu, N., & Liu, Z. (2024). Predicting daily heating energy consumption in residential buildings through integration of random forest model and meta-heuristic algorithms. Energy, 301(May), 131726. https://doi.org/10.1016/j.energy.2024.131726

Downloads


Crossmark Updates

How to Cite

Nurdin, M., & Fauziah, F. (2024). Analytical Study Forecasting Students Using Random Forest and Linear Regression Algorithms. Sinkron : Jurnal Dan Penelitian Teknik Informatika, 8(4), 2369-2378. https://doi.org/10.33395/sinkron.v8i4.13886