Comparison Analysis of C4.5 Algorithm and KNN Algorithm for Predicting Data of Non-Active Students at Prima Indonesia University

Authors

  • Jepri Banjarnahor Universitas Prima Indonesia
  • Ferman Zai Universitas Prima Indonesia
  • Janiali Sirait Universitas Prima Indonesia
  • Dicky Wijaya Nainggolan Universitas Prima Indonesia
  • Nissi Grace Dian Sihombing Universitas Prima Indonesia

DOI:

10.33395/sinkron.v8i4.12879

Keywords:

Analysis; Predicting; Algorithm C4.5; KNN; Non active students

Abstract

Education matters more than ever because universities must build their students' skills so that graduates can compete in a globalized era. Education can be obtained through both formal and informal channels, and knowledge is widely available now that information tools are evolving rapidly. Inactive students are students who do not participate in courses for up to two consecutive semesters. Such students are at risk of dropping out of university. Students who drop out are usually driven by economic factors, and interrupting the lecture process can lead to inactivity and administrative costs. This research therefore applies the C4.5 algorithm and the K-Nearest Neighbor (KNN) algorithm to predict inactive students at Universitas Prima Indonesia and compares the two. The research proceeds through data collection and data preprocessing, followed by data mining to obtain the final results. Testing compares the C4.5 and KNN algorithms using k-fold cross-validation. The evaluation compares measures derived from the confusion matrix (accuracy, precision, recall). The accuracy of each algorithm indicates how effective the technique is on the given dataset. The C4.5 decision tree achieves an accuracy of 99.12% and KNN achieves 99.14%. On this dataset, KNN is therefore slightly more accurate than C4.5 at predicting inactive students, by 0.02 percentage points.
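The evaluation procedure described in the abstract (k-fold cross-validation scored with confusion-matrix measures) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the two features (a GPA-like score and credits taken), the ten toy records, the choice of k = 3 neighbors and 5 folds, and all function names. It is not the authors' dataset, tooling, or implementation, and only KNN is implemented; a C4.5 classifier with the same `predict(train_X, train_y, x)` signature could be passed to `cross_validate` for a side-by-side comparison.

```python
import math
from collections import Counter


def knn_predict(train_X, train_y, x, k=3):
    """Classify x by majority vote among its k nearest training points (Euclidean)."""
    neighbours = sorted(
        (math.dist(row, x), label) for row, label in zip(train_X, train_y)
    )
    votes = Counter(label for _, label in neighbours[:k])
    return votes.most_common(1)[0][0]


def k_fold_indices(n, folds):
    """Yield (train_idx, test_idx) pairs; fold f tests every index i with i % folds == f."""
    for f in range(folds):
        test = [i for i in range(n) if i % folds == f]
        train = [i for i in range(n) if i % folds != f]
        yield train, test


def confusion_counts(y_true, y_pred, positive=1):
    """Return (tp, fp, fn, tn) for a binary problem."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    return tp, fp, fn, tn


def metrics(tp, fp, fn, tn):
    """Return (accuracy, precision, recall), using 0.0 when a denominator is zero."""
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return accuracy, precision, recall


def cross_validate(X, y, predict, folds=5):
    """Pool predictions over all folds, then score them with one confusion matrix."""
    y_true, y_pred = [], []
    for train, test in k_fold_indices(len(X), folds):
        train_X = [X[i] for i in train]
        train_y = [y[i] for i in train]
        for i in test:
            y_true.append(y[i])
            y_pred.append(predict(train_X, train_y, X[i]))
    return metrics(*confusion_counts(y_true, y_pred))


# Hypothetical toy records: (GPA-like score, credits taken); label 1 = inactive.
X = [(3.5, 20), (3.2, 18), (3.8, 22), (3.0, 19), (3.6, 21),
     (1.5, 4), (1.8, 6), (2.0, 3), (1.2, 5), (1.9, 2)]
y = [0, 0, 0, 0, 0, 1, 1, 1, 1, 1]

accuracy, precision, recall = cross_validate(X, y, knn_predict, folds=5)
print(f"accuracy={accuracy:.2f} precision={precision:.2f} recall={recall:.2f}")
```

Because the toy classes are cleanly separated, all three measures come out as 1.00 here; real student records would not be this clean. Pooling predictions across folds before computing the measures is one common convention; averaging per-fold scores is another, and the abstract does not say which one the study used.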

How to Cite

Banjarnahor, J., Zai, F., Sirait, J., Nainggolan, D. W., & Sihombing, N. G. D. (2023). Comparison Analysis of C4.5 Algorithm and KNN Algorithm for Predicting Data of Non-Active Students at Prima Indonesia University. Sinkron: Jurnal Dan Penelitian Teknik Informatika, 7(4), 2027-2035. https://doi.org/10.33395/sinkron.v8i4.12879
