Comparison Decision Tree and Logistic Regression Machine Learning Classification Algorithms to determine Covid-19


  • Artika Arista Universitas Pembangunan Nasional "Veteran" Jakarta




Covid-19; Machine Learning; Classification Algorithms; Decision Tree; Logistic Regression


Many people today are unsure whether they have COVID-19. The frequent fever, dry cough, and sore throat are all signs and symptoms of COVID-19. If a person has signs or symptoms of coronavirus disease 2019 (COVID-19), he/she should see the doctor or go to a clinic as soon as possible. As a result, it's vital to learn and comprehend the fundamental differences. COVID-19 can cause a wide range of symptoms. The experiments were carried out using two Machine Learning Classification Algorithms, namely Decision Tree (DT) and Logistic Regression (LR). Both algorithms were written and analyzed using the Python program in Jupyter Notebook 6.4.5. From the results obtained in the experiments of covid symptoms dataset, on average, the DT model has obtained the best cross-validation average and the testing performance average compared to the LR machine learning models. For cross-validation results, the DT model has achieved an accuracy of 98.0%. For performance testing, the DT model has achieved an accuracy of 98.0%. The LR has obtained the second-best result on the average of cross-validation performance and the testing results. For cross-validation results, the LR model has achieved an accuracy of 96.0%. For performance testing, the LR model has achieved an accuracy of 97.0%. Consequently, the DT for the COVID-19 symptoms dataset is outperforming the LR for cross-validation and testing results.

GS Cited Analysis


Download data is not yet available.


Abdelminaam, D. S., Ismail, F. H., Taha, M., Taha, A., Houssein, E. H., & Nabil, A. (2021). CoAID-DEEP: An Optimized Intelligent Framework for Automated Detecting COVID-19 Misleading Information on Twitter. IEEE Access, 9, 27840–27867.

Banihashemi, S., Ding, G., & Wang, J. (2017). Developing a Hybrid Model of Prediction and Classification Algorithms for Building Energy Consumption. Energy Procedia, 110, 371–376.

Carleo, G., Cirac, I., Cranmer, K., Daudet, L., Schuld, M., Tishby, N., Vogt-Maranto, L., & Zdeborová, L. (2019). Machine learning and the physical sciences. Reviews of Modern Physics, 91(4).

Charbuty, B., & Abdulazeez, A. (2021). Classification Based on Decision Tree Algorithm for Machine Learning. Journal of Applied Science and Technology Trends, 2(01), 20–28.

hemanth hari. (2020). Symptoms and COVID Presence (May 2020 data) Kaggle.

Hermans, J. J. R., Groen, J., Zwets, E., Boxma-De Klerk, B. M., van Werkhoven, J. M., Ong, D. S. Y., Hanselaar, W. E. J. J., Waals-Prinzen, L., & Brown, V. (2020). Chest CT for triage during COVID-19 on the emergency department: myth or truth? Emergency Radiology, 27(6), 641–651.

Hillel, T., Bierlaire, M., Elshafie, M. Z. E. B., & Jin, Y. (2021). A systematic review of machine learning classification methodologies for modelling passenger mode choice. Journal of Choice Modelling, 38.

Kunal Pahwa, & Neha Agarwal. (2019). Stock Market Analysis using Supervised Machine Learning. 2019 International Conference on Machine Learning, Big Data, Cloud and Parallel Computing (Com-IT-Con), India, 14th -16th Feb 2019.

Kwekha-Rashid, A. S., Abduljabbar, H. N., & Alhayani, B. (2021). Coronavirus disease (COVID-19) cases analysis using machine-learning applications. Applied Nanoscience (Switzerland).

Majumder, A. B., Gupta, S., Singh, D., & Majumder, S. (2021). An intelligent system for prediction of COVID-19 case using machine learning framework-logistic regression. Journal of Physics: Conference Series, 1797(1).

Mashat, A. F., Fouad, M. M., Yu, P. S., & Gharib, T. F. (2012). A Decision Tree Classification Model for University Admission System. IJACSA) International Journal of Advanced Computer Science and Applications, 3(10).

Mohsin Abdulazeez, A., Zeebaree, D., Abdulqader, D. M., & Zeebaree, D. Q. (2020). Machine Learning Supervised Algorithms of Gene Selection: A Review. 62.

Molin, S., Jee, K., O’Reilly for Higher Education (Firm), & Safari, an O. M. Company. (2021). Hands-On Data Analysis with Pandas - Second Edition.

Nandhini, S., & Marseline, D. J. (2020, February 1). Performance Evaluation of Machine Learning Algorithms for Email Spam Detection. International Conference on Emerging Trends in Information Technology and Engineering, Ic-ETITE 2020.

Narayan, S. (2021). Allergies, Cold, Flu or COVID-19? How to Tell the Difference. Emersonhospital.Org.

Sathiyanarayanan, M. P., Sai, S. M., & Vinayagar, M. (2019). Identification of Breast Cancer Using The Decision Tree Algorithm. 2019 IEEE International Conference on System, Computation, Automation and Networking (ICSCAN).

Sharma, D., & Kumar, N. (2017). A Review on Machine Learning Algorithms, Tasks and Applications. In International Journal of Advanced Research in Computer Engineering & Technology (IJARCET) (Vol. 6, Issue 10, pp. 2278–1323).

Yan, L., Zhang, H. T., Goncalves, J., Xiao, Y., Wang, M., Guo, Y., Sun, C., Tang, X., Jin, L., Zhang, M., Huang, X., Xiao, Y., Cao, H., Chen, Y., Ren, T., Wang, F., Xiao, Y., Huang, S., Tan, X., … Yuan, Y. (2020). A machine learning-based model for survival prediction in patients with severe COVID-19 infection. MedRxiv.

Zou, Q., Qu, K., Luo, Y., Yin, D., Ju, Y., & Tang, H. (2018). Predicting Diabetes Mellitus With Machine Learning Techniques. Frontiers in Genetics, 9.


Crossmark Updates

How to Cite

Arista, A. . (2022). Comparison Decision Tree and Logistic Regression Machine Learning Classification Algorithms to determine Covid-19. Sinkron : Jurnal Dan Penelitian Teknik Informatika, 7(1), 59-65.