Comparison Decision Tree and Logistic Regression Machine Learning Classification Algorithms to determine Covid-19

Authors

  • Artika Arista Universitas Pembangunan Nasional "Veteran" Jakarta

DOI:

10.33395/sinkron.v7i1.11243

Keywords:

Covid-19; Machine Learning; Classification Algorithms; Decision Tree; Logistic Regression

Abstract

Many people today are unsure whether they have COVID-19. The frequent fever, dry cough, and sore throat are all signs and symptoms of COVID-19. If a person has signs or symptoms of coronavirus disease 2019 (COVID-19), he/she should see the doctor or go to a clinic as soon as possible. As a result, it's vital to learn and comprehend the fundamental differences. COVID-19 can cause a wide range of symptoms. The experiments were carried out using two Machine Learning Classification Algorithms, namely Decision Tree (DT) and Logistic Regression (LR). Both algorithms were written and analyzed using the Python program in Jupyter Notebook 6.4.5. From the results obtained in the experiments of covid symptoms dataset, on average, the DT model has obtained the best cross-validation average and the testing performance average compared to the LR machine learning models. For cross-validation results, the DT model has achieved an accuracy of 98.0%. For performance testing, the DT model has achieved an accuracy of 98.0%. The LR has obtained the second-best result on the average of cross-validation performance and the testing results. For cross-validation results, the LR model has achieved an accuracy of 96.0%. For performance testing, the LR model has achieved an accuracy of 97.0%. Consequently, the DT for the COVID-19 symptoms dataset is outperforming the LR for cross-validation and testing results.

GS Cited Analysis

Downloads

Download data is not yet available.

References

Abdelminaam, D. S., Ismail, F. H., Taha, M., Taha, A., Houssein, E. H., & Nabil, A. (2021). CoAID-DEEP: An Optimized Intelligent Framework for Automated Detecting COVID-19 Misleading Information on Twitter. IEEE Access, 9, 27840–27867. https://doi.org/10.1109/ACCESS.2021.3058066

Banihashemi, S., Ding, G., & Wang, J. (2017). Developing a Hybrid Model of Prediction and Classification Algorithms for Building Energy Consumption. Energy Procedia, 110, 371–376. https://doi.org/10.1016/j.egypro.2017.03.155

Carleo, G., Cirac, I., Cranmer, K., Daudet, L., Schuld, M., Tishby, N., Vogt-Maranto, L., & Zdeborová, L. (2019). Machine learning and the physical sciences. Reviews of Modern Physics, 91(4). https://doi.org/10.1103/RevModPhys.91.045002

Charbuty, B., & Abdulazeez, A. (2021). Classification Based on Decision Tree Algorithm for Machine Learning. Journal of Applied Science and Technology Trends, 2(01), 20–28. https://doi.org/10.38094/jastt20165

hemanth hari. (2020). Symptoms and COVID Presence (May 2020 data) Kaggle. https://www.kaggle.com/hemanthhari/symptoms-and-covid-presence

Hermans, J. J. R., Groen, J., Zwets, E., Boxma-De Klerk, B. M., van Werkhoven, J. M., Ong, D. S. Y., Hanselaar, W. E. J. J., Waals-Prinzen, L., & Brown, V. (2020). Chest CT for triage during COVID-19 on the emergency department: myth or truth? Emergency Radiology, 27(6), 641–651. https://doi.org/10.1007/s10140-020-01821-1

Hillel, T., Bierlaire, M., Elshafie, M. Z. E. B., & Jin, Y. (2021). A systematic review of machine learning classification methodologies for modelling passenger mode choice. Journal of Choice Modelling, 38. https://doi.org/10.1016/j.jocm.2020.100221

Kunal Pahwa, & Neha Agarwal. (2019). Stock Market Analysis using Supervised Machine Learning. 2019 International Conference on Machine Learning, Big Data, Cloud and Parallel Computing (Com-IT-Con), India, 14th -16th Feb 2019.

Kwekha-Rashid, A. S., Abduljabbar, H. N., & Alhayani, B. (2021). Coronavirus disease (COVID-19) cases analysis using machine-learning applications. Applied Nanoscience (Switzerland). https://doi.org/10.1007/s13204-021-01868-7

Majumder, A. B., Gupta, S., Singh, D., & Majumder, S. (2021). An intelligent system for prediction of COVID-19 case using machine learning framework-logistic regression. Journal of Physics: Conference Series, 1797(1). https://doi.org/10.1088/1742-6596/1797/1/012011

Mashat, A. F., Fouad, M. M., Yu, P. S., & Gharib, T. F. (2012). A Decision Tree Classification Model for University Admission System. IJACSA) International Journal of Advanced Computer Science and Applications, 3(10). www.ijacsa.thesai.org

Mohsin Abdulazeez, A., Zeebaree, D., Abdulqader, D. M., & Zeebaree, D. Q. (2020). Machine Learning Supervised Algorithms of Gene Selection: A Review. 62. https://www.researchgate.net/publication/341119469

Molin, S., Jee, K., O’Reilly for Higher Education (Firm), & Safari, an O. M. Company. (2021). Hands-On Data Analysis with Pandas - Second Edition.

Nandhini, S., & Marseline, D. J. (2020, February 1). Performance Evaluation of Machine Learning Algorithms for Email Spam Detection. International Conference on Emerging Trends in Information Technology and Engineering, Ic-ETITE 2020. https://doi.org/10.1109/ic-ETITE47903.2020.312

Narayan, S. (2021). Allergies, Cold, Flu or COVID-19? How to Tell the Difference. Emersonhospital.Org. https://www.emersonhospital.org/articles/allergies-or-covid-19

Sathiyanarayanan, M. P., Sai, S. M., & Vinayagar, M. (2019). Identification of Breast Cancer Using The Decision Tree Algorithm. 2019 IEEE International Conference on System, Computation, Automation and Networking (ICSCAN).

Sharma, D., & Kumar, N. (2017). A Review on Machine Learning Algorithms, Tasks and Applications. In International Journal of Advanced Research in Computer Engineering & Technology (IJARCET) (Vol. 6, Issue 10, pp. 2278–1323).

Yan, L., Zhang, H. T., Goncalves, J., Xiao, Y., Wang, M., Guo, Y., Sun, C., Tang, X., Jin, L., Zhang, M., Huang, X., Xiao, Y., Cao, H., Chen, Y., Ren, T., Wang, F., Xiao, Y., Huang, S., Tan, X., … Yuan, Y. (2020). A machine learning-based model for survival prediction in patients with severe COVID-19 infection. MedRxiv. https://doi.org/10.1101/2020.02.27.20028027

Zou, Q., Qu, K., Luo, Y., Yin, D., Ju, Y., & Tang, H. (2018). Predicting Diabetes Mellitus With Machine Learning Techniques. Frontiers in Genetics, 9. https://doi.org/10.3389/fgene.2018.00515

Downloads


Crossmark Updates

How to Cite

Arista, A. . (2022). Comparison Decision Tree and Logistic Regression Machine Learning Classification Algorithms to determine Covid-19. Sinkron : Jurnal Dan Penelitian Teknik Informatika, 6(1), 59-65. https://doi.org/10.33395/sinkron.v7i1.11243