Comparison Decision Tree and Logistic Regression Machine Learning Classification Algorithms to determine Covid-19
Keywords:Covid-19; Machine Learning; Classification Algorithms; Decision Tree; Logistic Regression
Many people today are unsure whether they have COVID-19. The frequent fever, dry cough, and sore throat are all signs and symptoms of COVID-19. If a person has signs or symptoms of coronavirus disease 2019 (COVID-19), he/she should see the doctor or go to a clinic as soon as possible. As a result, it's vital to learn and comprehend the fundamental differences. COVID-19 can cause a wide range of symptoms. The experiments were carried out using two Machine Learning Classification Algorithms, namely Decision Tree (DT) and Logistic Regression (LR). Both algorithms were written and analyzed using the Python program in Jupyter Notebook 6.4.5. From the results obtained in the experiments of covid symptoms dataset, on average, the DT model has obtained the best cross-validation average and the testing performance average compared to the LR machine learning models. For cross-validation results, the DT model has achieved an accuracy of 98.0%. For performance testing, the DT model has achieved an accuracy of 98.0%. The LR has obtained the second-best result on the average of cross-validation performance and the testing results. For cross-validation results, the LR model has achieved an accuracy of 96.0%. For performance testing, the LR model has achieved an accuracy of 97.0%. Consequently, the DT for the COVID-19 symptoms dataset is outperforming the LR for cross-validation and testing results.
Abdelminaam, D. S., Ismail, F. H., Taha, M., Taha, A., Houssein, E. H., & Nabil, A. (2021). CoAID-DEEP: An Optimized Intelligent Framework for Automated Detecting COVID-19 Misleading Information on Twitter. IEEE Access, 9, 27840–27867. https://doi.org/10.1109/ACCESS.2021.3058066
Banihashemi, S., Ding, G., & Wang, J. (2017). Developing a Hybrid Model of Prediction and Classification Algorithms for Building Energy Consumption. Energy Procedia, 110, 371–376. https://doi.org/10.1016/j.egypro.2017.03.155
Carleo, G., Cirac, I., Cranmer, K., Daudet, L., Schuld, M., Tishby, N., Vogt-Maranto, L., & Zdeborová, L. (2019). Machine learning and the physical sciences. Reviews of Modern Physics, 91(4). https://doi.org/10.1103/RevModPhys.91.045002
Charbuty, B., & Abdulazeez, A. (2021). Classification Based on Decision Tree Algorithm for Machine Learning. Journal of Applied Science and Technology Trends, 2(01), 20–28. https://doi.org/10.38094/jastt20165
hemanth hari. (2020). Symptoms and COVID Presence (May 2020 data) Kaggle. https://www.kaggle.com/hemanthhari/symptoms-and-covid-presence
Hermans, J. J. R., Groen, J., Zwets, E., Boxma-De Klerk, B. M., van Werkhoven, J. M., Ong, D. S. Y., Hanselaar, W. E. J. J., Waals-Prinzen, L., & Brown, V. (2020). Chest CT for triage during COVID-19 on the emergency department: myth or truth? Emergency Radiology, 27(6), 641–651. https://doi.org/10.1007/s10140-020-01821-1
Hillel, T., Bierlaire, M., Elshafie, M. Z. E. B., & Jin, Y. (2021). A systematic review of machine learning classification methodologies for modelling passenger mode choice. Journal of Choice Modelling, 38. https://doi.org/10.1016/j.jocm.2020.100221
Kunal Pahwa, & Neha Agarwal. (2019). Stock Market Analysis using Supervised Machine Learning. 2019 International Conference on Machine Learning, Big Data, Cloud and Parallel Computing (Com-IT-Con), India, 14th -16th Feb 2019.
Kwekha-Rashid, A. S., Abduljabbar, H. N., & Alhayani, B. (2021). Coronavirus disease (COVID-19) cases analysis using machine-learning applications. Applied Nanoscience (Switzerland). https://doi.org/10.1007/s13204-021-01868-7
Majumder, A. B., Gupta, S., Singh, D., & Majumder, S. (2021). An intelligent system for prediction of COVID-19 case using machine learning framework-logistic regression. Journal of Physics: Conference Series, 1797(1). https://doi.org/10.1088/1742-6596/1797/1/012011
Mashat, A. F., Fouad, M. M., Yu, P. S., & Gharib, T. F. (2012). A Decision Tree Classification Model for University Admission System. IJACSA) International Journal of Advanced Computer Science and Applications, 3(10). www.ijacsa.thesai.org
Mohsin Abdulazeez, A., Zeebaree, D., Abdulqader, D. M., & Zeebaree, D. Q. (2020). Machine Learning Supervised Algorithms of Gene Selection: A Review. 62. https://www.researchgate.net/publication/341119469
Molin, S., Jee, K., O’Reilly for Higher Education (Firm), & Safari, an O. M. Company. (2021). Hands-On Data Analysis with Pandas - Second Edition.
Nandhini, S., & Marseline, D. J. (2020, February 1). Performance Evaluation of Machine Learning Algorithms for Email Spam Detection. International Conference on Emerging Trends in Information Technology and Engineering, Ic-ETITE 2020. https://doi.org/10.1109/ic-ETITE47903.2020.312
Narayan, S. (2021). Allergies, Cold, Flu or COVID-19? How to Tell the Difference. Emersonhospital.Org. https://www.emersonhospital.org/articles/allergies-or-covid-19
Sathiyanarayanan, M. P., Sai, S. M., & Vinayagar, M. (2019). Identification of Breast Cancer Using The Decision Tree Algorithm. 2019 IEEE International Conference on System, Computation, Automation and Networking (ICSCAN).
Sharma, D., & Kumar, N. (2017). A Review on Machine Learning Algorithms, Tasks and Applications. In International Journal of Advanced Research in Computer Engineering & Technology (IJARCET) (Vol. 6, Issue 10, pp. 2278–1323).
Yan, L., Zhang, H. T., Goncalves, J., Xiao, Y., Wang, M., Guo, Y., Sun, C., Tang, X., Jin, L., Zhang, M., Huang, X., Xiao, Y., Cao, H., Chen, Y., Ren, T., Wang, F., Xiao, Y., Huang, S., Tan, X., … Yuan, Y. (2020). A machine learning-based model for survival prediction in patients with severe COVID-19 infection. MedRxiv. https://doi.org/10.1101/2020.02.27.20028027
Zou, Q., Qu, K., Luo, Y., Yin, D., Ju, Y., & Tang, H. (2018). Predicting Diabetes Mellitus With Machine Learning Techniques. Frontiers in Genetics, 9. https://doi.org/10.3389/fgene.2018.00515
How to Cite
Copyright (c) 2022 Artika Arista
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.