TF-IDF Method and Vector Space Model Regarding the Covid-19 Vaccine on Online News
Keywords: TF-IDF, Vector Space Model, Covid-19, Vaccine, Online News
Advances in information technology have made internet use widespread among the general public. Online news sites are one of the technologies that have developed as a means of disseminating the latest information around the world. In terms of volume, the news available is more than sufficient for readers to obtain the information they want. However, this volume also leads to an information explosion and possible information redundancy. A search system is one solution expected to help find the information that is relevant to an input query. The methods commonly used for this purpose are TF-IDF and the Vector Space Model (VSM), which weight terms statistically across a collection of documents. In this study they are applied to a search for information about the Covid-19 vaccine in kompas.com news. The text is first tokenized to separate it into terms, followed by stopword removal (filtering) to discard unnecessary words, which usually consist of conjunctions and similar words. The next step is stemming, which reduces inflected words to their base form. The TF-IDF and VSM calculations are then carried out. The final result ranks news document 3 (DOC 3) first with a weight of 5.914226424, followed by news document 2 (DOC 2) with a weight of 1.767692186, news document 5 (DOC 5) with a weight of 1.550165096, news document 4 (DOC 4) with a weight of 1.17141223, and lastly news document 1 (DOC 1) with a weight of 0.5244103739.
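The pipeline described above (tokenizing, stopword removal, TF-IDF weighting, and ranking in a vector space) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The documents, the query, the stopword list, and all function names are hypothetical, stemming is omitted, and the weighting is assumed to be raw term frequency multiplied by log10(N / df). The sketch ranks by cosine similarity, which is bounded by 1, so its scores are not comparable to the weights reported above.

```python
import math
import re
from collections import Counter

# Hypothetical stopword list; a real system would use a full language-specific list.
STOPWORDS = {"the", "a", "of", "to", "is", "in", "and", "will", "be"}


def preprocess(text):
    """Lowercase, tokenize on letters and digits, drop stopwords (no stemming here)."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return [t for t in tokens if t not in STOPWORDS]


def tf_idf_vectors(token_lists):
    """Return (idf, vectors) with weight = raw tf * log10(N / df). Assumed variant."""
    n_docs = len(token_lists)
    df = Counter(term for tokens in token_lists for term in set(tokens))
    idf = {term: math.log10(n_docs / count) for term, count in df.items()}
    vectors = [
        {term: tf * idf[term] for term, tf in Counter(tokens).items()}
        for tokens in token_lists
    ]
    return idf, vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * v.get(term, 0.0) for term, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank(query, docs):
    """Rank documents by cosine similarity between query and document vectors."""
    token_lists = [preprocess(text) for text in docs.values()]
    idf, vectors = tf_idf_vectors(token_lists)
    # Query terms that never occur in the collection get weight 0.
    query_tf = Counter(preprocess(query))
    query_vec = {term: tf * idf.get(term, 0.0) for term, tf in query_tf.items()}
    scores = {name: cosine(query_vec, vec) for name, vec in zip(docs, vectors)}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical toy collection; these are not the kompas.com articles from the study.
docs = {
    "DOC 1": "government sends sms to vaccine recipients",
    "DOC 2": "sinovac vaccine distributed to provinces",
    "DOC 3": "covid vaccine schedule for health workers",
}
query = "sinovac vaccine"

ranking = rank(query, docs)
for name, score in ranking:
    print(name, round(score, 4))
```

In this toy collection the term "vaccine" occurs in every document, so its IDF is zero and it does not affect the ranking. Only the rarer term "sinovac" separates the documents, which shows why TF-IDF favors discriminative terms.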
Copyright (c) 2021 Bita Parga Zen, Irwan Susanto, Dian Finaliamartha
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.