Comparison of Feature Extraction Methods on Sentiment Analysis in Hotel Reviews





The development of technology causes things that done through meet in person or coming to a place can now be done by viewing information through gadgets or websites. Nowadays, to find out information about a place that provides accommodation for a vacation or a business visit, it can be done by accessing social media to see reviews from visitors who have visited the place, example, a hotel. Reviews given by hotel visitors are seen as more credible than information obtained from advertisements but the problem is that there are many reviews circulating on social media and it takes a time to analyze them. This study aims to analyze hotel reviews using the sentiment analysis method with the Support Vector Machine (SVM) approach. Sentiment analysis can be used to analyze the opinions of a large number of hotel visitors where it usually focuses on opinions that positive, negative and neutral. Before being analyzed with the support vector machine algorithm, 3 feature extraction methods will be used, namely Bag Of Words, TF-IDF and improvement TF-IDF to get the value of each word weight. The selection of these three methods is carried out by considering the influence of the presence of the same word feature in each review. In this comparison method, TF-IDF was found to be the best feature extraction method with 71.75% accuracy, 78.66% precision, 71.91% recall and 70.08% f1-score. The results obtained indicate that there are influence of features of the word in the hotel review data.

GS Cited Analysis


Download data is not yet available.


Ahuja, R., Chug, A., Kohli, S., Gupta, S., & Ahuja, P. (2019). The impact of features extraction on the sentiment analysis. Procedia Computer Science, 152.

Berrar, D. (2018). Cross-validation. In Encyclopedia of Bioinformatics and Computational Biology: ABC of Bioinformatics (Vol. 1–3).

Guo, A., & Yang, T. (2016). Research and improvement of feature words weight based on TFIDF algorithm. Proceedings of 2016 IEEE Information Technology, Networking, Electronic and Automation Control Conference, ITNEC 2016.

Himawan, H., Kaswidjanti, W., Sentimen, A., Sosial, M., & Based, L. (2018). Metode Lexicon Based dan Support Vector Machine untuk Menganalisis Sentimen pada Media Sosial sebagai Rekomendasi Oleh-Oleh Favorit. Seminar Nasional Informatika, 2018(November).

Kurniawan, A., Indriarti, & Adinugroho, S. (2019). Analisis Sentimen Opini Film Menggunakan Metode Naïve Bayes dan Lexicon Based Features. Jurnal Pengembangan Teknologi Informasi Dan Ilmu Komputer, 3(9).

Liang, H., Sun, X., Sun, Y., & Gao, Y. (2017). Text feature extraction based on deep learning: a review. Eurasip Journal on Wireless Communications and Networking, Vol. 2017.

Lo, A. S., & Yao, S. S. (2019). What makes hotel online reviews credible?: An investigation of the roles of reviewer expertise, review rating consistency and review valence. International Journal of Contemporary Hospitality Management, 31(1).

Najib, A. C., Irsyad, A., Qandi, G. A., & Rakhmawati, N. A. (2019). Perbandingan Metode Lexicon-based dan SVM untuk Analisis Sentimen Berbasis Ontologi pada Kampanye Pilpres Indonesia Tahun 2019 di Twitter. Fountain of Informatics Journal, 4(2).

Padurariu, C., & Breaban, M. E. (2019). Dealing with data imbalance in text classification. Procedia Computer Science, 159.

Pecar, S., Simko, M., & Bielikova, M. (2018). Sentiment analysis of customer reviews: Impact of text pre-processing. DISA 2018 - IEEE World Symposium on Digital Intelligence for Systems and Machines, Proceedings.

Qader, W. A., Ameen, M. M., & Ahmed, B. I. (2019). An Overview of Bag of Words;Importance, Implementation, Applications, and Challenges. Proceedings of the 5th International Engineering Conference, IEC 2019.


Silaa, V., Masui, F., & Ptaszynski, M. (2022). A Method of Supplementing Reviews to Less-Known Tourist Spots Using Geotagged Tweets. Applied Sciences (Switzerland), 12(5).

Wankhade, M., Rao, A. C. S., & Kulkarni, C. (2022). A survey on sentiment analysis methods, applications, and challenges. Artificial Intelligence Review.

Ying, X. (2019). An Overview of Overfitting and its Solutions. Journal of Physics: Conference Series, 1168(2).

Zhu, F. (2021). The Impact of High Technology on the Economy. Proceedings - 2021 5th International Conference on Data Science and Business Analytics, ICDSBA 2021.


Crossmark Updates

How to Cite

Dharma, A. S., & Saragih , Y. G. R. . (2022). Comparison of Feature Extraction Methods on Sentiment Analysis in Hotel Reviews. Sinkron : Jurnal Dan Penelitian Teknik Informatika, 7(4), 2349-2354.