Sentiment Analysis Of Hotel Reviews On Tripadvisor With LSTM And ELECTRA
DOI:
10.33395/sinkron.v8i2.12234Abstract
This study examines the importance of hotel review data analysis and the use of Natural Language Processing (NLP) technology in predicting hotel review sentiment. In this study, deep learning models such as Long Short-Term Memory (LSTM) and Efficiently Learning an Encoder that Classifies Token Replacements Accurately (ELECTRA) are used to predict hotel review sentiment in Indonesian. Hotel review data was obtained through a data scraping process with webscraper.io from the Tripadvisor website and a total of 977 hotel review data were obtained from Grand Mercure Maha Cipta Medan Angkasa. Before the sentiment prediction process is carried out, hotel review data must go through the text preprocessing stage to remove punctuation marks, capital letters, stopwords, and a lemmatizer process is carried out to facilitate further data processing. In addition, sentiments that were previously unbalanced need to be balanced through the undersampling process. The data that has been cleaned and balanced is then labeled as negative (0), neutral (1) and positive (2) sentiments. The test results show that the ELECTRA model produces better performance than the LSTM with an accuracy of 47% by ELECTRA and 30% by LSTM.Downloads
References
Arora, S. et al. (2022) ‘Reasoning over Public and Private Data in Retrieval-Based Systems’, pp. 1–27. Available at: http://arxiv.org/abs/2203.11027.
Bayu Baskoro, B., Susanto, I. and Khomsah, S. (2021) ‘Analisis Sentimen Pelanggan Hotel di Purwokerto Menggunakan Metode Random Forest dan TF-IDF (Studi Kasus: Ulasan Pelanggan Pada Situs TRIPADVISOR)’, Journal of Informatics, Information System, Software Engineering and Applications (INISTA), 3(2), pp. 21–029. doi: 10.20895/INISTA.V3.
Clark, K. et al. (2020) ‘ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators’, pp. 1–18. Available at: http://arxiv.org/abs/2003.10555.
Divya, K. et al. (2020) ‘An Interpretation of Lemmatization and Stemming in Natural Language Processing’, Journal of University of Shanghai for Science and Technology, 22(10), pp. 350–357. Available at: https://www.researchgate.net/publication/348306833.
HaCohen-Kerner, Y., Miller, D. and Yigal, Y. (2020) ‘The influence of preprocessing on text classification using a bag-of-words representation’, PLoS ONE, 15(5), pp. 1–22. doi: 10.1371/journal.pone.0232525.
Jayanto, R., Kusumaningrum, R. and Wibowo, A. (2022) ‘Aspect-based sentiment analysis for hotel reviews using an improved model of long short-term memory’, International Journal of Advances in Intelligent Informatics, 8(3), pp. 391–403. doi: 10.26555/ijain.v8i3.691.
Kusumaningrum, R. et al. (2021) ‘Sentiment analysis of Indonesian hotel reviews: from classical machine learning to deep learning’, International Journal of Advances in Intelligent Informatics, 7(3), pp. 292–303. doi: 10.26555/ijain.v7i3.737.
Moch Sambas et al. (2022) ‘Analysis of Lodging and Competition on the Island of Bali during Covid-19 with Big Data’, International Journal of Travel, Hospitality and Events, 1(3), pp. 214–228. doi: 10.56743/ijothe.v1i3.172.
Morama, H. C., Ratnawati, D. E. and Arwani, I. (2022) ‘Analisis Sentimen berbasis Aspek terhadap Ulasan Hotel Tentrem Yogyakarta menggunakan Algoritma Random Forest Classifier’, Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer, 6(4), pp. 1702–1708.
Priyantina, R. A. and Sarno, R. (2019) ‘Sentiment analysis of hotel reviews using Latent Dirichlet Allocation, semantic similarity and LSTM’, International Journal of Intelligent Engineering and Systems, 12(4), pp. 142–155. doi: 10.22266/ijies2019.0831.14.
Utami, T. A. (2021) ‘Sentiment Analysis of Hotel User Review using RNN Algorithm’, International Journal of Informatics and Computation, 3(1), p. 30. doi: 10.35842/ijicom.v3i1.34.
Yao, L. (2022) ‘Sentiment analysis based on CNN - LSTM hotel reviews’, Journal of Physics: Conference Series, 2330(1). doi: 10.1088/1742-6596/2330/1/012018.
Downloads
How to Cite
Issue
Section
License
Copyright (c) 2023 Andika Lin, Nicholas Livando, William Chandra, Gary Phan, Amir Mahmud Husein

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.