Sentiment Analysis Of Hotel Reviews On Tripadvisor With LSTM And ELECTRA

Authors

  • Amir Mahmud Husein Universitas Prima Indonesia
  • Nicholas Livando Universitas Prima Indonesia
  • Andika Universitas Prima Indonesia
  • William Chandra Universitas Prima Indonesia
  • Gary Phan Universitas Prima Indonesia

DOI:

10.33395/sinkron.v8i2.12234

Abstract

This study examines the importance of hotel review data analysis and the use of Natural Language Processing (NLP) technology in predicting hotel review sentiment. In this study, deep learning models such as Long Short-Term Memory (LSTM) and Efficiently Learning an Encoder that Classifies Token Replacements Accurately (ELECTRA) are used to predict hotel review sentiment in Indonesian. Hotel review data was obtained through a data scraping process with webscraper.io from the Tripadvisor website and a total of 977 hotel review data were obtained from Grand Mercure Maha Cipta Medan Angkasa. Before the sentiment prediction process is carried out, hotel review data must go through the text preprocessing stage to remove punctuation marks, capital letters, stopwords, and a lemmatizer process is carried out to facilitate further data processing. In addition, sentiments that were previously unbalanced need to be balanced through the undersampling process. The data that has been cleaned and balanced is then labeled as negative (0), neutral (1) and positive (2) sentiments. The test results show that the ELECTRA model produces better performance than the LSTM with an accuracy of 47% by ELECTRA and 30% by LSTM. GS Cited Analysis

Downloads

Download data is not yet available.

References

Arora, S. et al. (2022) ‘Reasoning over Public and Private Data in Retrieval-Based Systems’, pp. 1–27. Available at: http://arxiv.org/abs/2203.11027.

Bayu Baskoro, B., Susanto, I. and Khomsah, S. (2021) ‘Analisis Sentimen Pelanggan Hotel di Purwokerto Menggunakan Metode Random Forest dan TF-IDF (Studi Kasus: Ulasan Pelanggan Pada Situs TRIPADVISOR)’, Journal of Informatics, Information System, Software Engineering and Applications (INISTA), 3(2), pp. 21–029. doi: 10.20895/INISTA.V3.

Clark, K. et al. (2020) ‘ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators’, pp. 1–18. Available at: http://arxiv.org/abs/2003.10555.

Divya, K. et al. (2020) ‘An Interpretation of Lemmatization and Stemming in Natural Language Processing’, Journal of University of Shanghai for Science and Technology, 22(10), pp. 350–357. Available at: https://www.researchgate.net/publication/348306833.

HaCohen-Kerner, Y., Miller, D. and Yigal, Y. (2020) ‘The influence of preprocessing on text classification using a bag-of-words representation’, PLoS ONE, 15(5), pp. 1–22. doi: 10.1371/journal.pone.0232525.

Jayanto, R., Kusumaningrum, R. and Wibowo, A. (2022) ‘Aspect-based sentiment analysis for hotel reviews using an improved model of long short-term memory’, International Journal of Advances in Intelligent Informatics, 8(3), pp. 391–403. doi: 10.26555/ijain.v8i3.691.

Kusumaningrum, R. et al. (2021) ‘Sentiment analysis of Indonesian hotel reviews: from classical machine learning to deep learning’, International Journal of Advances in Intelligent Informatics, 7(3), pp. 292–303. doi: 10.26555/ijain.v7i3.737.

Moch Sambas et al. (2022) ‘Analysis of Lodging and Competition on the Island of Bali during Covid-19 with Big Data’, International Journal of Travel, Hospitality and Events, 1(3), pp. 214–228. doi: 10.56743/ijothe.v1i3.172.

Morama, H. C., Ratnawati, D. E. and Arwani, I. (2022) ‘Analisis Sentimen berbasis Aspek terhadap Ulasan Hotel Tentrem Yogyakarta menggunakan Algoritma Random Forest Classifier’, Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer, 6(4), pp. 1702–1708.

Priyantina, R. A. and Sarno, R. (2019) ‘Sentiment analysis of hotel reviews using Latent Dirichlet Allocation, semantic similarity and LSTM’, International Journal of Intelligent Engineering and Systems, 12(4), pp. 142–155. doi: 10.22266/ijies2019.0831.14.

Utami, T. A. (2021) ‘Sentiment Analysis of Hotel User Review using RNN Algorithm’, International Journal of Informatics and Computation, 3(1), p. 30. doi: 10.35842/ijicom.v3i1.34.

Yao, L. (2022) ‘Sentiment analysis based on CNN - LSTM hotel reviews’, Journal of Physics: Conference Series, 2330(1). doi: 10.1088/1742-6596/2330/1/012018.

Downloads


Crossmark Updates

How to Cite

Husein, A. M., Livando, N., Andika, A., Chandra, W., & Phan, G. . (2023). Sentiment Analysis Of Hotel Reviews On Tripadvisor With LSTM And ELECTRA. Sinkron : Jurnal Dan Penelitian Teknik Informatika, 7(2), 733-740. https://doi.org/10.33395/sinkron.v8i2.12234

Most read articles by the same author(s)

1 2 > >>