Supervised Model for Sentiment Analysis Based on Hotel Review Clusters using RapidMiner

Authors

  • Revin Novian Juliadi Widyatama University, Bandung, Indonesia
  • Yan Puspitarani Widyatama University, Bandung, Indonesia

DOI:

10.33395/sinkron.v7i3.11564

Keywords:

Data Mining, Text Mining, Sentiment Analysis, Opinion Mining, Text Clustering, Naive Bayes, K-Means, Natural Language Processing, RapidMiner

Abstract

Customer feedback in the modern era like today is mostly presented in the form of digital reviews, including customer feedback at an inn or hotel, customer feedback is very valuable data where from this data the management can find out, identify and analyze the customer experience and what they need. With customer feedback in the form of digital reviews, it will allow a lot of data that can be obtained by hotel management and will provide many benefits if the data is processed correctly. To take advantage of large  text review data, a combination of data mining and natural language processing techniques was chosen to process text in depth and efficiently.  Text mining in the form of creating an opinion mining model using the Naïve Bayes classification algorithm is applied to find information and measure the main sentiments expressed in the reviewed text dataset, then the application of K-Means text grouping aims to group texts and get information about the main topics discussed from the content of the review dataset text in each group . By applying the constructed sentiment analysis model, approximately 90.90% accuracy results were obtained in reading texts and measuring sentiments related to hotel customer feedback data.

GS Cited Analysis

Downloads

Download data is not yet available.

References

Alam, Md. H., Ryu, W.-J., & Lee, S. (2016). Joint multi-grain topic sentiment: modeling semantic aspects for online reviews. Information Sciences, 339, 206–223. doi:10.1016/j.ins.2016.01.013

Chauhan, N. S. (2022, April 8). Naïve Bayes Algorithm: Everything You Need to Know - KDnuggets. Retrieved July 6, 2022, from https://www.kdnuggets.com/2020/06/naive-bayes-algorithm-everything.html

Datafiniti. (2019). Hotel Reviews | Kaggle. Retrieved July 7, 2022, from https://www.kaggle.com/datasets/datafiniti/hotel-reviews

Farisi, A. A., Sibaroni, Y., & Faraby, S. al. (2019). Sentiment analysis on hotel reviews using Multinomial Naïve Bayes classifier. Journal of Physics: Conference Series, 1192, 012024. doi:10.1088/1742-6596/1192/1/012024

Ghorpade, T., & Ragha, L. (2012). Featured-based sentiment classification for hotel reviews using NLP and Bayesian classification. In 2012 International Conference on Communication, Information & Computing Technology (ICCICT) (pp. 1–5). IEEE. doi:10.1109/ICCICT.2012.6398136

Harmanpreet Singh. (2017). Hotel Reviews | Kaggle. Retrieved July 7, 2022, from https://www.kaggle.com/datasets/harmanpreet93/hotelreviews

Hu, Y.-H., Chen, Y.-L., & Chou, H.-L. (2017). Opinion mining from online hotel reviews – A text summarization approach. Information Processing & Management, 53(2), 436–449. doi:10.1016/j.ipm.2016.12.002

Jiashen Liu. (2017). 515K Hotel Reviews Data in Europe | Kaggle. Retrieved July 7, 2022, from https://www.kaggle.com/datasets/jiashenliu/515k-hotel-reviews-data-in-europe/metadata

Križo, P., Madzík, P., Vilgová, Z., & Sirotiaková, M. (2018). Evaluation of the Most Frequented Forms of Customer Feedback Acquisition and Analysis (pp. 562–573). doi:10.1007/978-3-319-95204-8_47

Mergel, I., Edelmann, N., & Haug, N. (2019). Defining digital transformation: Results from expert interviews. Government Information Quarterly, 36(4). doi:10.1016/J.GIQ.2019.06.002

Redhu, S., Srivastava, S., Bansal, B., & Gupta, G. (2018). Sentiment Analysis Using Text Mining: A Review. International Journal on Data Science and Technology, 4(2), 49. doi:10.11648/j.ijdst.20180402.12

Shi, H.-X., & Li, X.-J. (2011). A sentiment analysis model for hotel reviews based on supervised learning. In 2011 International Conference on Machine Learning and Cybernetics (pp. 950–954). IEEE. doi:10.1109/ICMLC.2011.6016866

Trevino, A. (2016, December 6). Introduction to K-means Clustering. Retrieved July 6, 2022, from https://blogs.oracle.com/ai-and-datascience/post/introduction-to-k-means-clustering

Webb, G. I. (2016). Naïve Bayes. In Encyclopedia of Machine Learning and Data Mining (pp. 1–2). Boston, MA: Springer US. doi:10.1007/978-1-4899-7502-7_581-1

What is Digital Transformation - Salesforce. (n.d.). Retrieved July 4, 2022, from https://www.salesforce.com/ap/products/platform/what-is-digital-transformation/

Zhang, X., & Yu, Q. (2017). Hotel reviews sentiment analysis based on word vector clustering. In 2017 2nd IEEE International Conference on Computational Intelligence and Applications (ICCIA) (pp. 260–264). IEEE. doi:10.1109/CIAPP.2017.8167219

Zvarevashe, K., & Olugbara, O. O. (2018). A framework for sentiment analysis with opinion mining of hotel reviews. In 2018 Conference on Information Communications Technology and Society (ICTAS) (pp. 1–4). IEEE. doi:10.1109/ICTAS.2018.8368746

Downloads


Crossmark Updates

How to Cite

Juliadi, R. N., & Puspitarani, Y. (2022). Supervised Model for Sentiment Analysis Based on Hotel Review Clusters using RapidMiner. Sinkron : Jurnal Dan Penelitian Teknik Informatika, 6(3), 1059-1066. https://doi.org/10.33395/sinkron.v7i3.11564