Comparison of Naïve Bayes Algorithm, C4.5 and Random Forest for Classification in Determining Sentiment for Ojek Online Service
Keywords:
Sentiment Analysis, Naïve Bayes, C4.5 and Random ForestAbstract
At present the technological developments that affect each system and in the aspects of online transportation and transportation are currently the most popular transportation because of the ease of using these transportation services in mobile phone applications. Some people express their opinions and opinions about users of public transport through social media sites and other websites. This opinion can be used as a material for sentiment analysis to find out whether public transport services are positive or negative. The purpose of this study was to find out the sentiments in the tweet opinion and to find out the results of the classification of the Naive Bayes method, C4.5 and the Random Forest algorithm that were used and compared. In this study, from the results of testing with performance measurements the three algorithms use Cross Validation, Confusion Matrix and ROC Curve.At present the technological developments that affect each system and in the aspects of online transportation and transportation are currently the most popular transportation because of the ease of using these transportation services in mobile phone applications. Some people express their opinions and opinions about users of public transport through social media sites and other websites. This opinion can be used as a material for sentiment analysis to find out whether public transport services are positive or negative. The purpose of this study was to find out the sentiments in the tweet opinion and to find out the results of the classification of the Naive Bayes method, C4.5 and the Random Forest algorithm that were used and compared. In this study, from the results of testing with performance measurements the three algorithms use Cross Validation, Confusion Matrix and ROC Curve.
Downloads
References
Aprilla, D, Baskoro, Donny Aji, Ambarwati, Lia, & Wicaksana, IWayan Simri. (2013). Belajar Data Mining dengan Rapid Miner. Jakarta.
Dey, Lopamudra, Sanjay Chakraborty, Anuraag Biswas, Beepa Bose, Sweta Tiwari. (2016). Sentiment Analysis of Review Datasets Using Naïve Bayes‘and K-NN Classifier. I.J. Information Engineering and Electronic Business. DOI: 10.5815/ijieeb.2016.04.07
Feldman, R dan Sanger, J., (2007). The Text Mining Handbook: Advanced Approaches in Analyzing Unstructured Data. Cambridge University Pres : New York.
Gorunescu, F. (2011). Data Mining Concepts, Models and Techniques. Verlag Berlin Heidelrbeg: Springer
Hadi, Alfian Futuhul, Dimas Bagus C. W, Moh.
Hasan. (2017). Text Mining Pada Media Sosial Twitter Studi Kasus: Masa Tenang Pilkada DKI 2017 Putaran 2. Seminar Nasional Matematika dan Aplikasinya, 21 Oktober 2017.Surabaya, Universitas Airlangga.
Liu, B. 2010. Handbook of Natural Language Processing, chapter Sentiment Analysis and Analysis, 2nd Edition. Chapman & Hall / CRC Press.
Negara, Edi Surya. Ria Andryani dan Prihambodo Hendro Saksono, 2016. Twitter Data Analytics: Geospatial Data Extraction and Analysis, Pew Research Center, INKOM, Vol. 10, No. 1, Mei 2016.
Nugroho, Didik Gabian, Yulison Herry Chrisnanto dan Agung Wahana. (2016). Analisis Sentimen Pada Jasa Ojek Online Menggunakan Metode Naïve Bayes. ISBN 978-602-99334-5-1. Prosiding SNST ke-7.
Prasetyo, Eko. (2014). Data Mining Mengolah Data Menjadi Informasi Menggunakan Matlab. Yogyakarta: Andi Offset.
Putler, Daniel S, dan Robert E Krider. (2015). Customer and Business Analytics: Applied Data Mining for Business Decision Making Using R. CRC Press. USA. ISBN-13:978-1-4987-5970-0 (EPUB)
Sartika, Dewi dan Dana Indra Sensuse. (2017). Perbandingan Algoritma Kasifikasi Naïve Bayes dan Neares Neighbour dan Decission Tree pada Studi Kasus Pengambilan Keputusan Pemilihan Pola Pakaian. Jatisi. Vol 8, No. 2, Maret 2017.
Susanti, Aisah Rini, Taufik Djatna dan Wisnu Ananta Kusuma. (2017). Twitter’s Sentiment Analysis on Gsm Services using Multinomial Naïve Bayes. Jurnal; TELKOMNIKA. Vol.15, No.3, September 2017. ISSN: 1693-6930, accredited “A” by DIKTI, Decree No: 58/DIKTI/Kep/2013.
Tala Fadillah Z. 2003. A study of stemming effects on Information retrieval in
Vercellis, C. (2009). Business Intelligent: Data Mining and Optimization for Decision Making. Southern Gate: John Willey & Sons Inc.