Retweet Prediction Based on User-Based, Content-Based, Time-Based Features Using ANN Classification Optimized with the Bat Algorithm
Keywords:Retweet; Artificial Neural Network; Bat Algorithm; Undersampling; Oversampling
Twitter is one of the most popular social media platforms today for information dissemination. It is favored by the public due to its real-time information sharing capabilities. Twitter provides two important features for information dissemination: Tweets and Retweets. Tweets allow users to write messages that can be instantly shared. Each tweet can contain text, media such as images, videos, or URLs. Retweets allow users to repost someone else's tweet and distribute it to their own followers. The Retweet feature is considered an effective way to spread information, as a high number of retweets indicates that the information in the tweet is spreading quickly and widely. This research aims to predict retweets based on several features: User-Based Feature, Content-Based Feature, and Time-Based Feature. The classification method used is Artificial Neural Network, which is optimized using a Nature-Inspired Algorithm called Bat Algorithm. The evaluation results of this study show an accuracy of 86%, precision of 87.8%, recall of 93.6%, and F1-score of 90.6% without imbalance class handling. Under Undersampling condition, the accuracy is 80.8%, precision is 91.0%, recall is 81.4%, and F1-score is 85.9%. Under Oversampling condition, the accuracy is 82.4%, precision is 89.6%, recall is 85.6%, and F1-score is 87.5%. These results indicate that using user-based, content-based, and time-based features, applying Artificial Neural Network classification method, and optimizing hyperparameters using Bat Algorithm are effective in predicting retweets.
Daga, I., Gupta, A., Vardhan, R., & Mukherjee, P. (2020). Prediction of likes and retweets using text information retrieval. Procedia Computer Science, 168, 123–128.
Dastres, R., & Soori, M. (2021). Artificial Neural Network Systems. International Journal of Imaging and Robotics (IJIR), 21(2), 13–25.
Efsa, H. A. P. A.-S., Jondri, J., & Lhaksmana, K. M. (2021). Prediksi Retweet Menggunakan Fitur Berbasis Pengguna Dan Fitur Berbasis Konten Dengan Metode Klasifikasi Ann. EProceedings of Engineering, 8(5).
Farizawani, A. G., Puteh, M., Marina, Y., & Rivaie, A. (2020). A review of artificial neural network learning rule based on multiple variant of conjugate gradient approaches. Journal of Physics: Conference Series, 1529(2), 022040.
Firdaus, S. N., Ding, C., & Sadeghian, A. (2016). Retweet prediction considering user’s difference as an author and retweeter. 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), 852–859.
Hoang, T. B. N., & Mothe, J. (2018). Predicting information diffusion on Twitter–Analysis of predictive features. Journal of Computational Science, 28, 257–264.
Kushwaha, A. K., Kar, A. K., & Vigneswara Ilavarasan, P. (2020). Predicting information diffusion on twitter a deep learning neural network model using custom weighted word features. Responsible design, implementation and use of information and communication technology, 456–468. In Lecture Notes in Computer Science (Vol. 12066, pp. 456–468).
Kwak, H., Lee, C., Park, H., & Moon, S. (2010). What is Twitter, a social network or a news media? Proceedings of the 19th International Conference on World Wide Web, 591–600.
Lemaître, G., Nogueira, F., & Aridas, C. K. (2017). Imbalanced-learn: A python toolbox to tackle the curse of imbalanced datasets in machine learning. The Journal of Machine Learning Research, 18(1), 559–563.
Mijwel, M. M. (2018). Artificial neural networks advantages and disadvantages. Retrieved from LinkedIn Https//Www. Linkedin. Com/Pulse/Artificial-Neuralnet Work.
Pradipta, G. A., Wardoyo, R., Musdholifah, A., Sanjaya, I. N. H., & Ismail, M. (2021). SMOTE for handling imbalanced data problem: A review. 2021 Sixth International Conference on Informatics and Computing (ICIC), 1–8.
Shehab, M., Abu-Hashem, M. A., Shambour, M. K. Y., Alsalibi, A. I., Alomari, O. A., Gupta, J. N. D., Alsoud, A. R., Abuhaija, B., & Abualigah, L. (2022). A Comprehensive Review of Bat Inspired Algorithm: Variants, Applications, and Hybridization. Archives of Computational Methods in Engineering, 1–33.
Suh, B., Hong, L., Pirolli, P., & Chi, E. H. (2010). Want to be retweeted? large scale analytics on factors impacting retweet in twitter network. 2010 IEEE Second International Conference on Social Computing, 177–184.
Wasukar, A. R. (2014). Artificial neural network–an important asset for future computing. International Journal For Research In Emerging Science And Technology, 1.
Yang, X.-S. (2010). A new metaheuristic bat-inspired algorithm. In Nature inspired cooperative strategies for optimization (NICSO 2010) (pp. 65–74). Springer.
Zebari, A. Y., Almufti, S. M., & Abdulrahman, C. M. (2020). Bat algorithm (BA): review, applications and modifications. Int J Sci World, 8(1), 1–7.
Zhang, Q., Gong, Y., Wu, J., Huang, H., & Huang, X. (2016). Retweet prediction with attention-based deep neural network. Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, 75–84.
How to Cite
Copyright (c) 2023 Muhammad Rafi Rahadian, Jondri, Kemas Muslim L
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.