IMPLEMENTATITON OF RANDOM FOREST ALGORTIHM ON SALES DATA TO PREDICT CHURN POTENTIAL IN SUZUYA SUPERMARKET PRODUCTS

Authors

  • Abdi Dharma, Universitas Prima Indonesia, Indonesia, https://orcid.org/0000-0003-4411-8572
  • Christnatalis, Universitas Prima Indonesia, Indonesia
  • Windy Candra, Universitas Prima Indonesia, Indonesia
  • Josua Presen Turnip, Universitas Prima Indonesia, Indonesia

DOI:

10.33395/sinkron.v8i2.12243

Abstract

Supermarkets often concentrate their sales efforts on products that are popular and in high demand. This seasonal sales technique can produce a visible imbalance in sales across the products a supermarket carries. That imbalance can be the initial reason a product loses customer interest and is eventually removed from the store. A classification model that predicts which products will be eliminated, or churn, can help staff distribute sales effort across products. The more products churn for lack of buyers, the more the supermarket's overall sales can suffer. The purpose of this study is to assist staff in classifying products with churn potential. Three classification models were built, each with a different algorithm, and the results show that Random Forest is the most effective, with 96% accuracy compared to 81% for Logistic Regression and 46% for the Support Vector Machine.
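
The three-way comparison described in the abstract can be illustrated with a minimal sketch. It assumes scikit-learn and uses a synthetic, imbalanced dataset as a stand-in for the supermarket sales data, which is not available here. The feature count, split ratio, scaling step, and hyperparameters are all assumptions made for illustration, not the authors' settings, so the printed accuracies will not match the figures reported above.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC


def compare_classifiers(X, y, seed=0):
    """Fit three classifiers on one stratified split and return test accuracy."""
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.2, stratify=y, random_state=seed
    )
    models = {
        "random_forest": RandomForestClassifier(n_estimators=100, random_state=seed),
        # Scaling is an assumption here; it usually helps the two models below.
        "logistic_regression": make_pipeline(
            StandardScaler(), LogisticRegression(max_iter=1000)
        ),
        "svm": make_pipeline(StandardScaler(), SVC()),
    }
    scores = {}
    for name, model in models.items():
        model.fit(X_train, y_train)
        scores[name] = accuracy_score(y_test, model.predict(X_test))
    return scores


# Hypothetical stand-in for per-product sales features: label 1 = churn product.
X, y = make_classification(
    n_samples=500, n_features=8, n_informative=5,
    weights=[0.8, 0.2], random_state=0,
)
scores = compare_classifiers(X, y)
for name, acc in scores.items():
    print(f"{name}: {acc:.2f}")
```

Because churn products are typically a minority class, accuracy alone can be misleading; a fuller comparison would also report per-class precision and recall.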

How to Cite

Dharma, A., Christnatalis, Candra, W., & Turnip, J. P. (2023). IMPLEMENTATITON OF RANDOM FOREST ALGORTIHM ON SALES DATA TO PREDICT CHURN POTENTIAL IN SUZUYA SUPERMARKET PRODUCTS. Sinkron : Jurnal Dan Penelitian Teknik Informatika, 7(2), 866-872. https://doi.org/10.33395/sinkron.v8i2.12243