Optimizing Facial Expression Recognition with Image Augmentation Techniques: VGG19 Approach on FERC Dataset
DOI:
10.33395/sinkron.v8i2.13507Keywords:
Augmentation; Image Generator; SMOTE; Convolutional Neural Networl; VGG19Abstract
In the field of facial expression recognition (FER), the availability of balanced and representative datasets is key to success in training accurate models. However, Facial Expression Recognition Challenge (FERC) datasets often face the challenge of class imbalance, where some facial expressions have a much smaller number of samples compared to others. This issue can result in biased and unsatisfactory model performance, especially in recognizing less common facial expressions. Data augmentation techniques are becoming an important strategy as they can expand the dataset by creating new variations of existing samples, thus increasing the variety and diversity of the data. Data augmentation can be used to increase the number of samples for less common facial expression classes, thus improving the model's ability to recognize and understand diverse facial expressions. The augmentation results are then combined with balancing techniques such as SMOTE coupled with undersampling to improve model performance. In this study, VGG19 is used to support better model performance. This will provide valuable guidelines for optimizing more advanced CNN models in the future and may encourage further research in creating more innovative augmentation techniques.
Downloads
References
Anantrasirichai, N., & Bull, D. (2022). Artificial intelligence in the creative industries: a review. Artificial Intelligence Review, 55(1), 589–656. https://doi.org/10.1007/s10462-021-10039-7
Balla, A., Habaebi, M. H., Elsheikh, E. A. A., Islam, M. R., & Suliman, F. M. (2023). The Effect of Dataset Imbalance on the Performance of SCADA Intrusion Detection Systems. Sensors, 23(2). https://doi.org/10.3390/s23020758
Białek, C., Matiolański, A., & Grega, M. (2023). An Efficient Approach to Face Emotion Recognition with Convolutional Neural Networks. Electronics (Switzerland), 12(12). https://doi.org/10.3390/electronics12122707
Chan, E., Kelly, M., & Schnabel, J. A. (2021). Comparison of classical machine learning deep learning to characterise fibrosis inflammation using quantitative MRI. Proceedings - International Symposium on Biomedical Imaging, 2021-April, 729–732. https://doi.org/10.1109/ISBI48211.2021.9433962
Chen, Y., Chang, R., & Guo, J. (2021). Effects of Data Augmentation Method Borderline-SMOTE on Emotion Recognition of EEG Signals Based on Convolutional Neural Network. IEEE Access, 9, 47491–47502. https://doi.org/10.1109/ACCESS.2021.3068316
Dablain, D., Krawczyk, B., & Chawla, N. V. (2022). DeepSMOTE: Fusing Deep Learning and SMOTE for Imbalanced Data. IEEE Transactions on Neural Networks and Learning Systems, PP, 1–15. https://doi.org/10.1109/TNNLS.2021.3136503
Febrian, R., Halim, B. M., Christina, M., Ramdhan, D., & Chowanda, A. (2022). Facial expression recognition using bidirectional LSTM-CNN. Procedia Computer Science, 216(2022), 39–47. https://doi.org/10.1016/j.procs.2022.12.109
Gour, M., Jain, S., & Sunil Kumar, T. (2020). Residual learning based CNN for breast cancer histopathological image classification. International Journal of Imaging Systems and Technology, 30(3), 621–635. https://doi.org/10.1002/ima.22403
He, Y. (2023). Facial Expression Recognition Using Multi-Branch Attention Convolutional Neural Network. IEEE Access, 11(December 2022), 1244–1253. https://doi.org/10.1109/ACCESS.2022.3233362
Isthigosah, M., Sunyoto, A., & Hidayat, T. (2023). Image Augmentation for BreaKHis Medical Data using Convolutional Neural Networks. Sinkron, 8(4), 2381–2392. https://doi.org/10.33395/sinkron.v8i4.12878
Kummer, A., Ruppert, T., Medvegy, T., & Abonyi, J. (2022). Machine learning-based software sensors for machine state monitoring - The role of SMOTE-based data augmentation. Results in Engineering, 16(November). https://doi.org/10.1016/j.rineng.2022.100778
Küntzler, T., Höfling, T. T. A., & Alpers, G. W. (2021). Automatic Facial Expression Recognition in Standardized and Non-standardized Emotional Expressions. Frontiers in Psychology, 12(May), 1–13. https://doi.org/10.3389/fpsyg.2021.627561
Li, S., & Deng, W. (2022). Deep Facial Expression Recognition: A Survey. IEEE Transactions on Affective Computing, 13(3), 1195–1215. https://doi.org/10.1109/TAFFC.2020.2981446
Mehendale, N. (2020). Facial emotion recognition using convolutional neural networks (FERC). SN Applied Sciences, 2(3). https://doi.org/10.1007/s42452-020-2234-1
Pham, L., Huynh Vu, T., Anh Tran, T., Chi Minh City, H., Trung Ward, L., & Duc District, T. (n.d.). Facial Expression Recognition Using Residual Masking Network. Retrieved from http://github.com/phamquiluan/ResidualMaskingNetwork
Pise, A. A., Alqahtani, M. A., Verma, P., Purushothama, K., Karras, D. A., Prathibha, S., & Halifa, A. (2022). Methods for Facial Expression Recognition with Applications in Challenging Situations. Computational Intelligence and Neuroscience, 2022. https://doi.org/10.1155/2022/9261438
Prabaswera, D. R., & Soeparno, H. (2023). Facial Emotion Recognition Using Convolutional Neural Network Based on the Visual Geometry Group-19. Jurnal TAM (Technology Acceptance Model), 14(1), 48. https://doi.org/10.56327/jurnaltam.v14i1.1475
Tripathi, P., Khatri, S. K., & Greunen, D. Van. (2022). A transfer learning approach to implementation of pretrained CNN models for Breast cancer diagnosis. 6(4), 5816–5830.
Xu, Y., & Wali, A. (2023). Handwritten Pattern Recognition using Birds-Flocking Inspired Data Augmentation Technique. IEEE Access, 11(May), 71426–71434. https://doi.org/10.1109/ACCESS.2023.3294566
M. C. Irmak, M. B. H. Taş, S. Turan and A. Haşıloğlu, Emotion Analysis from Facial Expressions Using Convolutional Neural Networks, 2021 6th International Conference on Computer Science and Engineering (UBMK), Ankara, Turkey, 2021, pp. 570-574, doi: 10.1109/UBMK52708.2021.9558917.
Wang, Y., Li, Y., Song, Y., & Rong, X. (2019). Facial expression recognition based on random forest and convolutional neural network. Information (Switzerland), 10(12). https://doi.org/10.3390/info10120375
Lagunas, M., & Garces, E. (2017). Transfer Learning for Illustration Classification. 27th Spanish Computer Graphics Conference, CEIG 2017, 77–85. https://doi.org/10.2312/ceig.20171213
Downloads
How to Cite
Issue
Section
License
Copyright (c) 2024 Fahma Inti Ilmawati, Kusrini, Tonny Hidayat
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.