Effect of Data Augmentation Levels on MobileNetV2 Performance for Facial Expression Recognition Using FER-2013 Dataset
Keywords:
Deep Learning, Data Augmentation, MobileNetV2, Facial Expression RecognitionAbstract
This research is conducted to assess how data augmentation techniques influence the performance of deep learning models in the task of facial expression classification. A critical issue commonly encountered in facial image analysis is the scarcity of available datasets, which frequently results in overfitting and limits the model’s ability to generalize effectively to unseen data. In response to this limitation, the study implements two distinct levels of augmentation, referred to as light augmentation and complex augmentation, and evaluates their performance against a baseline condition where no augmentation is applied. The model employed in this investigation is MobileNetV2, trained using the FER-2013 dataset that contains seven distinct emotion categories. The findings from the experimental evaluation indicate that the application of light augmentation yields the highest validation accuracy at 22.7%, outperforming both the no-augmentation scenario (10.8%) and the complex augmentation approach (8.3%). Although the use of complex augmentation results in lower loss values, it does not translate into improved classification accuracy. This outcome suggests that overly intensive augmentation strategies may hinder the model’s capacity to effectively learn and extract meaningful features. Overall, these results highlight the necessity of carefully determining appropriate augmentation methods to enhance the performance of deep learning models.
References
Goodfellow, Y. Bengio, and A. Courville, Deep Learning. MIT Press, 2016.
Krizhevsky, I. Sutskever, and G. Hinton, “ImageNet Classification with Deep Convolutional Neural Networks,” in Proc. NeurIPS, 2012.
L. Perez and J. Wang, “The Effectiveness of Data Augmentation in Image Classification,” arXiv preprint arXiv:1712.04621, 2017.
M. Sandler et al., "MobileNetV2: Inverted Residuals and Linear Bottlenecks," CVPR, 2018.
Goodfellow et al., “Challenges in Representation Learning: A Report on FER-2013,” in ICML Workshop, 2013.
S. Shorten and T. M. Khoshgoftaar, “A survey on Image Data Augmentation for Deep Learning,” Journal of Big Data, vol. 6, no. 1, 2020.
H. Wang, Y. Wang, and Z. Zhou, “Facial Expression Recognition Based on Deep Learning: A Survey,” IEEE Access, vol. 8, pp. 146, 2020.
M. Tan and Q. Le, “EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks,” in Proc. ICML, 2020.
Howard et al., “Searching for MobileNetV3,” in Proc. ICCV, 2020.
Z. Zhang, P. Luo, C. C. Loy, and X. Tang, “Facial Expression Recognition Using Deep Learning: A Review,” Pattern Recognition, vol. 122, 2022.
J. Li and S. Deng, “Robust Facial Expression Recognition with Data Augmentation and CNN,” IEEE Access, vol. 10, pp. 2023.
Y. Li et al., “Lightweight CNN-Based Facial Expression Recognition for Real-Time Applications,” Sensors, vol. 23, no. 4, 2023.
M. S. Novelan and S. Aryza, “Optimization CVRP with Machine Learning for Improved Classification of Imbalanced Data Food Distribution,” JITK, vol. 10, no. 4, pp. 917–925, 2025.
R. Idhami et al., “Implementation of an Intelligent System to Predict Product Demand with Backpropagation Neural Network Algorithm,” IJSET, 2026.
N. Tampubolon et al., “Implementation of Deep Learning Algorithm for Face Detection Attendance System,” IJSET.
R. Yuliansyah et al., “Deep Q-Network Analysis in Optimizing Data Processing for Decision Making,” Jurnal Jatilima, 2025.
M. S. Novelan and S. Aryza, “Belajar Mengenal Dasar Machine Learning”, Serasi Media Teknologi, 2025.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Ramlan Marbun, Muhammad Syahputra Novelan; muhammad Irfan Sarif

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.




