Agarwal et
al.
– 65.77 0.93M
Mollahossei
ni et al.
AlexNet 66.4 25M
Tang AlexNet 69.30 7.17M
Bodapati CNN 69.57 2.3M
3.2 Discussion
The focus is on the advantages and disadvantages of
the methods used in FER. CNN can effectively
extract spatial features from images, making it
suitable for recognizing subtle differences in facial
expressions. However, they lack the ability to capture
temporal changes in emotions, which limits their
performance in analyzing video sequences. On the
other hand, RNN-LSTM excels at processing data
sequences, allowing it to capture the evolution of
emotions over time, although it is more complex to
train and requires more computational resources.
Since CNN already has great potential, future
research can explore combining CNN and RNN-
LSTM, leveraging the advantages of these two
models to create a more accurate system for FER,
especially in videos. In addition, by using different
datasets to address cultural differences in emotional
expression, it can help reduce biases in these models.
Exploring the real-time application of FER in low-
power devices through edge computing is another
promising direction. Ethical issues, such as privacy,
should also be a focus, with research aimed at
developing FER systems for security and privacy
protection. Finally, further research can examine the
application of FER in monitoring mental health,
assisting in early detection and prevention of
psychological problems.
4 CONCLUSIONS
This study investigates the integration of deep
learning techniques into FER, with the goal of
enhancing both psychological research and practical
applications. The proposed approach employs CNNs
to extract spatial features from facial images and
RNNs with LSTM units to analyze the temporal
evolution of these features, particularly in video
sequences. Through extensive experimentation, this
study demonstrates that the CNN significantly
outperforms other models on the FER-2013 dataset,
achieving accuracy levels that surpass human
recognition. This finding underscores the
effectiveness of deep learning in capturing and
interpreting complex emotional expressions. Looking
ahead, future research will aim to refine the FER
system by further integrating CNN and RNN-LSTM
models to improve accuracy and robustness. Key
areas of focus will include addressing cultural biases
to ensure the system's applicability across diverse
populations, developing real-time FER applications
for deployment on low-power devices, and
addressing privacy and ethical considerations. These
efforts will help in creating more accurate, inclusive,
and practical FER solutions that can be effectively
used in a variety of settings.
REFERENCES
Antonio, V. A. A., Ono, N., Saito, A., Sato, T., Altaf-Ul-
Amin, M., & Kanaya, S. 2018. Classification of lung
adenocarcinoma transcriptome subtypes from
pathological images using deep convolutional networks.
International journal of computer assisted radiology
and surgery, 13, 1905-1913.
Bodapati, J. D., Srilakshmi, U., & Veeranjaneyulu, N. 2022.
FERNet: a deep CNN architecture for facial expression
recognition in the wild. Journal of The institution of
engineers (India): series B, 103(2), 439-448.
Bota, P. J., Wang, C., Fred, A. L., & Da Silva, H. P. 2019.
A review, current challenges, and future possibilities on
emotion recognition using machine learning and
physiological signals. IEEE access, 7, 140990-141020.
(2)
Cowie, R., Douglas-Cowie, E., Tsapatsoulis, N., Votsis, G.,
Kollias, S., Fellenz, W., & Taylor, J. G. 2001. Emotion
recognition in human-computer interaction. IEEE
Signal processing magazine, 18(1), 32-80. (7)
Dondeti, V., Bodapati, J. D., Shareef, S. N., &
Veeranjaneyulu, N. 2020. Deep Convolution Features
in Non-linear Embedding Space for Fundus Image
Classification. Rev. d'Intelligence Artif., 34(3), 307-
313.
Georgescu, M. I., Ionescu, R. T., & Popescu, M. 2019.
Local learning with deep and handcrafted features for
facial expression recognition. IEEE Access, 7, 64827-
64836.
Giannopoulos, P., Perikos, I., & Hatzilygeroudis, I. 2018.
Deep learning approaches for facial emotion
recognition: A case study on FER-2013. Advances in
hybridization of intelligent methods: Models, systems
and applications, 1-16.
Girshick, R., Donahue, J., Darrell, T., & Malik, J. 2014.
Rich feature hierarchies for accurate object detection
and semantic segmentation. In Proceedings of the IEEE
conference on computer vision and pattern recognition
(pp. 580-587).
Goodfellow, I. J., Erhan, D., Carrier, P. L., Courville, A.,
Mirza, M., Hamner, B., ... & Bengio, Y. 2013.
Challenges in representation learning: A report on three
machine learning contests. In Neural information
processing: 20th international conference, ICONIP