Face Recognition Based on ResNet Architecture
Peilu Zhu
Yongkang No.1 High School, Chengnan Road, Yongkang, Zhejiang Province 321300, China
Keywords: ResNet, AdamW, Early Stopping, Deep Learning, Face Recognition.
Abstract: With deep learning developing, face recognition techniques are widely applied in various scenarios. Currently,
most methods for achieving face recognition are based on Convolutional Neural Networks (CNN), although
they have flaws. To solve the problem of long training time and improve accuracy, the Residual Network
(ResNet), as a strong tool, was introduced as the main architecture in place of CNN to deal with complex data.
Besides, Adam with Weight Decay (AdamW) was chosen as the optimizer and early stopping was
implemented to improve model performance. 400 facial images from 40 different individuals were selected
from the Olivetti faces dataset in the experiment. The experiment was conducted successfully. Losses, training
time and accuracy of ResNet-18 and ResNet-50 in both 10 epochs and 30 epochs were calculated and then
compared. The experimental results show that ResNet-18 performed better than ResNet-50 on such a 400
images dataset with overall lower losses and less training time. The training time is saved by 10%~15% after
introducing the early stopping method.
1 INTRODUCTION
Face recognition is a kind of biological recognition
technology. Nowadays face recognition technique is
widely used in multiple situations, including security
monitoring, password authentication and law
enforcement (Taskiran, Kahraman, & Erdem, 2020).
Though this technique makes life more convenient
and improves the security of private data to a certain
extent, the problems of slow recognition and
uncertainty still exist when the dataset is large. It is
vitally significant to accelerate the recognition
process and improve the accuracy of recognition.
There are various methods to achieve face
recognition (Changwei, Jun, Lingyun, Yali, Sheng,
2020) (Opanasenko, Fazilov, Mirzaev, Sa’dullo ugli
Kakharov, 2024) (Opanasenko, Fazilov, Radjabov,
2024. ). For example, Opanasenko, V. M., Fazilov, S.
K., Mirzaev, O. N. and Sa’dullo ugli Kakharov, S.
used a method for recognizing faces in mobile
devices, based on an ensemble approach to solving
the problem (Opanasenko, Fazilov, Mirzaev, Sa’
dullo ugli Kakharov, 2024). Compared with others,
methods based on deep learning seem to be the most
simplified and efficient. Deng, N., Xu, Z., Li, X.,
Gao, C. and Wang, X. used a noise-applied spatial
clustering algorithm based on density to cluster a
large dataset to a self-constructed dataset (Deng, Xu,
Li, Gao, Wang, 2024). They reduced the uncertainty
of the model and saved space taken up by the dataset.
Said, Y., Barr, M. and Ahmed, H. E. designed and
evaluated a deep learning model based on CNN to
detect facial information in a real-time environment
(Said, Barr, Ahmed, 2020). They successfully
adjusted parameters to improve the accuracy in
standard datasets and real-time input. Khalifa, A.,
Abdelrahman, A.A. and Hempel, T. introduced a
robust and efficient CNN-designed model for face
recognition (Khalifa, Abdelrahman, Hempel, 2024).
They succeeded in achieving a balance through
incorporating multiple features and attention
mechanisms. Xie, Z., Li, J. and Shi, H. focused on the
influence of the increasing number of neural and
feature maps (Xie, Li, Shi, 2019). They used Python
with Keras methods to test CNN in face recognition
and gained accuracy close to 100%. Liu, Y. and Qu,
Y. combined multiple algorithms to build an
improved multitask face recognition CNN (Liu, Qu,
2024). They reached an accuracy of 99.05% in feature
matching.
Du, A., Zhou, Q., and Dai, Y. utilized Intersection
over Union (IoU) to quantify the ratio of task-relevant
features and evaluate the generalization ability of
their Residual Networks (ResNet) model set (Du,
Zhou, Dai, 2024). The results show that ResNet is a