recognition is validated by high benchmark datasets
equivalent like LFW that show very high accuracy for
recognition experiments.
3.2 Schroff, F., Kalenichenko, D., &
Philbin, J: (2015)
Title: FaceNet: A Unified Embedding for Face
Recognition and Clustering.
Abstract: Nowadays, FaceNet introduces a pure end-
to-end deep learning model which maps a given
person's face images directly to a compact Euclidean
space in which distances correspond to how similar
the faces seem to be. Instead of a classifier, FaceNet
learns to project all facial images into the feature
space through a triplet loss function to ensure that
photos of the same person's face appear closer while
having maximum distance with the different ones. It
achieves very high accuracy in face verification and
clustering tasks, where it overcomes many of the
traditional methods which were feature-based and
hand-crafted. The research demonstrated a scalable
and efficient face recognition of real-world
applications with deep neural networks.
3.3 Taigman, Y., Yang, M., Ranzato,
M., & Wolf, L: (2014)
Title: DeepFace: Closing the Gap to Human-Level
Performance in Face Verification.
Abstract: DeepFace introduces a deep learning-
driven facial recognition model that comes close to
human-level accuracy. The authors use a deep neural
network that is trained on a massive set of labeled face
images to enhance facial feature extraction and
representation. Using 3D face alignment methods and
deep network structures, the system greatly enhances
the robustness of recognition in different poses and
illumination conditions. The model exceeds 97%
accuracy on benchmark data and poses a new
standard for deep learning-based biometrics. The
work highlights the efficacy of deep learning in
closing the gap between the facial recognition
capability of machines and humans.
3.4 He, K., Zhang, X., Ren, S., & Sun,
J: (2016)
Title: Deep Residual Learning for Image
Recognition (ResNet).
Abstract: As the present paper does not restrict itself
to face recognition, it introduces the structure of
ResNet that is almost entirely based on a deep
learning model and scales image classification tasks.
Deep residual learning is suggested by the authors to
assist with the issue of the vanishing gradient and to
allow training deep networks at an extremely high
level. This factor of skip connection, induced by
ResNet, improves feature extraction and
generalization, thus becoming one of the most widely
preferred backbones among the latest face
recognition models. This paper encompasses various
ways in which deep residual networks improve
accuracy and efficiency on complex vision tasks in
areas such as face recognition and feature mapping.
3.5 Cao, Q., Shen, L., Xie, W., Parkhi,
O: M., & Zisserman, A - (2018)
Title: VGGFace2: A Dataset for Recognizing
Faces across Age, Pose, and Illumination.
Abstract: Presented is a large dataset for facial
recognition against variation in age, pose, and
illumination. The principal idea is to strengthen deep-
learning face-recognition models by introducing
large binomial variations in pose, age, and lighting
conditions. The authors train a CNN-based face-
recognition model on the dataset in an attempt to
achieve improved generalization on real applications.
The work demonstrates the importance of having
such robust datasets in constructing efficient face-
verification systems. Further, the paper ends with
benchmark analyses and comparisons among several
CNN architectures, strengthening the need for high-
quality training data for pattern recognition.
4 EXISTING SYSTEM
Facial recognition systems have evolved throughout
the ages, utilizing classical image-processing
methods but now mostly favoring deep-learning
paradigms. The existent methods of facial recognition
deal with several aspects relating to feature
extraction, classification, and matching of images to
identify a particular individual.
4.1 Traditional Methods
The older facial recognition systems relied on
handcrafted feature-extraction techniques such as
Eigenfaces, Fisher faces, and LBP1. They aimed at
statistically analyzing the facial structure but suffered
from variations in lighting, posing, and occlusions,