
Automated proctoring system using computer vi-
sion techniques (Maniar et al., 2021) is a proctoring
system. The system has features of eye gaze track-
ing, mouth detection, object detection, head posture
estimation via facial landmark detection. Further, the
system transcribes the audio recording of the student
voice into text using the Google speech recognition
API, to detect conversations. The techniques used
mostly are: biometric recognition (face recognition,
eye gaze tracking, mouth detection, head pose esti-
mation), object detection, cloud storage. The sys-
tem used dlib’s pre-trained network for real-time eye
tracking and head pose detection. Challenges include
accurate eye tracking and head pose estimation. Al-
though the system alerts multiple persons, it does
not provide continuous identity verification. The fu-
ture upgrades can also include the detection of other
devices and continuous student authentication.
ProctoXpert–An AI Based Online Proctoring Sys-
tem (Chougule et al., 2024) outlines a proctoring sys-
tem using AI for face recognition, audio detection,
and biometric authentication with live monitoring for
anomaly detection. It supports post-exam review with
encrypted data for security. The key methodologies
include AI-based biometric recognition, misbehav-
ior detection, and facial/audio analysis using Convo-
lutional Neural Networks (CNNs) for tracking head
position and eye movement to monitor the student’s
screen view. While the system benefits from AI-
driven accuracy, challenges remain in detecting light-
ing conditions and background noise. Future en-
hancements may focus on identifying device use and
improving additional monitoring data accuracy.
An incremental training on deep learning face
recognition for M-Learning online exam proctoring
(Ganidisastra and Bandung, 2021) elaborates on the
accuracy issues that exist in today’s proc- toring-
based face recognition systems, and more specifically
about the system’s performance during the illumina-
tion change and minimal changes to the facial struc-
ture. The paper points out the pros and cons of these
techniques by testing four face detectors; including
Haar-cascade, LBP, MTCNN, and YOLO- face, along
with testing one Facenet model. A deep learning-
based face detector is better than the earlier methods.
The Facenet model, which was batch trained, resulted
in 98% accuracy but, in this incremental training ver-
sion, the dataset is quite smaller. Viola-Jones/Haar-
cascade : A traditional approach, based on integral
images and Adaboost classifiers but poses and light-
ing variations affect significantly. Local Binary Pat-
tern (LBP) : Makes local texture de-scriptions but
is posing and lighting challenging. Multi-Task Cas-
caded CNN (MTCNN) : This has three stages of
CNN to efficiently generate candidate windows and
enhance face detection with the preservation of real-
time performance YOLO-face : Deep learning algo-
rithm based on YOLOv3; specially optimized for high
precision face detection with very rapid speed in de-
tection.
Multiple instance learning for cheating detection
and localization in online examinations (Liu et al.,
2024) presents CHEESE, an advanced cheating de-
tection framework utilizing multiple instance learn-
ing to improve upon existing proctoring systems. It
integrates body posture, background, eye gaze, head
posture, and facial features through a spatio-temporal
graph module, achieving an AUC score of 87.58 on
three datasets. Key challenges addressed include
video anomaly detection, with the Multiscale Tempo-
ral Network (MTN) capturing temporal dependencies
in video clips. CHEESE consists of a label genera-
tor that produces anomaly scores using multiple in-
stance learning and a feature encoder enhanced by
C3D or I3D models, incorporating a self-guided at-
tention module. This framework combines features
from OpenFace 2.0 and C3D/I3D to analyze can-
didates’ behavior holistically. Future enhancements
may focus on detecting unauthorized devices used
during exams and generating alerts.
A Systematic Review of Deep Learning Based
Online Exam Proctoring Systems for Abnormal
Student Behaviour Detection(Abbas and Hameed,
2022)discusses the challenges of ensuring academic
integrity in online exams, highlighting the need for
advanced technologies like deep learning to moni-
tor abnormal student behavior. It investigates the ef-
fectiveness of deep learning algorithms in detecting
cheating during assessments through a systematic lit-
erature review of 137 publications, narrowing down
to 41 relevant studies on AI-based proctoring systems.
The research identifies a key limitation: existing proc-
toring systems cannot fully prevent all cheating meth-
ods due to their evolving nature, necessitating ongo-
ing improvements. Benefits of deep learning include
enhanced detection capabilities and better exam se-
curity, while drawbacks involve technical challenges
and privacy concerns
AI-based online proctor- ing system (Aurelia
et al., 2024) addresses the need for proper online as-
sessment methods that result in a rise of AI-powered
proctoring services. The proposed work here is on de-
veloping a visual proctoring system based on webcam
input, with emotion detection, head movement detec-
tion, and unauthorized objects that occur during ex-
ams using facial expression recognition with convo-
lutional neural networks. The system improves mon-
itoring and automates suspicious behavior detection
INCOFT 2025 - International Conference on Futuristic Technology
230