YOLOv7 (Zhang, 2024), real-time monitoring of
students' behavior in the classroom is conducted to
promptly detect and correct negative behaviors
(Redmon& Farhadi, 2018), such as lack of
concentration and playfulness. By identifying
students' behavior, objective data about classroom
atmosphere and teaching effectiveness can be
provided to teachers, helping them adjust teaching
strategies and improve classroom quality. Analyze
students' behavior patterns, understand their learning
habits and needs, and provide personalized learning
support and guidance for them.
Collect and analyze student behavior data to
provide data support for school management and
educational decision-making, and promote the
scientific and refined management of education.
Explore the application of YOLOv7 in the field of
education, promote the development of computer
vision technology in educational research, and
provide new ideas and methods for research in related
fields.
Classification and Monitoring of Behaviors in
Educational Environments:
Classification and monitoring of behaviors have
become a crucial aspect in the field of educational
technology, and YOLOv7 has emerged as a powerful
tool for achieving this goal. Numerous studies have
utilized the capabilities of YOLOv7 to classify and
detect students' behaviors in classroom settings (Li,
2025). Typically, these studies employ datasets
containing multiple behavior labels to conduct
comprehensive training of the models. The main
focus of these tasks lies in designing a reasonable and
comprehensive labeling system and implementing
effective data augmentation strategies to enhance the
robustness of the models.
Construction of Behavior Recognition Datasets:
A fundamental aspect of any research work in this
field is the construction of high-quality behavior
recognition datasets. These datasets serve as the
cornerstone for training and evaluating models.
Recognizing this, some researchers have embarked
on ambitious projects aimed at collecting behavior
data in diverse classroom environments. These efforts
aim to provide a comprehensive and representative
datasets for the development and improvement of
behavior recognition models.
Real-time Application of YOLOv7 in
Classrooms:
Thanks to the real-time detection capabilities of
YOLOv7, its potential for application in real
classroom environments has attracted widespread
attention. Researchers are currently exploring
methods to integrate YOLOv7 into classroom
management systems, with the ultimate goal of
enhancing teaching effectiveness and student
engagement. Additionally, the real-time feedback
provided by YOLOv7 is invaluable to teachers,
enabling them to adjust teaching strategies in real
time based on students' behavior and engagement
levels (Wang et al., 2022).
3 RESEARCH METHOD
The aim of this experiment is to use the YOLOv7
object detection algorithm(Zhang et al. , 2021) to
achieve image recognition of three typical behaviors
among students in the classroom: "raising hands",
"reading", and "writing". The research content mainly
includes the following aspects:
Dataset construction: Collect and annotate
classroom images containing behaviors such as
"raising hands," "reading," and "writing," and
construct high-quality training and validation
datasets.
Model training: Use YOLOv7 algorithm to train
the dataset, optimize model parameters, and improve
the accuracy and robustness of behavior recognition
(Zhang & Li, 2025).
Performance evaluation: Evaluate the trained
model through a test set and analyze its recognition
performance in classroom scenarios, including
metrics such as accuracy, recall, and real-time
performance.
Application validation: Deploy the model in an
actual classroom environment to verify its feasibility
and practicality in practical applications.
The research objective is to develop an efficient
and accurate student classroom behavior recognition
program, providing technical support for intelligent
classroom management, helping teachers to grasp
students' learning status in real time, optimize
teaching management strategies, and improve
classroom efficiency and learning outcomes.
Created by Chengdu Neusoft College, it contains
5686 images and 45578 tags, covering six behaviors:
raising hands, reading, writing, using mobile phones,
lowering heads, and lying on the table. This
experiment only tests three behaviors: raising hands,
reading, and writing. The dataset covers different
scenarios from kindergarten to university, and was
evaluated using the YOLOv7 algorithm with an
average accuracy of 80.3%, as shown in Figure 1.
This dataset aims to provide a solid foundation for
research on student behavior detection.
Original address: https://github.com/Whiffe
/SCB-dataset?tab=readme -ov-file