2 RESEARCH METHODOLOGY
2.1 Research Area
To develop an optimized YOLOv5s model for road
defect detection, we employed a structured
methodology that includes data collection,
preprocessing, model optimization, training, and
evaluation. The primary goal is to enhance the
performance of YOLOv5s using pruning, pooling,
and attention mechanisms while ensuring real- time
efficiency. This methodology ensures a systematic
approach to achieving high- accuracy defect detection
with reduced computational complexity.
2.1.1 Data Collection and Preprocessing
The first step in the research involved collecting a
diverse dataset of road defects, including potholes,
cracks, ruts, and surface irregularities. Publicly
available road defect datasets, such as the
CRACK500 and Road Damage Detection Dataset
(RDD), were utilized.
2.1.2 Model Optimization: Pruning, Pooling,
and Attention Mechanisms
To improve the efficiency of YOLOv7s, we applied
three key optimization techniques. Pruning was used
to remove redundant parameters, reducing model size
and enhancing inference speed. Pooling layers were
integrated to improve feature extraction by capturing
essential details at multiple scales.
2.1.3 Model Training and Hyperparameter
Tuning
The optimized YOLOv5s model was trained using
transfer learning with pre-trained weights from the
COCO dataset. Training was conducted on high-
performance GPUs using a learning rate scheduler,
adaptive momentum optimization, and focal loss
function to handle class imbalance in defect
detection. Hyperparameters such as batch size,
learning rate, and IoU threshold were fine-tuned to
maximize detection performance.
2.1.4 Evaluation Metrics and Performance
Analysis
To measure the effectiveness of the optimized model,
performance was evaluated using precision, recall,
mean average precision, and inference speed (FPS).
Themodel was tested on real-world road images and
benchmark datasets to ensure generalization.
Comparative analysis was performed against existing
state-of-the-art road defect detection models to
highlight improvements in accuracy and efficiency.
2.1.5 Deployment and Real-Time
Implementation
After training and evaluation, the optimized model
was deployed in a real-time road monitoring system.
Edge devices, such as NVIDIA Jetson Nano and
Raspberry Pi, were used to test inference speed and
practical usability. The model was integrated into an
IoT-based road monitoring framework, where
detected defects were logged in a cloud-based system
for analysis and maintenance scheduling. This real-
time implementation validated the model’s efficiency
in detecting road defects with minimal computational
resources, making it suitable for large-scale
deployment in smart city infrastructure.
2.2 Research Area
2.2.1 Road Infrastructure and Maintenance
Road maintenance is a crucial aspect of transportation
safety and efficiency. Detecting road defects early
helps prevent accidents, reduces vehicle maintenance
costs, and ensures longer infrastructure lifespan. This
research contributes to improving automated road
inspection, reducing manual effort and associated
costs.
2.2.2 Deep Learning and Computer Vision
The integration of deep learning and computer vision
in road defect detection has significantly enhanced
detection accuracy and efficiency. This research
focuses on optimizing YOLOv7s, a state-of-the-art
object detection model, by incorporating techniques
such
as
pruning,
pooling,
and attention
mechanisms. These enhancements improve feature
extraction and model efficiency, making deep
learning-based road monitoring systems more
reliable.
2.2.3 IoT-Based Smart Transportation
Systems
Modern transportation systems are increasingly
adopting IoT-based monitoring solutions. This study
explores how the optimized YOLOv7s model can be
deployed on edge devices to enable real-time road
defect detection. By integrating IoT with computer
vision, road authorities can receive instant alerts about
road conditions, allowing timely interventions and