detect cracks on the bridge deck. The model
demonstrates exceptional performance, with an
accuracy rate of 92.38 percent. Popli et al. (Popli,
Kansal, et al. , 2023) Suggested the use of a robotic
model called Xception to detect cracks in roads. The
newly proposed model achieves an accuracy of 90%.
Tabernik et al. (Tabernik, Šuc, et al. , 2023)
proposed a novel paradigm, SegDecNet++, to
streamline quality control during building and
maintenance processes. The proposed model obtained
a dice score of 81%. Pham et al. (Pham, Ha, et al. ,
2023) concluded the research using Ostu Method to
detect ground cracks along with their length and
width and the accuracy recorded was 86.7% to 99.9%.
Yadav et al. (Yadav, Sharma, et al. , 2024) suggested
a new Convolutional Neural Network based model,
HCTNet to identify cracks in roads ensuring
sustainable road safety. The model achieved the F1
score of 97.20%.
Sun et al. (Sun, Yang, et al. , 2021) suggested
conducting research utilizing the DeepLabV3+ model
to accurately identify cracks and bugholes on the
surface of the concrete. The model attained an
impressive outcome, with a mean average precision
of 95.58%.
Lin et al. (Lin, Li, et al. , 2023) suggested a model,
DeepCrackAt based on the encoder-decoder network
for crack segmentation and recorded an accuracy of
97.41%. Karimi et al. (Karimi, Mishra, et al. , 2024)
proposed the implementation of a YOLO (You Only
Look Only) deep learning model to diagnose damage
in tiles. The CDNet model attained an accuracy rate
of 72%.
Katsigiannis et al. (Katsigiannis, Seyedzadeh, et
al. , 2023) suggested a deep learning-based model,
MobileNetV2 to diagnose the cracks in brickwork
masonry and achieved the F1 score of 100%. Yadav
et al. (Yadav, Prasad, et al. , 2024) proposed the
CCTNet to improve the precision of crack detection
in structures. The model recorded a precision of
99.33% for the proposed dataset. Tasci et al. (Tasci,
Acharya, et al. , 2023) developed a new architecture
inception and concatenation residual (InCR) to
identify damaged buildings. The model outperforms
other models by showing an accuracy of 99.82%.
Reis et al. (Reis, Turk, et al. , 2024) created a
combination model of ResNet152 +SVM to
recognize the cracks in roads after the earthquake.
The hybrid model had the highest level of success,
with accuracy values of 98.68%. Zheng et al. (Zheng,
Lei, et al. , 2020) suggested three deep learning
models out of which RFCN (Richer Fully
Convolutional Network) showed the best results for
identifying the fractures in buildings with a recorded
accuracy as 91%. Akgul et al. (Akgül, 2023)
introduced a novel fusion model called Mobile-
DenseNet for accurately recognizing cracks that
appear on the surface of the concrete. The fusion
model achieved a success percentage of 99.87%.
Roy et al. (Roy, Kukreja, et al. , 2023) developed a
hybrid model of a deep CNN to detect the intensity of
defects in painting in heritage buildings. The hybrid
model resulted in an accuracy of 84.23%. Joshi et al.
(Joshi, Singh, et al. , 2022) created a deep-learning
model to identify surface cracks or defects in various
structures. The model achieved an average precision
of 93.445% in its predictions. ABDELLAOUI et al.
(ABDELLAOUI, Errousso, et al. , 2024) concluded
the study by considering the VGG-based learning
model as the most superior for detecting cracks in the
pavement as the model records an accuracy of 86.5%.
3 ARCHITECTURE OF THE
MODEL
There are several issues with the earlier models that
used deep learning methods that need to be fixed.
Based on the previously employed ResNext and
VGG16 models, a new model is built in order to
optimize the performance of the crack detecting
model through deep learning. This study developed a
CDNet (Crack Detection network) sequential model
for diagnosing building cracks. In this model, 5
convolution blocks.
The sizes of the convolutional
layers, conv1, conv2, conv3, conv4, and conv5, are
16, 32, 64, 128, and 256, respectively. The model
consists of four different stages as shown in Figure 1.
Figure 1: Stages During Crack Detection
3.1 Working of the CDNet model
A 2D convolutional layer processes an input image or
feature map by applying several filters, extracting