Furthermore, the RPN, or regional proposal network,
is not very sensitive to the identification of small and
dense objects because the candidate region it
generates is typically huge. In contrast, Yolov3 works
much better. At the same time, the two stages of the
Faster R-CNN network require a large amount of
data, so the training of Faster R-CNN is relatively
difficult.
4 CHALLENGES AND FUTURE
DEVELOPMENT
The performance of the algorithm may be affected
when processing images under extreme weather and
illumination conditions, and the detection effect of
some specific types of small targets still needs to be
improved. Further research can be carried out to
improve and optimize the above limiting factors to
further improve the practicality and robustness of the
big data target detection algorithm. When working
with a high number of candidate regions, the R-CNN
series technique requires a lot of processing, which
leads to inadequate real-time performance (Zou,
Chen, Shi, 2023). The YOLO algorithm's accuracy
has to be increased while working with small, dense
targets. Therefore, how to balance the detection speed
and accuracy is still a problem to be solved. In
addition, how to deal with scale change and target
occlusion still to be further studied.
From conventional manual feature approaches to
deep learning-based CNN and Transformer
architectures, object identification technology has
advanced tremendously in the field of computer
vision. Accuracy and real-time detection have also
increased significantly. Nevertheless, challenges such
as small target detection, domain adaptation, and
adversarial attacks remain. In the future, multi-modal
fusion, self-supervised learning and open-world
object detection will provide new impetus for the
development of object detection technology.
5 CONCLUSIONS
This study compares the performance of two well-
known deep learning object detection frameworks,
Faster R-CNN and YOLO-v3, by examining the
evolution of object recognition technology from the
manual, feature-based approach to the deep learning-
based convolutional neural network (CNN)
architecture. To assist readers in understanding the
advancement of technology in this area, a detailed
summary of the object detection technology's
evolution from the old method to the deep learning
method is provided. By reviewing the traditional
methods such as HOG and Haar, we reveal their
advantages in computational efficiency and real-time
performance, and point out their limitations in
complex scenarios. Next, two common deep learning
object identification frameworks — YOLO-v3 and
Faster R-CNN — are thoroughly examined and
contrasted. The experimental results show that
YOLO-v3 has a slightly higher average accuracy
(mAP) on the COCO dataset than Faster R-CNN, and
performs better in small target detection and dense
scenes, while Faster R-CNN still has some
advantages in overall accuracy. This comparative
analysis provides a valuable reference for researchers
and helps to choose a suitable detection framework
for practical applications. This paper not only
summarizes the current status of object detection
technology, but also points out the future research
direction and challenges. For example, how to
improve the robustness of the algorithm under
extreme weather and lighting conditions, how to
optimize the detection accuracy of small target
detection and dense scenes, and how to balance the
detection speed and accuracy. These research
directions provide new ideas and impetus for the
further development of object detection technology.
In conclusion, this paper not only provides valuable
reference and inspiration for researchers in this field,
but also provides an important basis for technology
selection and optimization in practical application.
The findings of this study will support the
advancement of target detection technology and
enhance its viability and resilience across a range of
application domains.
REFERENCES
Dalal, N., Triggs, B., 2005. Histograms of oriented
gradients for human detection. In 2005 IEEE Computer
Society Conference on Computer Vision and Pattern
Recognition (CVPR'05), San Diego, CA, USA, pp.
886-893 vol. 1
GUO, H., 2024. Object detection: From traditional methods
to deep learning. Emerging Science and Technology,
3(2), pp.128-145.
Liu, L., Ouyang, W., Wang, X. et al., 2020. Deep Learning
for Generic Object Detection: A Survey. International
Journal of Computer Vision, 128, pp.261–318.
https://doi.org/10.1007/s11263-019-01247-4
Liu, L., Ouyang, W., Wang, X., Fieguth, P.W., Chen, J.,
Liu, X., & Pietikäinen, M., 2018. Deep Learning for