detection with advantages. Lin et al. proposed
RetinaNet and solved the category imbalance
problem in traditional target detection methods by
Focal Loss (Lin et al., 2017). Wei et al. applied
RetinaNet to smart driving scenarios and proposed a
deep learning-based small object detection method to
optimize detection accuracy in low-light
environments (Wei et al., 2020).
4. EXISTING LIMITATIONS AND
FUTURE PROSPECTS
Although existing target detection techniques
have made significant progress in many aspects, they
still face some challenges and limitations. First,
existing target detection methods are less robust in
complex environments. For example, in bad weather
(e.g., rain, haze) or poor lighting conditions, the
image quality of sensors (especially cameras)
degrades significantly, resulting in lower target
detection accuracy. Although LiDAR and millimeter-
wave radar can provide better detection performance
in bad weather, their relatively low resolution and
accuracy cannot yet fully replace visual perception. In
addition, the detection of small objects is still a
difficult problem. Especially in complex
backgrounds, the detection of small objects (e.g.,
pedestrians, low obstacles, etc.) is not ideal. Finally,
target detection algorithms usually need to find a
balance between accuracy and real-time performance.
High-precision models usually require more
computational resources, resulting in slower
detection; while models pursuing speed often
sacrifice accuracy. Therefore, how to balance
accuracy and real-time is still an important issue in
autonomous driving platforms with limited
computational resources.
In the future, target detection techniques are
expected to overcome these limitations through
multimodal data fusion. Multimodal fusion refers to
the combination of different types of sensors, such as
cameras, lidar, millimeter-wave radar, etc., to make
up for the shortcomings of a single sensor. For
example, the combination of visual sensors and radar
can effectively deal with the detection problem under
adverse weather conditions. In addition, unsupervised
learning, as an emerging learning method, does not
rely on manually labeled data but learns through the
structure or contextual information of the data itself.
Unsupervised learning can reduce the need for large-
scale labeled data, thus accelerating model training
and improving model adaptability in new scenarios.
Hardware acceleration and model lightweight are also
directions for future research. With the popularity of
hardware platforms such as GPUs and TPUs, as well
as the development of model optimization techniques
such as quantization and pruning, the target detection
model will be more efficient and adapt to the
application scenarios of low-power devices and
embedded systems.
5. CONCLUSION
This paper summarizes the development status
and challenges of intelligent driving target detection
technology, and deeply analyzes the key factors
affecting the detection effect. With the rapid
development of automatic driving technology, target
detection has become one of the indispensable core
technologies in intelligent driving systems. This
paper analyzes the advantages and limitations of these
methods in practical applications, especially their
adaptability in automatic driving scenarios, through a
detailed discussion of the YOLO series and other
classical target detection methods. The YOLO series
methods, by virtue of their efficient real-time
detection capability and good accuracy, have become
one of the mainstream target detection algorithms that
are widely used in the field of automatic driving at
present.
With the continuous progress of sensor
technology, deep learning algorithms and
computational power, the target detection technology
is expected to make greater breakthroughs in
accuracy, real-time and robustness. Meanwhile, the
optimization of deep learning algorithms, especially
the combination of cutting-edge technologies such as
CNN and Reinforcement Learning (RL), may lead to
significant improvements in the accuracy and
efficiency of target detection. With the continuous
progress of these technologies, the intelligent driving
system will gain greater improvement in perception
capability, thus promoting the rapid development and
commercialization of automatic driving technology.
REFERENCES
Bochkovskiy, A., Wang, C. Y., & Liao, H. Y. M. 2020.
YOLOv4: Optimal speed and accuracy of object
detection. arXiv preprint arXiv:2004.10934.
Girshick, R., Donahue, J., Darrell, T., & Malik, J. 2015.
Region-based convolutional networks for accurate
object detection and segmentation. IEEE Transactions