Authors:
Javier Borau-Bernad
1
;
Álvaro Ramajo-Ballester
1
;
José María Armingol Moreno
1
and
Araceli Sanchis de Miguel
2
Affiliations:
1
Intelligent Systems Lab, University Carlos III of Madrid, Av. de la Universidad, 30, Leganés, 28911, Madrid, Spain
;
2
Control Learning and Systems Optimization Group, University Carlos III of Madrid, Av. de la Universidad, 30, Leganés, 28911, Madrid, Spain
Keyword(s):
3D Object Detection, Intelligent Infrastructures, Smart Cities, Autonomous Driving, Deep Learning.
Abstract:
As smart cities continue to develop, they require scalable and efficient traffic monitoring systems. This paper presents a modular detection system that switches between monocular and multimodal modes, depending on the available sensors. The monocular mode, based on the MonoLSS algorithm, offers a cost-effective vehicle detection solution using a single camera, ideal for simpler or low-budget setups. In contrast, the multimodal mode integrates camera and LiDAR data via the MVX-Net model, enhancing 3D accuracy in complex traffic scenarios. This dual-mode flexibility allows smart cities to adapt the system to their infrastructure and budgetary needs, ensuring scalability as urban demands evolve. Inference results demonstrate the superior accuracy of the multimodal approach in challenging environments while validating the efficiency of the monocular mode for simpler settings. Therefore, the modular detection system offers a flexible solution that optimizes both cost and performance, eff
ectively addressing the varied requirements of smart city traffic management.
(More)