Paper

Authors: Andrei-Stelian Stan, Dan Popescu and Loretta Ichim

Affiliation: National University of Science and Technology POLITEHNICA Bucharest, Bucharest, Romania

Keyword(s): Neural Networks, Person Detection, Unmanned Aerial Vehicles, Detection Transformer, Vision Transformer.

Abstract: The study introduces a novel object detection system that combines the strengths of two advanced deep learning models, the Detection Transformer (DETR) and the Vision Transformer (ViT), to enhance detection accuracy and robustness in unmanned aerial vehicle (UAV) applications. Both models were independently fine-tuned on the VisDrone dataset and then deployed in parallel, each processing the same input to leverage their respective advantages. DETR provides precise localization, which is particularly effective in crowded urban settings, while ViT excels at identifying objects at various scales and under partial occlusion, which is crucial for detecting distant objects. Their outputs are combined by a dynamic fusion algorithm that adjusts confidence scores based on contextual analysis and the characteristics of the detected objects, resulting in a combined detection system that outperforms the individual models. The fused model significantly improved overall accuracy, achieving up to 90%, with a mean Average Precision (mAP50) of 85% and a recall of 80%. These results underline the potential of integrating multiple transformer-based models to handle the complexities of UAV-based detection tasks, offering a robust solution that adapts to diverse operational scenarios and environmental conditions.
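
The abstract does not spell out the fusion rule, so the following is only a minimal sketch of how the outputs of two detectors run in parallel might be merged. Everything in it is an assumption made for illustration and is not taken from the paper: the `Detection` type, the `fuse` function, the IoU threshold of 0.5, the noisy-OR confidence boost for boxes found by both models, and the `solo_penalty` applied to boxes found by only one model.

```python
from dataclasses import dataclass


@dataclass
class Detection:
    box: tuple   # (x1, y1, x2, y2) in pixels
    score: float  # confidence in [0, 1]


def iou(a, b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def fuse(dets_a, dets_b, iou_thr=0.5, solo_penalty=0.8):
    """Merge two detection lists (hypothetical rule, not the paper's).

    Boxes matched across models (IoU >= iou_thr) are averaged, weighted by
    score, and their confidence is raised with a noisy-OR. Boxes seen by a
    single model are kept with their confidence scaled by solo_penalty.
    """
    fused, used_b = [], set()
    for da in dets_a:
        # Greedy search for the best still-unmatched box from the second model.
        best_j, best_iou = -1, iou_thr
        for j, db in enumerate(dets_b):
            if j in used_b:
                continue
            overlap = iou(da.box, db.box)
            if overlap >= best_iou:
                best_j, best_iou = j, overlap
        if best_j >= 0:
            db = dets_b[best_j]
            used_b.add(best_j)
            total = da.score + db.score
            wa = da.score / total if total > 0 else 0.5
            box = tuple(wa * p + (1 - wa) * q for p, q in zip(da.box, db.box))
            score = 1 - (1 - da.score) * (1 - db.score)  # agreement boost
            fused.append(Detection(box, score))
        else:
            fused.append(Detection(da.box, da.score * solo_penalty))
    # Keep boxes only the second model produced, with reduced confidence.
    for j, db in enumerate(dets_b):
        if j not in used_b:
            fused.append(Detection(db.box, db.score * solo_penalty))
    return sorted(fused, key=lambda d: d.score, reverse=True)


# Stand-in outputs for the two models on the same frame.
detr_out = [Detection((0, 0, 10, 10), 0.8), Detection((50, 50, 60, 60), 0.5)]
vit_out = [Detection((1, 1, 11, 11), 0.6)]
result = fuse(detr_out, vit_out)
```

In this sketch, agreement between the two models raises confidence and disagreement lowers it. A context-aware scheme like the one the abstract describes would presumably replace the fixed `solo_penalty` with a term that depends on object size, scene density, or similar cues.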

CC BY-NC-ND 4.0

Paper citation in several formats:
Stan, A.-S., Popescu, D. and Ichim, L. (2025). Person Detection from UAV Based on a Dual Transformer Approach. In Proceedings of the 11th International Conference on Geographical Information Systems Theory, Applications and Management - GISTAM; ISBN 978-989-758-741-2; ISSN 2184-500X, SciTePress, pages 95-102. DOI: 10.5220/0013467900003935

@conference{gistam25,
author={Andrei{-}Stelian Stan and Dan Popescu and Loretta Ichim},
title={Person Detection from UAV Based on a Dual Transformer Approach},
booktitle={Proceedings of the 11th International Conference on Geographical Information Systems Theory, Applications and Management - GISTAM},
year={2025},
pages={95-102},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013467900003935},
isbn={978-989-758-741-2},
issn={2184-500X},
}

TY - CONF

JO - Proceedings of the 11th International Conference on Geographical Information Systems Theory, Applications and Management - GISTAM
TI - Person Detection from UAV Based on a Dual Transformer Approach
SN - 978-989-758-741-2
IS - 2184-500X
AU - Stan, A.
AU - Popescu, D.
AU - Ichim, L.
PY - 2025
SP - 95
EP - 102
DO - 10.5220/0013467900003935
PB - SciTePress