Detection of Imaged Objects with Estimated Scales

Xuesong Li, Ngaiming Kwok, Jose E. Guivant, Karan Narula, Ruowei Li, Hongkun Wu

2019

Abstract

Dealing with multiple sizes of the object in the image has always been a challenge in object detection. Predefined multi-size anchors are usually adopted to address this issue, but they can only accommodate a limited number of object scales and aspect ratios. To cover a wider multi-size variation, we propose a detection method that utilizes depth information to estimate the size of anchors. To be more specific, a general 3D shape is selected, for each class of objects, that represents different sizes of 2D bounding boxes in the image according to the corresponding object depths. Given these 2D bounding boxes, a neural network is used to classify them into different categories and do the regression to obtain more accurate 2D bounding boxes. The KITTI benchmark dataset is used to validate the proposed approach. Compared with the detection method using pre-defined anchors, the proposed method has achieved a significant improvement in detection accuracy.

Download


Paper Citation


in Harvard Style

Li X., Kwok N., Guivant J., Narula K., Li R. and Wu H. (2019). Detection of Imaged Objects with Estimated Scales. In Proceedings of the 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2019) - Volume 5: VISAPP; ISBN 978-989-758-354-4, SciTePress, pages 39-47. DOI: 10.5220/0007353600390047


in Bibtex Style

@conference{visapp19,
author={Xuesong Li and Ngaiming Kwok and Jose E. Guivant and Karan Narula and Ruowei Li and Hongkun Wu},
title={Detection of Imaged Objects with Estimated Scales},
booktitle={Proceedings of the 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2019) - Volume 5: VISAPP},
year={2019},
pages={39-47},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007353600390047},
isbn={978-989-758-354-4},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2019) - Volume 5: VISAPP
TI - Detection of Imaged Objects with Estimated Scales
SN - 978-989-758-354-4
AU - Li X.
AU - Kwok N.
AU - Guivant J.
AU - Narula K.
AU - Li R.
AU - Wu H.
PY - 2019
SP - 39
EP - 47
DO - 10.5220/0007353600390047
PB - SciTePress