Robust People Detection and Tracking from an Overhead Time-of-Flight Camera

Alvaro Fernandez-Rincon, David Fuentes-Jimenez, Cristina Losada-Gutierrez, Marta Marron-Romera, Carlos A. Luna, Javier Macias-Guarasa, Manuel Mazo


In this paper we describe a system for robust detection of people in a scene using an overhead Time-of-Flight (ToF) camera. The proposal addresses the problem by three means: a carefully designed algorithm that selects regions of interest as candidates to belong to people; the generation of a robust feature vector that efficiently models the human upper body; and a people classification stage that discriminates between people and other objects in the scene. The proposal also includes a particle filter tracker for people identification and tracking. Two classifiers are evaluated, one based on Principal Component Analysis (PCA) and one based on Support Vector Machines (SVM). The evaluation is carried out on a subset of a carefully designed dataset covering a broad variety of conditions. It compares the PCA and SVM approaches and measures the performance impact of the tracker, with satisfactory results.
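
The abstract names a particle filter tracker but gives no details of its design. As an illustration only, the following is a minimal sketch of a generic bootstrap particle filter for a single 2-D position, assuming a random-walk motion model and a Gaussian measurement likelihood. The function names, parameters and noise values are hypothetical and are not taken from the paper.

```python
import math
import random


def particle_filter_step(particles, measurement, rng,
                         motion_std=0.1, meas_std=0.2):
    """Run one predict / update / resample cycle.

    particles:   list of (x, y) position hypotheses, in metres
    measurement: (x, y) position reported by a detector for this frame
    rng:         a random.Random instance (keeps the run reproducible)
    motion_std, meas_std: assumed noise levels, chosen for illustration
    """
    # Predict: diffuse each particle with a random-walk motion model.
    predicted = [(x + rng.gauss(0.0, motion_std),
                  y + rng.gauss(0.0, motion_std))
                 for x, y in particles]

    # Update: weight each particle by a Gaussian likelihood of the measurement.
    mx, my = measurement
    weights = [math.exp(-((x - mx) ** 2 + (y - my) ** 2)
                        / (2.0 * meas_std ** 2))
               for x, y in predicted]

    # If every weight underflowed to zero, fall back to uniform weights
    # so that resampling is still well defined.
    if sum(weights) == 0.0:
        weights = [1.0] * len(predicted)

    # Resample: draw a new particle set in proportion to the weights.
    return rng.choices(predicted, weights=weights, k=len(predicted))


def estimate(particles):
    """Return the mean particle position as the state estimate."""
    n = len(particles)
    return (sum(x for x, _ in particles) / n,
            sum(y for _, y in particles) / n)
```

A caller would initialise the particles over the monitored area, call `particle_filter_step` once per frame with the detected position, and read the track position from `estimate`. Tracking several people at once additionally requires associating detections with tracks, which this sketch does not cover.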



Paper Citation

in Harvard Style

Fernandez-Rincon A., Fuentes-Jimenez D., Losada-Gutierrez C., Marron-Romera M., Luna C., Macias-Guarasa J. and Mazo M. (2017). Robust People Detection and Tracking from an Overhead Time-of-Flight Camera. In Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP, (VISIGRAPP 2017) ISBN 978-989-758-225-7, pages 556-564. DOI: 10.5220/0006169905560564

in Bibtex Style

author={Alvaro Fernandez-Rincon and David Fuentes-Jimenez and Cristina Losada-Gutierrez and Marta Marron-Romera and Carlos A. Luna and Javier Macias-Guarasa and Manuel Mazo},
title={Robust People Detection and Tracking from an Overhead Time-of-Flight Camera},
booktitle={Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP, (VISIGRAPP 2017)},

in EndNote Style

JO - Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP, (VISIGRAPP 2017)
TI - Robust People Detection and Tracking from an Overhead Time-of-Flight Camera
SN - 978-989-758-225-7
AU - Fernandez-Rincon A.
AU - Fuentes-Jimenez D.
AU - Losada-Gutierrez C.
AU - Marron-Romera M.
AU - Luna C.
AU - Macias-Guarasa J.
AU - Mazo M.
PY - 2017
SP - 556
EP - 564
DO - 10.5220/0006169905560564