Part-based 3D Multi-person Tracking using Depth Cue in a Top View

Cyrille Migniot, Fakhreddine Ababsa

2014

Abstract

While the problem of tracking 3D human motion has been widely studied, the top view is never taken into consideration. However, for the video surveillance, the camera is most of the time placed above the persons. This is due to the human shape is more discriminative in the front view. We propose in this paper a markerless 3D human tracking on the top view. To do this we use the depth and color image sequences given by a kinect. First a 3D model is fitted to these cues in a particle filter framework. Then we introduce a process where the body parts are linked in a complete 3D model but weighted separately so as to reduce the computing time and optimize the resampling step. We find that this part-based tracking increases the accuracy. The process is real-time and works with multiple targets.

References

  1. Ababsa, F. (2009). Robust extended kalman filtering for camera pose tracking using 2d to 3d lines correspondences. IEEE/ASME Conference on Advanced Intelligent Mechatronics, pages 1834-1838.
  2. Ababsa, F. and Mallem, M. (2008). A robust circular fiducial detection technique and real-time 3d camera tracking. International Journal of Multimedia, 3(4):34-41.
  3. Bray, M., Koller-Meier, E., and Van Gool, L. (2004). Smart particle filtering for 3d hand tracking. IEEE International Conference on Automatic Face and Gesture Recognition, pages 675-680.
  4. Breitenstein, M., Reichlin, F., Leibe, B., Koller-Meier, E., and Van Gool, L. (2011). Online multiperson tracking-by-detection from a single, uncalibrated camera. IEEE Transaction on Pattern Analysis and Machine Intelligence, XXXIII:1820-1833.
  5. Brox, T., Rosenhahn, B., Gall, J., and Cremers, D. (2010). Combined region and motion-based 3d tracking of rigid and articulated objects. IEEE Transaction on Pattern Analysis and Machine Intelligence, pages 402-415.
  6. Cai, Y., De Freitas, N., and Little, J. (2006). Robust visual tracking for multiple targets. European Conference on Computer Vision, pages 107-118.
  7. Canton-Ferrer, C., Salvador, J., Casas, J. R., and Pardàs, M. (2008). Multi-person tracking strategies based on voxel analysis. Multimodal Technologies for Perception of Humans, pages 91-103.
  8. Deutscher, J. and Reid, I. (2005). Articulated body motion capture by stochastic search. International Journal of Computer Vision, 2:185-205.
  9. Didier, J., Ababsa, F., and Mallem, M. (2008.). Hybrid camera pose estimation combining square fiducials localisation technique and orthogonal iteration algorithm. International Journal of Image and Graphics, 8(1):169-188.
  10. Escalera, S. (2012). Human behavior analysis from depth maps. In International Conference on Articulated Motion and Deformable Objects, pages 282-292.
  11. Gonzalez, M. and Collet, C. (2011). Robust body parts tracking using particle filter and dynamic template. In IEEE International Conference on Image Processing, pages 529 - 532.
  12. Hauberg, S., Sommer, S., and Pedersen, K. S. (2010). Gaussian-like spatial priors for articulated tracking. In European Conference on Computer Vision, pages 425 - 437.
  13. Horaud, R., Niskanen, M., Dewaele, G., and Boyer, E. (2009). Human motion tracking by registering anarticulated surface to 3d points and normals. IEEE Transaction on Pattern Analysis and Machine Intelligence, XXXI:158-163.
  14. Isard, M. and Blake, A. (1998a). Condensation - conditional density propagation for visual tracking. International Journal of Computer Vision, XXIX:5 - 28.
  15. Isard, M. and Blake, A. (1998b). Icondensation - unifying low-level tracking in a stochastic framework. European Conference on Computer Vision, I:893-908.
  16. Kjellström, H., Kragic, D., and Black, M. J. (2010). Tracking people interacting with objects. In IEEE Conference on Computer Vision and Pattern Recognition.
  17. Kobayashi, Y., Sugimura, D., Sato, Y., Hirasawa, K., Suzuki, N., Kage, H., and Sugimoto, A. (2006). 3d head tracking using the particle filter with cascaded classifiers. In British Machine Vision Conference, pages 37 - 46.
  18. Larsen, A., Hauberg, S., and Pedersen, K. (2011). Unscented kalman filtering for articulated human tracking. Scandinavian conference on Image analysis, pages 228-237.
  19. Li, M., Tan, T., Chen, W., and K., H. (1998). Efficient object tracking by incremental self-tuning particle filtering on the affine group. IEEE Transactions on Image Processing, XXI:1298-1313.
  20. Liem, M. and Gavrila, D. (2009). Multi-person tracking with overlapping cameras in complex, dynamic environments. British Machine Vision Conference, pages 1-10.
  21. Lin, J.Y. Wu, Y. and Huang, T. (2004). 3d model-based hand tracking using stochastic direct search method. IEEE International Conference on Automatic Face and Gesture Recognition, pages 693-698.
  22. MacCormick, J. and Blake, A. (1999). A probabilistic exclusion principle for tracking multiple objects. International Journal on Computer Vision, I:572-578.
  23. Navarro, S., López-Méndez, A., Alcoverro, M., and Casas, J. R. (2012). Multi-view body tracking with a detector-driven hierarchical particle filter. In International Conference on Articulated Motion and Deformable Objects, pages 82-91.
  24. Nummiaro, K., Koller-Meier, E., and Van Gool, L. (2002). An adaptive color-based particle filter. Image and Vision Computing, XXI:99-110.
  25. Shan, C., T. T. and Wei, Y. (2007). Real-time hand tracking using a mean shift embedded particle filter. Pattern Recognition, XXXX:1858-1970.
  26. Shu, G., Dehghan, A., Oreifej, O., Hand, E., and Shah, M. (2012). Part-based multiple-person tracking with partial occlusion handling. IEEE Conference on Computer Vision and Pattern Recognition, pages 1815- 1821.
  27. Viola, P. and Jones, M. (2004). Robust real-time face detection. International Journal of Computer Vision, LVII(2):137-154.
  28. Wu, B. and Nevatia, R. (2007). Detection and tracking of multiple, partially occluded humans by bayesian combination of edgelet based part detectors. International Journal on Computer Vision, LXXV:247-266.
  29. Xia, L., Chen, C.-C., and Aggarwal, J. K. (2011). Human detection using depth information by kinect. International Workshop on Human Activity Understanding from 3D Data.
  30. Xing, J., Ai, H., and Lao, S. (2009). Multi-object tracking through occlusions by local tracklets filtering and global tracklets association with detection responses.
  31. IEEE Conf. on Computer Vision and Pattern Recognition, pages 1200-1207.
  32. Xu, R., Agarwal, P., Kumar, S., Krovi, V. N., and Corso, J. J. (2012). Combining skeletal pose with local motion for human activity recognition. In International Conference on Articulated Motion and Deformable Objects, pages 114-123.
  33. Yang, C., Duraiswami, R., and Davis, L. (2005). Fast multiple object tracking via a hierarchical particle filter. In International Conference on Computer Vision, pages 212 - 219.
Download


Paper Citation


in Harvard Style

Migniot C. and Ababsa F. (2014). Part-based 3D Multi-person Tracking using Depth Cue in a Top View . In Proceedings of the 9th International Conference on Computer Vision Theory and Applications - Volume 3: VISAPP, (VISIGRAPP 2014) ISBN 978-989-758-009-3, pages 419-426. DOI: 10.5220/0004648204190426


in Bibtex Style

@conference{visapp14,
author={Cyrille Migniot and Fakhreddine Ababsa},
title={Part-based 3D Multi-person Tracking using Depth Cue in a Top View},
booktitle={Proceedings of the 9th International Conference on Computer Vision Theory and Applications - Volume 3: VISAPP, (VISIGRAPP 2014)},
year={2014},
pages={419-426},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004648204190426},
isbn={978-989-758-009-3},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 9th International Conference on Computer Vision Theory and Applications - Volume 3: VISAPP, (VISIGRAPP 2014)
TI - Part-based 3D Multi-person Tracking using Depth Cue in a Top View
SN - 978-989-758-009-3
AU - Migniot C.
AU - Ababsa F.
PY - 2014
SP - 419
EP - 426
DO - 10.5220/0004648204190426