Low Cost Video Animation of People using a RGBD Sensor

Cathrine J. Thomsen, Thomas B. Moeslund, Troels H. P. Jensen

Abstract

This paper investigates a low-cost solution for video animation using a Kinect v2 for Windows, with which skeleton, depth and colour data are acquired for three different characters. Segmentation of the colour and depth frames was based on establishing the range of the person in the depth frame using the skeleton information, and then estimating a plane for the floor and excluding points close to it. Transitions between motions were found by minimizing the Euclidean distance over all feasible transition frames, which yields a source frame and a target frame. Intermediate frames were generated to create seamless transitions, with new poses found by moving pixels in the direction of the optical flow between the two transition frames. A user study showed that the proposed animation had a higher rate of preference and perceived realism than both no animation and animation using alpha blending.
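The transition search described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: it assumes each frame is represented by its skeleton joints as (x, y, z) tuples, that every frame pair is feasible, and that the distance is taken over all joint coordinates. The names `pose_distance`, `best_transition`, `walk` and `wave` are hypothetical.

```python
import math
from itertools import product


def pose_distance(pose_a, pose_b):
    """Euclidean distance between two poses, each a list of (x, y, z) joints."""
    if len(pose_a) != len(pose_b):
        raise ValueError("poses must have the same number of joints")
    squared = sum(
        (a - b) ** 2
        for joint_a, joint_b in zip(pose_a, pose_b)
        for a, b in zip(joint_a, joint_b)
    )
    return math.sqrt(squared)


def best_transition(source_motion, target_motion):
    """Return (source_index, target_index, distance) of the closest frame pair.

    Assumption: every pair of frames is a feasible transition. A real system
    would first filter the candidate pairs.
    """
    if not source_motion or not target_motion:
        raise ValueError("both motions need at least one frame")
    candidates = (
        (i, j, pose_distance(p, q))
        for (i, p), (j, q) in product(
            enumerate(source_motion), enumerate(target_motion)
        )
    )
    # Exhaustive search: keep the pair with the smallest distance.
    return min(candidates, key=lambda item: item[2])


# Toy data: two joints per frame (hypothetical values, not from the paper).
walk = [
    [(0, 0, 0), (0, 1, 0)],
    [(1, 0, 0), (1, 1, 0)],
    [(2, 0, 0), (2, 1, 0)],
]
wave = [
    [(5, 0, 0), (5, 1, 0)],
    [(2, 0, 0), (2, 1, 1)],
]

src, dst, dist = best_transition(walk, wave)
print(src, dst, dist)
```

The exhaustive search costs one distance evaluation per frame pair, which is acceptable for short clips. The source and target frames it returns are the ones between which intermediate frames would then be generated.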

Paper Citation


in Harvard Style

Thomsen C., Moeslund T. and Jensen T. (2017). Low Cost Video Animation of People using a RGBD Sensor. In Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 1: GRAPP, (VISIGRAPP 2017), ISBN 978-989-758-224-0, pages 244-249. DOI: 10.5220/0006132902440249


in Bibtex Style

@conference{grapp17,
author={Cathrine J. Thomsen and Thomas B. Moeslund and Troels H. P. Jensen},
title={Low Cost Video Animation of People using a RGBD Sensor},
booktitle={Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 1: GRAPP, (VISIGRAPP 2017)},
year={2017},
pages={244-249},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006132902440249},
isbn={978-989-758-224-0},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 1: GRAPP, (VISIGRAPP 2017)
TI - Low Cost Video Animation of People using a RGBD Sensor
SN - 978-989-758-224-0
AU - Thomsen C.
AU - Moeslund T.
AU - Jensen T.
PY - 2017
SP - 244
EP - 249
DO - 10.5220/0006132902440249