ROBUST CAMERA MOTION ESTIMATION IN VIDEO SEQUENCES

Xiaobo An, Xueying Qin, Guofeng Zhang, Wei Chen, Hujun Bao

2006

Abstract

Camera motion estimation of video sequences requires robust recovery of camera parameters and is a cumbersome task concerning arbitrarily complex scenes in video sequences. In this paper, we present a novel algorithm for robust and accurate estimation of camera motion. We insert a virtual frame between each pair of consecutive frames, through which the in-between camera motion is decomposed into two separate components, i.e., pure rotation and pure translation. Given matched feature points between two frames, one point set corresponding to the far scene is chosen, which is used to estimate initial camera motion. We further refine it recursively by a non-linear optimizer, yielding the final camera motion parameters. Our approach achieves accurate estimation of camera motion and avoids instability of camera tracking. We demonstrate high stability, accuracy and performance of our algorithm with a set of augmented reality applications based on acquired video sequences.

References

  1. Alon, J. and Sclaroff, S. (2000). Recursive estimation of motion and planar structure. In Proceedings of IEEE Computer Vision and Pattern Recognition, pages 550- 556.
  2. Azarbayejani, A. and Pentland, A. P. (1995). Recursive estimation of motion, structure, and focal length. IEEE Transactions on Pattern Analysis and Machine Intelligence, 17(6):562-575.
  3. Chen, Z., Pears, N., McDermid, J., and Heseltine, T. (2003). Epipolar estimation under pure camera translation. In Proceedings of Digital Image Computing: Techniques and Applications 2003, pages 849-858, Sydney, Australia.
  4. Cornelis, K., Pollefeys, M., and Gool, L. V. (2001). Tracking based structure and motion recovery for augmented video productions.
  5. Cornelis, K., Verbiest, F., and Gool, L. V. (2004). Drift detection and removal for sequential structure for motion algorithms. IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(10):1249-1259.
  6. Georgescu, B. and Meer, P. (2004). Point matching under large image deformations and illumination changes. IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(6):647-688.
  7. Hartley, R. and Zisserman, A. (2000). Multiple View Geometry in Computer Vision. Cambridge University Press.
  8. Jepson, A. D. and Heeger, D. J. (1991). A fast subspace algorithm for recovering rigid motion. In Proceedings of IEEE Workshop on Visual Motion, pages 124-131.
  9. Johansson, B. (1990). View synthesis and 3D reconstruction of piecewise planar scenes using intersection lines between the planes. In Proceedings of International Conference on Pattern Recognition 1999, pages 54- 59.
  10. Kahl, F. and Heyden, A. (2001). Euclidean reconstruction and auto-calibration from continuous motion. In Proceedings of International Conference on Computer Vision 2001, pages 572-577, Vancouver, Canada.
  11. MacLean, W. J. (1999). Removal of translation bias when using subspace methods. In Proceedings of International Conference on Computer Vision 1999, pages 753-758.
  12. Nistér, D. (2004). An efficient solution to the five-point relative pose problem. IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(6):756-777.
  13. Pollefeys, M., Gool, L. V., Vergauwen, M., Verbiest, F., Cornelis, K., Tops, J., and Koch, R. (2004). Visual modeling with a hand-held camera. International Journal of Computer Vision, 59(3):207-232.
  14. Qin, X., Nakamae, E., and Tadamura, K. (2002). Automatically compositing still images and landscape video sequences. IEEE Computer Graphics and Appliactions, 22(1):68-78.
  15. Sharp, G. C., Lee, S. W., and Wehe, D. K. (2004). Multiview registration of 3D scenes by minimizing error between coordinate frames. IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(8):1037- 1050.
  16. Shashua, A. and Werman, M. (1995). Trilinearity of three perspective views and its associated tensor. In Proceedings of International Conference on Computer Vision 1995, pages 920-925.
  17. Stein, G. P. and Shashua, A. (2000). Model-based brightness constraints: On direct estimation of structure and motion. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(9):992-1025.
  18. Wang, W. and Tsui, H. T. (2000). An SVD decomposition of essential matrix with eight solutions for the relative positions of two perspective cameras. In Proceedings of the International Conference on Pattern Recognition 2000, pages 1362-1365, Barcelona, Spain.
  19. Wong, K. H. and Chang, M. M. Y. (2004). 3D model reconstruction by constrained bundle adjustment. In Proceedings of the 17th International Conference on Pattern Recognition, pages 902-905.
  20. Zhang, Z. (1998). Determining the epipolar geometry and its uncertainty: A review. International Journal of Computer Vision, 27(2):161-198.
  21. Zhang, Z. and Loop, C. (2001). Estimating the fundamental matrix by transforming image points in projective space. Computer Vision and Image Understanding, 82(2):174-180.
  22. Zivkovic, Z. and van der Heijden, F. (2002). Better features to track by estimating the tracking convergence region. In Proceedings of IEEE International Conference on Pattern Recognition 2002, pages 635-638.
  23. 10 Frame No.
Download


Paper Citation


in Harvard Style

An X., Qin X., Zhang G., Chen W. and Bao H. (2006). ROBUST CAMERA MOTION ESTIMATION IN VIDEO SEQUENCES . In Proceedings of the First International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP, ISBN 972-8865-40-6, pages 294-302. DOI: 10.5220/0001361702940302


in Bibtex Style

@conference{visapp06,
author={Xiaobo An and Xueying Qin and Guofeng Zhang and Wei Chen and Hujun Bao},
title={ROBUST CAMERA MOTION ESTIMATION IN VIDEO SEQUENCES},
booktitle={Proceedings of the First International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP,},
year={2006},
pages={294-302},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001361702940302},
isbn={972-8865-40-6},
}


in EndNote Style

TY - CONF
JO - Proceedings of the First International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP,
TI - ROBUST CAMERA MOTION ESTIMATION IN VIDEO SEQUENCES
SN - 972-8865-40-6
AU - An X.
AU - Qin X.
AU - Zhang G.
AU - Chen W.
AU - Bao H.
PY - 2006
SP - 294
EP - 302
DO - 10.5220/0001361702940302