3D AUTOMATIC LOCATION DETECTION BASED ON SOUND LOCALIZATION

Darun Kesrarat, Paitoon Porntrakoon

Abstract

Video conference systems have been widely used. A fix video camera shoots a scene is lacking in changes. There is a method that the computer-controlled camera shoots and finds the sound source. Microphone arrays and distributed microphone arrays are used to localize the sound source based on time delay of arrival (TDOA). In order to minimize the error rate of TDOA, a set of 4 microphone arrays can be used to determine the location of sound in 3D space. TDOA cannot determine the distance of the sound source if the start time of the sound is unknown. A method to determine the distance of the sound source is using a distributed moving-microphone array. In this paper, we propose a model of a set of 4 moving-micorphone array based on TDOA that can determine the angle direction and distance of the sound source toward the video camera at the center of the model in 3D space.

References

  1. Onishi M., Kagebayashi T., and Fukunaga K., 2001. Production of video images by computer controlled cameras and its application to TV conference system. In Proc. IEEE International Conference on Computer Vision and Pattern recognition.
  2. Pirinen T., Pertila P., and Visa A., 2003. Toward intelligent sensors - reliability for time delay based direction of arrival estimates. In Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing.
  3. Rabenstein R. and Strobe N.K., 1999. Classification of time delay estimates for robust speaker localization. In Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing.
  4. Jahromi O. and Aarabi P., 2003. Time delay estimation and signal reconstruction using multi-rate measurement. In Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing.
  5. Aarabi P., 2003. The fusion of distributed Microphone arrays for sound localization. In International Journal on Applied Signal Processing.
  6. Aarabi P. and Mahdavi M., 1996. The relation between speech segment selectivity and time-delay estimation accuracy. In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing.
  7. Brandstein M.S. and Silverman H.F., 1996. A robust method for speech signal time-delay estimation in reverberant rooms. In Proc. IEEE International Cpnference on Acoustics, Speech, and Signal Processing.
  8. Knapp C.H. and Carter C.G., 1976. The generalized correlation method for estimation of time delay. In IEEE Transaction on Acoustics, Speech and Signal Processing.
  9. Porntrakoon P., Kesarat D., Daengdej J., 2004. Auto Focus using Location Detection based on Sound Localization. In Proc. ASM-2004, International Conference on Applied Simulation and Modelling. ACTA Press.
Download


Paper Citation


in Harvard Style

Kesrarat D. and Porntrakoon P. (2005). 3D AUTOMATIC LOCATION DETECTION BASED ON SOUND LOCALIZATION . In Proceedings of the Second International Conference on Informatics in Control, Automation and Robotics - Volume 3: ICINCO, ISBN 972-8865-31-7, pages 239-244. DOI: 10.5220/0001163402390244


in Bibtex Style

@conference{icinco05,
author={Darun Kesrarat and Paitoon Porntrakoon},
title={3D AUTOMATIC LOCATION DETECTION BASED ON SOUND LOCALIZATION},
booktitle={Proceedings of the Second International Conference on Informatics in Control, Automation and Robotics - Volume 3: ICINCO,},
year={2005},
pages={239-244},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001163402390244},
isbn={972-8865-31-7},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Second International Conference on Informatics in Control, Automation and Robotics - Volume 3: ICINCO,
TI - 3D AUTOMATIC LOCATION DETECTION BASED ON SOUND LOCALIZATION
SN - 972-8865-31-7
AU - Kesrarat D.
AU - Porntrakoon P.
PY - 2005
SP - 239
EP - 244
DO - 10.5220/0001163402390244