FOUR-PHASE RE-SPEAKER TRAINING SYSTEM

Aleš Pražák, Zdeněk Loose, Josef Psutka, Vlasta Radová, Luděk Müller

2011

Abstract

Since the re-speaker approach to the automatic captioning of TV broadcastings using large vocabulary continuous speech recognition (LVCSR) is on the increase, there is also a growing demand for training systems that would allow new speakers to learn the procedure. This paper describes a specially designed re-speaker training system that provides gradual four-phase tutoring process with quantitative indicators of a trainee progress to enable faster (and thus cheaper) training of the re-speakers. The performance evaluation of three re-speakers who were trained on the proposed system is also reported.

References

  1. Boulianne, G., Beaumont, J.-F., Boisvert, M., Brousseau, J., Cardinal, P., Chapdelaine, C., Comeau, M., Ouellet, P., Osterrath, F., 2006. In International Conference on Spoken Language Processing.
  2. Evans, M. J., 2003. Speech Recognition in Assisted and Live Subtitling for Television. WHP 065. BBC R&D White Papers.
  3. Homma, S., Kobayashi, A., Oku, T., Sato, S., Imai, T., Takagi, T., 2008. New Real-Time Closed-Captioning System for Japanese Broadcast News Programs. In Computers Helping People with Special Needs. Springer.
  4. Neto, J., Meinedo, H., Viveiros, M., Cassaca, R., Martins, C., Caseiro, D., 2008. Broadcast news subtitling system in Portuguese. In IEEE International Conference on Acoustics, Speech and Signal Processing.
  5. Pražák, A., Müller, L., Psutka, J. V., Psutka, J., 2007. LIVE TV SUBTITLING - Fast 2-pass LVCSR System for Online Subtitling. In International Conference on Signal Processing and Multimedia Applications.
  6. Pražák, A., Zajíc, Z., Machlica, L., Psutka, J. V., 2009. Fast Speaker Adaptation in Automatic Online Subtitling. In International Conference on Signal Processing and Multimedia Applications.
  7. Verhelst, W., 2000. Overlap-add methods for time-scaling of speech. In Speech Communication. Elsevier.
  8. Wald, M. and Bell, J.-M. and Boulain, P. and Doody, K. and Gerrard, J., 2007. Correcting automatic speech recognition captioning errors in real time. In International Journal of Speech Technology.
Download


Paper Citation


in Harvard Style

Pražák A., Loose Z., Psutka J., Radová V. and Müller L. (2011). FOUR-PHASE RE-SPEAKER TRAINING SYSTEM . In Proceedings of the International Conference on Signal Processing and Multimedia Applications - Volume 1: SIGMAP, (ICETE 2011) ISBN 978-989-8425-72-0, pages 217-220. DOI: 10.5220/0003604502170220


in Bibtex Style

@conference{sigmap11,
author={Aleš Pražák and Zdeněk Loose and Josef Psutka and Vlasta Radová and Luděk Müller},
title={FOUR-PHASE RE-SPEAKER TRAINING SYSTEM},
booktitle={Proceedings of the International Conference on Signal Processing and Multimedia Applications - Volume 1: SIGMAP, (ICETE 2011)},
year={2011},
pages={217-220},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003604502170220},
isbn={978-989-8425-72-0},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Signal Processing and Multimedia Applications - Volume 1: SIGMAP, (ICETE 2011)
TI - FOUR-PHASE RE-SPEAKER TRAINING SYSTEM
SN - 978-989-8425-72-0
AU - Pražák A.
AU - Loose Z.
AU - Psutka J.
AU - Radová V.
AU - Müller L.
PY - 2011
SP - 217
EP - 220
DO - 10.5220/0003604502170220