Estimating the Number of Segments of a Turn in Dialogue Systems

Vicent Tamarit, Carlos-D. Martínez-Hinarejos



An important part of a dialogue system is the correct labelling of turns with dialogue-related meaning. This meaning is usually represented by dialogue acts, which give the system semantic information about user intentions. This labelling is usually done in two steps, dividing the turn into segments, and classifying them into DAs. Some works have shown that the segmentation step can be improved by knowing the correct number of segments in the turn before the segmentation. We present an estimation of the probability of the number of segments in the turn. We propose and evaluate some features to estimate the probability of the number of segments based on the transcription of the turn. The experiments include the SwitchBoard and the Dihana corpus and show that this method estimates correctly the number of segments of the 72% and the 78% of the turns in the SwitchBoard corpus and the Dihana corpus respectively.


  1. N. Alcácer, J. Benedí, F. Blat, R. Granell, C. D. Martínez, and F. Torres. Acquisition and labelling of a spontaneous speech dialogue corpus. Proceeding of 10th International Conference on Speech and Computer (SPECOM). Patras, Greece, pages 583-586, 2005.
  2. J. Ang, Y. Liu, and E. Shriberg. Automatic dialog act segmentation and classification in multiparty meetings. In Proceedings of the International Conference of Acoustics, Speech, and Signal Processings, volume 1, pages 1061-1064, Philadelphia, 2005.
  3. J.-M. Benedí, E. Lleida, A. Varona, M.-J. Castro, I. Galiano, R. Justo, I. L ópez de Letona, and A. Miguel. Design and acquisition of a telephone spontaneous speech dialogue corpus in spanish: Dihana. Fifth International Conference on Language Resources and Evaluation (LREC), pages 1636-1639, May 2006.
  4. H. Bunt. Context and dialogue control. THINK Quarterly, 3, 1994.
  5. L. Dybkjaer and W. Minker. Recent Trends in Discourse and Dialogue, volume 39 of Text, Speech and Language Technology. Springer, 2008.
  6. M. Fraser and G. Gilbert. Simulating speech systems. Computer Speech and Language, (5):81-89, 1991.
  7. T. Fukada, D. Koll, A. Waibel, and K. Tanigaki. Probabilistic dialogue act extraction for concept based multilingual translation systems. ICSLP 98, pages 2771-2774, 1998.
  8. J.J. Godfrey, E.C. Holliman, and J. McDaniel. Switchboard: telephone speech corpus for research and development. Acoustics, Speech, and Signal Processing, IEEE International Conference on, 1:517-520, 1992.
  9. A. Gorin, G. Riccardi, and J. Wright. How may I help you? Speech Communication, 23:113- 127, 1997.
  10. D. Jurafsky, E. Shriberg, and D. Biasca. Switchboard swbd-damsl shallow- discoursefunction annotation coders manual - draft 13. Technical Report 97-01, University of Colorado Institute of Cognitive Science, 1997.
  11. A. Lavie, L. Levin, P. Zhan, M. Taboada, D. Gates, M. Lapata, C. Clark, M. Broadhead, and A. Waibel. Expanding the domain of a multi-lingual speech-to-speech translation system. Proceedings of the Workshop on Spoken Language Translation, ACL/EACL-97, 1997.
  12. L. Levin, K. Ries, A. Thymé-Gobbel, and A. Levie. Tagging of speech acts and dialogue games in Spanish call home. In the Workshop: Towards Standards and Tools for Discourse Tagging, pages 42-47, 1999.
  13. C.D. Martínez-Hinarejos. A study of a segmentation technique for dialogue act assignation. In Proceedings of the Eighth International Conference in Computational Semantics IWCS8, pages 299-304, Tilburg, The Nederlands, January 2009. Tilburg University, Department of Communication and Information Sciences.
  14. C.D. Martinez-Hinarejos, J.M. Benedi, and R. Granell. Statistical framework for a spanish spoken dialogue corpus. Speech Communication, 50:992-1008, 2008.
  15. A. Stolcke, N. Coccaro, R. Bates, P. Taylor, C. van Ess-Dykema, K. Ries, E. Shriberg, D. Jurafsky, R. Martin, and M. Meteer. Dialogue act modelling for automatic tagging and recognition of conversational speech. Computational Linguistics, 26(3):1-34, 2000.
  16. D. Vilar, M.J. Castro, and E. Sanchis. Connectionist classification and specific stochastic models in the understanding process of a dialogue system. In European Conference on Speech Communication and Technology, pages 645-648, Geneva, Switzerland, September 2003.
  17. S. Young. Probabilistic methods in spoken dialogue systems. Philosophical Trans Royal Society (Series A), 358(1769):1389-1402, 2000.

Paper Citation

in Harvard Style

Tamarit V. and Martínez-Hinarejos C. (2009). Estimating the Number of Segments of a Turn in Dialogue Systems . In Proceedings of the 9th International Workshop on Pattern Recognition in Information Systems - Volume 1: PRIS, (ICEIS 2009) ISBN 978-989-8111-89-0, pages 9-17. DOI: 10.5220/0002173600090017

in Bibtex Style

author={Vicent Tamarit and Carlos-D. Martínez-Hinarejos},
title={Estimating the Number of Segments of a Turn in Dialogue Systems},
booktitle={Proceedings of the 9th International Workshop on Pattern Recognition in Information Systems - Volume 1: PRIS, (ICEIS 2009)},

in EndNote Style

JO - Proceedings of the 9th International Workshop on Pattern Recognition in Information Systems - Volume 1: PRIS, (ICEIS 2009)
TI - Estimating the Number of Segments of a Turn in Dialogue Systems
SN - 978-989-8111-89-0
AU - Tamarit V.
AU - Martínez-Hinarejos C.
PY - 2009
SP - 9
EP - 17
DO - 10.5220/0002173600090017