Intended Boundaries detection in Topic Change Tracking for Text Segmentation

Alexandre Labadié, Violaine Prince

2008

Abstract

This paper presents a topical text segmentation method based on intended boundaries detection and compares it to a well known default boundaries detection method, c99. Running the two methods on a corpus of twenty two French political discourse and results showed that intended boundaries detection performs better than default boundaries detection on well structured texts.

References

  1. Kaszkiel, M., Zobel, J.: Passage retrieval revisited. Proceedings of theTwentieth International Conference on Research and Development in Information Access (ACMSIGIR) (1997) 178- 185
  2. Prince, V., Labadié, A.: Text segmentation based on document understanding for information retrieval. In Proceedings of NLDB'07 (2007) 295-304
  3. Kan, M., Klavans, J.L., McKeown, K.R.: Linear segmentation and segment significance. Proceedings of WVLC-6 (1998) 197-205
  4. Hearst, M.A.: Text-tilling : segmenting text into multi-paragraph subtopic passages. Computational Linguistics (1997) 59-66
  5. Pevzner, L., Hearst, M.: A critique and improvement of anevaluation metric for text segmentation. Computational Linguistics (2002) 113-125
  6. Choi, F.Y.Y.: Advances in domain independent linear text segmentation. Proceedings of NAACL-00 (2000) 26-33
  7. Morris, J., Hirst, G.: Lexical cohesion computed by thesaural relations as an indicator of the structure of text. Computational Linguistics 17 (1991) 20-48
  8. Bestgen, Y., Piérard, S.: Comment évaluer les algorithmes de segmentation automatiques ? essai de construction d'un matriel de référence. Proceedings of TALN'06 (2006)
  9. Choi, F.Y.Y., Wiemer-Hastings, P., Moore, J.: Latent semantic analysis for text segmentation. Proceedings of EMNLP (2001) 109-117
  10. Reynar, J.C.: Topic Segmentation: Algorithms and Applications. Phd thesis, University of Pennsylvania (1998)
  11. Passonneau, R.J., Litman, D.: Lintention-based segmentation: Humanreliability and correlation with linguistic cues. Proceedings of the 31st Annual Meeting of theAssociation for Computational Linguistics, (1993) 148-155
  12. Chauché, J.: Un outil multidimensionnel de l'analyse du discours. Proceedings of Coling'84 1 (1984) 11-15
  13. Roget, P.: Thesaurus of English Words and Phrases. Longman, London (1852)
  14. Larousse: Thésaurus Larousse - des idées aux mots, des mots aux idées. Larousse, Paris (1992)
  15. Chauché, J., Prince, V.: Classifying texts through natural language parsing and semantic filtering. In Proceedings of LTC'03 (2007)
  16. Labadié, A., Chauché: Segmentation thématique par calcul de distance sémantique. Proceedings of DEFT'06 1 (2006) 45-59
  17. Lelu, A., M., C., Aubain, S.: Coopération multiniveau d'approches non-supervises et supervises pour la détection des ruptures thématiques dans les discours présidentiels franc¸ais. In Proceedings of DEFT'06 (2006)
  18. Azé, J., Heitz, T., Mela, A., Mezaour, A., Peinl, P., Roche, M.: Présentation de deft'06 (defi fouille de textes). Proceedings of DEFT'06 1 (2006) 3-12
Download


Paper Citation


in Harvard Style

Labadié A. and Prince V. (2008). Intended Boundaries detection in Topic Change Tracking for Text Segmentation . In Proceedings of the 5th International Workshop on Natural Language Processing and Cognitive Science - Volume 1: NLPCS, (ICEIS 2008) ISBN 978-989-8111-45-6, pages 13-21. DOI: 10.5220/0001728200130021


in Bibtex Style

@conference{nlpcs08,
author={Alexandre Labadié and Violaine Prince},
title={Intended Boundaries detection in Topic Change Tracking for Text Segmentation},
booktitle={Proceedings of the 5th International Workshop on Natural Language Processing and Cognitive Science - Volume 1: NLPCS, (ICEIS 2008)},
year={2008},
pages={13-21},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001728200130021},
isbn={978-989-8111-45-6},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 5th International Workshop on Natural Language Processing and Cognitive Science - Volume 1: NLPCS, (ICEIS 2008)
TI - Intended Boundaries detection in Topic Change Tracking for Text Segmentation
SN - 978-989-8111-45-6
AU - Labadié A.
AU - Prince V.
PY - 2008
SP - 13
EP - 21
DO - 10.5220/0001728200130021