Conceptual Vectors - A Complementary Tool to Lexical Networks

Didier Schwab, Lim Lian Tze, Mathieu Lafourcade

2007

Abstract

There is currently much research in natural language processing focusing on lexical networks. Most of them, in particular the most famous, WordNet, lack syntagmatic information and especially thematic information (”Tennis Problem”). This article describes conceptual vectors that allows the representation of ideas in any textual segment and offers a continuous vision of related thematic, based on the distances between these thematic. We show the characteristics of conceptual vectors and explain how they complement lexical-semantic networks. We illustrate this purpose by adding conceptual vectors to WordNet by emergence.

References

  1. Quillian, R.: Semantic memory. In: Semantic Informatic processing. MIT Press (1968) 227-270
  2. Mihalcea, R., Tarau, P., Figa, E.: Pagerank on semantic networks, with application toword sense disambiguation. In: COLING'2004 : 20th International Conference on Computational Linguistics, Geneva, Switzerland (2004) 1126-1132
  3. Mangeot-Lerebours, M., Sérasset, G., Lafourcade, M.: Construction collaborative d'une base lexicale multilingue : Le projet papillon. TAL (Traitement Automatique des langues) : Les dictionnaires électroniques 44 (2003) 151-176
  4. Knight, K., Luk, S.: Building a large-scale knowledge base for machine translation. In: AAAI'1994 : National Conference on Artificial Intelligence, Stanford University,Palo Alto, California (1994)
  5. Harabagiu, S., Chai, J., eds.: Usage of WordNet in Natural Language Processing Systems, Université de Montréal, Montréal, Canada (1998)
  6. Fellbaum, C., ed.: WordNet: An Electronic Lexical Database. The MIT Press (1988)
  7. Harabagiu, S.M., Miller, G.A., Moldovan, D.I.: Wordnet 2 - a morphologically and semantically enhanced resource. In: Workshop SIGLEX'99 : Standardizing Lexical Resources. (1999) 1-8
  8. Agirre, E., Ansa, O., Martinez, D., Hovy, E.: Enriching wordnet concepts with topic signatures. In: NAACL worshop on WordNet and Other Lexical Resources: Applications, Extensions and Customizations, Pittsburg, USA (2001)
  9. Stevenson, M.: Augmenting noun taxonomies by combining lexical similarity metrics. In: COLING'2002 : 19th International Conference on Computational Linguistics. Volume 2/2., Taipei, Taiwan (2002) 953-959
  10. Ferret, O., Zock, M.: Enhancing electronic dictionaries with an index based on associations. In: Proceedings of the 21st International Conference on Computational Linguistics, Sydney, Australia, Association for Computational Linguistics (2006) 281-288
  11. Salton, G., McGill, M.: Introduction to Modern Information Retrieval. McGrawHill, New York (1983)
  12. Deerwester, S.C., Dumais, S.T., Landauer, T.K., Furnas, G.W., Harshman, R.A.: Indexing by latent semantic analysis. Journal of the American Society of Information Science 41 (1990) 391-407
  13. Chauché, J.: Détermination sémantique en analyse structurelle : une expérience basée sur une définition de distance. TAL Information 31/1 (1990) 17-24
  14. Larousse, ed.: Thésaurus Larousse - des idées aux mots, des mots aux idées. Larousse (1992)
  15. Kirkpatrick, B., ed.: Roget's Thesaurus of English Words and Phrases. Penguin books, London (1987)
  16. Zock, M.: Sorry, what was your name again, or how to overcome the tip-of-the tongue with the help of a computer? In: SemaNet'02: Building and Using Semantic Networks, Taipei, Taiwan (2002)
  17. Lafourcade, M.: Conceptual vector learning - comparing bootstrapping from a thesaurus or induction by emergence. In: LREC'2006, Genoa, Italia (2006)
  18. Besanc¸on, R.: Intégration de connaissances syntaxiques et sémantiques dans les représentations vectorielles de texte (2001)
  19. Rastier, F.: L'isotopie sémantique, du mot au texte (1985)
  20. Lafourcade, M., Guinand, F.: Ants for natural language processing. International Journal of Computational Intelligence Research (2006) Ì paraˆitre.
  21. Mihalcea, R., Moldovan, D.: extended wordnet: progress report. In: NAACL 2001 - Workshop on WordNet and Other Lexical Resources, Pittsburgh, USA (2001)
  22. Mel'c?uk, I., Clas, A., Polguère, A.: Introduction à la lexicologie explicative et combinatoire. Duculot (1995)
  23. Schwab, D.: Approche hybride - lexicale et thématique - pour la modélisation, la détection et l'exploitation des fonctions lexicales en vue de l'analyse sémantique de texte. (2005)
Download


Paper Citation


in Harvard Style

Schwab D., Lian Tze L. and Lafourcade M. (2007). Conceptual Vectors - A Complementary Tool to Lexical Networks . In Proceedings of the 4th International Workshop on Natural Language Processing and Cognitive Science - Volume 1: NLPCS, (ICEIS 2007) ISBN 978-972-8865-97-9, pages 139-148. DOI: 10.5220/0002434801390148


in Bibtex Style

@conference{nlpcs07,
author={Didier Schwab and Lim Lian Tze and Mathieu Lafourcade},
title={Conceptual Vectors - A Complementary Tool to Lexical Networks},
booktitle={Proceedings of the 4th International Workshop on Natural Language Processing and Cognitive Science - Volume 1: NLPCS, (ICEIS 2007)},
year={2007},
pages={139-148},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002434801390148},
isbn={978-972-8865-97-9},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 4th International Workshop on Natural Language Processing and Cognitive Science - Volume 1: NLPCS, (ICEIS 2007)
TI - Conceptual Vectors - A Complementary Tool to Lexical Networks
SN - 978-972-8865-97-9
AU - Schwab D.
AU - Lian Tze L.
AU - Lafourcade M.
PY - 2007
SP - 139
EP - 148
DO - 10.5220/0002434801390148