TOWARDS AUTOMATIC CONTENT TAGGING - Enhanced Web Services in Digital Libraries using Lexical Chaining

Alexander Mehler, Ulli Waltinger, Gerhard Heyer

2008

Abstract

This paper proposes a web-based application that combines social tagging, an enhanced visual representation of documents, and alignment to an open-ended social ontology. More precisely, it introduces, on the one hand, an approach to automatically extracting document-related keywords for indexing and representing document content as an alternative to social tagging. On the other hand, it proposes automatic classification within a social ontology based on the German Wikipedia category taxonomy. The paper has two main goals: to describe the method for automatically tagging digital documents, and to provide an overview of the algorithmic patterns of lexical chaining that can be applied to topic tracking and topic labelling of digital documents.
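As an illustration of the general idea behind lexical chaining for keyword extraction, the following is a minimal sketch and not the method described in the paper. It assumes a greedy chaining strategy and a hypothetical hand-written relatedness table (`RELATED`); a real system would draw relatedness from a lexical resource or a social ontology. The function names `related`, `build_chains`, and `top_keywords` are likewise made up for this example.

```python
from collections import Counter

# Hypothetical toy relatedness table (an assumption for illustration only).
RELATED = {
    frozenset(("library", "catalogue")),
    frozenset(("library", "book")),
    frozenset(("book", "catalogue")),
    frozenset(("tag", "keyword")),
}


def related(a, b):
    """Two words are related if identical or listed together in RELATED."""
    return a == b or frozenset((a, b)) in RELATED


def build_chains(tokens):
    """Greedily attach each token to the first chain holding a related word."""
    chains = []
    for tok in tokens:
        for chain in chains:
            if any(related(tok, member) for member in chain):
                chain.append(tok)
                break
        else:
            # No existing chain fits, so the token starts a new chain.
            chains.append([tok])
    return chains


def top_keywords(tokens, n=3):
    """Return up to n most frequent words of the longest chain."""
    chains = build_chains(tokens)
    if not chains:
        return []
    strongest = max(chains, key=len)
    return [word for word, _ in Counter(strongest).most_common(n)]


tokens = ["library", "tag", "book", "catalogue", "keyword", "library"]
keywords = top_keywords(tokens, n=2)
```

Here chain length stands in for chain strength; other scoring schemes, such as weighting by relation type or by distance between occurrences, are equally plausible and would change which chain supplies the tags.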



Paper Citation


in Harvard Style

Mehler A., Waltinger U. and Heyer G. (2008). TOWARDS AUTOMATIC CONTENT TAGGING - Enhanced Web Services in Digital Libraries using Lexical Chaining. In Proceedings of the Fourth International Conference on Web Information Systems and Technologies - Volume 2: WEBIST, ISBN 978-989-8111-27-2, pages 231-236. DOI: 10.5220/0001527502310236


in Bibtex Style

@conference{webist08,
author={Alexander Mehler and Ulli Waltinger and Gerhard Heyer},
title={TOWARDS AUTOMATIC CONTENT TAGGING - Enhanced Web Services in Digital Libraries using Lexical Chaining},
booktitle={Proceedings of the Fourth International Conference on Web Information Systems and Technologies - Volume 2: WEBIST},
year={2008},
pages={231-236},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001527502310236},
isbn={978-989-8111-27-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Fourth International Conference on Web Information Systems and Technologies - Volume 2: WEBIST
TI - TOWARDS AUTOMATIC CONTENT TAGGING - Enhanced Web Services in Digital Libraries using Lexical Chaining
SN - 978-989-8111-27-2
AU - Mehler A.
AU - Waltinger U.
AU - Heyer G.
PY - 2008
SP - 231
EP - 236
DO - 10.5220/0001527502310236