ORGANOGRAPHS - Multi-faceted Hierarchical Categorization of Web Documents

Rodrigo Dias Arruda Senra, Claudia Bauzer Medeiros


The deluge of information on the Web challenges internauts to organize their references to interesting content, both on the Web and in their private offline storage. An automatically managed personal index to content acquired from the Web is useful for everybody, but critical for researchers and scholars. In this paper, we discuss concepts and problems related to organizing information through multi-faceted hierarchical categorization. We introduce the organograph as a mechanism to specify multiple views of how content is organized. Organographs can help scientists automatically organize their documents along multiple axes, improving sharing and navigation through themes and concepts according to a particular research objective.
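The abstract does not spell out how an organograph is structured, so the following is only a minimal sketch of the general idea of multi-faceted hierarchical categorization, not the authors' mechanism. It assumes each document carries one category path per facet and builds a separate tree view for each facet. The document names, the facet names (`topic`, `year`), the `_docs` key, and the `build_view` helper are all hypothetical.

```python
# Hypothetical documents: each has one category path per facet (an assumption,
# not a structure taken from the paper).
DOCS = {
    "paper.pdf": {"topic": ("databases", "indexing"), "year": ("2011",)},
    "notes.html": {"topic": ("databases", "ontologies"), "year": ("2010",)},
    "talk.pdf": {"topic": ("web", "tagging"), "year": ("2011",)},
}


def build_view(docs, facet):
    """Build a nested-dict hierarchy for one facet.

    Each level of the category path becomes a nested dict; documents are
    listed under the reserved key "_docs" at the node where their path ends.
    """
    root = {}
    for name, facets in sorted(docs.items()):
        path = facets.get(facet)
        if path is None:
            continue  # document is not categorized along this facet
        node = root
        for category in path:
            node = node.setdefault(category, {})
        node.setdefault("_docs", []).append(name)
    return root


# The same collection, organized along two independent axes.
by_topic = build_view(DOCS, "topic")
by_year = build_view(DOCS, "year")
```

Here `by_topic` and `by_year` are two views over the same three documents, which is the sense in which one collection can be navigated along multiple axes.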



Paper Citation

in Harvard Style

Dias Arruda Senra R. and Bauzer Medeiros C. (2011). ORGANOGRAPHS - Multi-faceted Hierarchical Categorization of Web Documents. In Proceedings of the 7th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-989-8425-51-5, pages 583-588. DOI: 10.5220/0003319205830588
