Concept Profiles for Filtering Parliamentary Documents

Francisco J. Ribadas, Luis M. de Campos, Juan M. Fernández-Luna, Juan F. Huete

2015

Abstract

Content-based recommender/filtering systems help to appropriately distribute information among the individuals or organizations that could consider it of interest. In this paper we describe a filtering system to deal with the problem of assigning documents to members of the parliament potentially interested on them. The proposed approach exploits subjects taken from a conceptual thesaurus to create the user profiles and to describe the documents to be filtered. The assignment of subjects to documents is modeled as a multilabel classification problem. Experiments with a real parliamentary corpus are reported, evaluating several methods to assign conceptual subjects to documents and to match those sets of subjects with user profiles.

References

  1. Belkin, N.J., and Croft, W.B. (1992). Information Filtering and Information Retrieval: Two Sides of the Same Coin? Communications of the ACM, 35:29-38.
  2. de Campos, L.M., Fernández-Luna, J.M., Huete, J.F., Martin-Dancausa, C.J., Tur-Vigil, C., Tagua, A. (2009). An Integrated System for Managing the Andalusian Parliament's Digital Library. Program: Electronic Library and Information Systems, 43:121-139.
  3. Chang, C.-C and Lin, C.-J (2011). LIBSVM: A Library for Support Vector Machines. ACM Transactions on Intelligent Systems and Technology, 2:27:1-27:27.
  4. Gauch, S., Speretta, M., Chandramouli, A., and Micarelli, A. (2007). User Profiles for Personalized Information Access. In: The Adaptative Web. LCNS, vol. 4321, pages 54-89.
  5. Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., and Witten, I.H. (2009). The WEKA Data Mining Software: An Update. SIGKDD Explorations, 11(1):10-18.
  6. Hanani, U., Shapira, B., and Shoval, P. (2001). Information Filtering: Overview of Issues, Research and Systems. User Modeling and User-Adapted Interaction, 11:203-259.
  7. Lantz, B. (2013). Machine Learning with R. Packt Publishing Ltd.
  8. Lin., D. (1998). An Information-Theoretic Definition of Similarity. Proceedings of the Fifteenth International Conference on Machine Learning (ICML 1998), pages 296-304.
  9. Lops, P., de Gemmis, M., and Semerano, G. (2011). Content-based Recommender Systems: State of the Art and Trends. In: Recommender Systems Handbook, pages 73-105, Springer.
  10. Pazzani, M., and Billsus, D. (2007). Content-based Recommendation Systems. In: The Adaptive Web. LCNS, vol. 4321, pages 325-341.
  11. Read, J., Pfahringer, B., Holmes, G., and Frank, E. (2011). Classifier chains for multi-label classification. Machine Learning, 85(3):333-359.
  12. Silla Jr., C.N., and Freitas, A.A. (2011) A Survey of Hierarchical Classification across different Application Domains. Data Mining and Knowledge Discovery, 22(1- 2):31-72.
  13. Tsoumakas, G., Katakis, I., Vlahavas, I. (2010). Mining Multi-label Data. In Data Mining and Knowledge Discovery Handbook, pages 667-685, O. Maimon, L. Rokach (Eds.), Springer.
  14. Yeh, A. (2000). More accurate tests for the statistical significance of result differences. In Proceedings of the 18th International Conference on Computational Linguistics (COLING), pages 947-953.
Download


Paper Citation


in Harvard Style

Ribadas F., de Campos L., Fernández-Luna J. and Huete J. (2015). Concept Profiles for Filtering Parliamentary Documents . In Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2015) ISBN 978-989-758-158-8, pages 409-416. DOI: 10.5220/0005616104090416


in Bibtex Style

@conference{kdir15,
author={Francisco J. Ribadas and Luis M. de Campos and Juan M. Fernández-Luna and Juan F. Huete},
title={Concept Profiles for Filtering Parliamentary Documents},
booktitle={Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2015)},
year={2015},
pages={409-416},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005616104090416},
isbn={978-989-758-158-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2015)
TI - Concept Profiles for Filtering Parliamentary Documents
SN - 978-989-758-158-8
AU - Ribadas F.
AU - de Campos L.
AU - Fernández-Luna J.
AU - Huete J.
PY - 2015
SP - 409
EP - 416
DO - 10.5220/0005616104090416