A CONTENT-BASED APPROACH TO RELEVANCE FEEDBACK IN XML-IR FOR CONTENT AND STRUCTURE QUERIES

Luis M. de Campos, Juan M. Fernández-Luna, Juan F. Huete, Carlos Martín-Dancausa

2010

Abstract

Structured documents represented in XML make it possible to pose content and structure (CAS) queries, which capture the user's needs more specifically. In this paper we study how to enrich such queries with user feedback so that the results come closer to those needs. More formally, we consider how to perform Relevance Feedback (RF) for CAS queries in XML Information Retrieval. Our approach keeps the structural constraints of the original CAS query unchanged and expands its content by adding new keywords. These new terms are selected according to their presence or absence in the judged units. The RF method is integrated into an XML-based search engine and evaluated on the INEX 2006 and INEX 2007 collections.
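
The general idea of content-only expansion can be illustrated with a minimal sketch. It is not the paper's actual term-selection model, which the abstract does not specify: the presence-difference score, the function names `select_expansion_terms` and `expand_cas_query`, and the choice to append the new terms to every `about()` clause of a NEXI-style query are all assumptions made for illustration.

```python
import re
from collections import Counter


def select_expansion_terms(relevant_units, nonrelevant_units, query_terms, k=3):
    """Pick up to k new terms (hypothetical scoring, not the paper's model).

    A term's score is the fraction of relevant judged units containing it
    minus the fraction of non-relevant judged units containing it.
    """
    def presence(units):
        # Count in how many units each term appears (presence, not frequency).
        counts = Counter()
        for text in units:
            counts.update(set(re.findall(r"[a-z]+", text.lower())))
        return counts

    rel = presence(relevant_units)
    non = presence(nonrelevant_units)
    n_rel = max(len(relevant_units), 1)
    n_non = max(len(nonrelevant_units), 1)

    scores = {
        term: count / n_rel - non.get(term, 0) / n_non
        for term, count in rel.items()
        if term not in query_terms  # only genuinely new keywords
    }
    # Highest score first; ties broken alphabetically for determinism.
    ranked = sorted(scores, key=lambda t: (-scores[t], t))
    return [t for t in ranked if scores[t] > 0][:k]


def expand_cas_query(nexi_query, new_terms):
    """Append new_terms to the keywords of each about() clause.

    The structural paths are left untouched (assumed expansion policy).
    """
    if not new_terms:
        return nexi_query
    extra = " " + " ".join(new_terms)
    pattern = r"(about\([^,]+,\s*)([^)]*)\)"
    return re.sub(
        pattern,
        lambda m: f"{m.group(1)}{m.group(2)}{extra})",
        nexi_query,
    )


if __name__ == "__main__":
    relevant = [
        "query expansion improves xml retrieval",
        "expansion of the query with feedback terms",
    ]
    nonrelevant = ["xml schema validation"]
    terms = select_expansion_terms(relevant, nonrelevant, {"xml", "retrieval"}, k=2)
    print(expand_cas_query("//article[about(., xml retrieval)]", terms))
```

In this sketch the path `//article` and the `about(., ...)` structure are preserved exactly, and only the keyword list grows, which mirrors the constraint stated in the abstract.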



Paper Citation


in Harvard Style

de Campos L., Fernández-Luna J., Huete J. and Martín-Dancausa C. (2010). A CONTENT-BASED APPROACH TO RELEVANCE FEEDBACK IN XML-IR FOR CONTENT AND STRUCTURE QUERIES. In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2010) ISBN 978-989-8425-28-7, pages 418-427. DOI: 10.5220/0003080104180427


in Bibtex Style

@conference{kdir10,
author={Luis M. de Campos and Juan M. Fernández-Luna and Juan F. Huete and Carlos Martín-Dancausa},
title={A CONTENT-BASED APPROACH TO RELEVANCE FEEDBACK IN XML-IR FOR CONTENT AND STRUCTURE QUERIES},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2010)},
year={2010},
pages={418-427},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003080104180427},
isbn={978-989-8425-28-7},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2010)
TI - A CONTENT-BASED APPROACH TO RELEVANCE FEEDBACK IN XML-IR FOR CONTENT AND STRUCTURE QUERIES
SN - 978-989-8425-28-7
AU - de Campos L.
AU - Fernández-Luna J.
AU - Huete J.
AU - Martín-Dancausa C.
PY - 2010
SP - 418
EP - 427
DO - 10.5220/0003080104180427