AN ALGORITHM TO USE FEEDBACK ON VIEWED DOCUMENTS TO IMPROVE WEB QUERY - Enabling Naïve Searchers to Search the Web Smartly

Sunanda Patro, Vishv Malhotra, David Johnson

2006

Abstract

This paper presents an algorithm that improves a web search query using feedback on viewed documents. A user searching for information on the Web marks the retrieved (viewed) documents as relevant or irrelevant, which further exposes the information need expressed in the original query. A new web search query matching this improved understanding of the user’s information need is then synthesized from the marked text documents. The methodology provides a way to create a web search query that matches the user’s information need even when the user would have difficulty doing so directly, whether through lack of experience in query design or lack of familiarity with the search domain. A user survey has shown that the algorithmically formed query has recall coverage and precision characteristics better than those achieved by experienced human web searchers.
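The abstract does not spell out the synthesis procedure. Purely as an illustration of the general idea, and not the authors' algorithm, the following minimal sketch assumes documents are plain strings and greedily builds an OR-query from terms that cover every relevant document while matching as few irrelevant documents as possible. The names `tokens`, `synthesize_query`, and `matches`, and the sample documents, are hypothetical.

```python
import re


def tokens(text):
    """Return the set of lower-cased alphabetic words in a document."""
    return set(re.findall(r"[a-z]+", text.lower()))


def synthesize_query(relevant, irrelevant):
    """Greedy sketch (an assumption, not the paper's method): pick OR-terms
    until every relevant document contains at least one chosen term,
    preferring terms that occur in the fewest irrelevant documents."""
    rel = [tokens(d) for d in relevant]
    irr = [tokens(d) for d in irrelevant]
    uncovered = set(range(len(rel)))
    chosen = []
    while uncovered:
        # Only terms from still-uncovered relevant documents can help.
        candidates = set().union(*(rel[i] for i in uncovered))
        if not candidates:
            break  # remaining relevant documents contain no usable terms

        def score(term):
            cost = sum(1 for d in irr if term in d)
            gain = sum(1 for i in uncovered if term in rel[i])
            # Fewest irrelevant matches first, then most relevant matches;
            # the term itself makes tie-breaking deterministic.
            return (-cost, gain, term)

        best = max(candidates, key=score)
        chosen.append(best)
        uncovered -= {i for i in uncovered if best in rel[i]}
    return chosen


def matches(query_terms, document):
    """A document satisfies the OR-query if it contains any chosen term."""
    return bool(tokens(document) & set(query_terms))


if __name__ == "__main__":
    relevant_docs = ["jaguar habitat rainforest predator", "jaguar cat prey amazon"]
    irrelevant_docs = ["jaguar car dealer price", "car engine price"]
    query = synthesize_query(relevant_docs, irrelevant_docs)
    print(" OR ".join(query))
```

In this toy setting the ambiguous term that the user would most likely type ("jaguar") is avoided because it also occurs in a document marked irrelevant. A realistic system would need term weighting, stop-word handling, and richer Boolean structure than a flat OR.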



Paper Citation


in Harvard Style

Patro S., Malhotra V. and Johnson D. (2006). AN ALGORITHM TO USE FEEDBACK ON VIEWED DOCUMENTS TO IMPROVE WEB QUERY - Enabling Naïve Searchers to Search the Web Smartly. In Proceedings of WEBIST 2006 - Second International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-972-8865-46-7, pages 287-294. DOI: 10.5220/0001238502870294


in Bibtex Style

@conference{webist06,
author={Sunanda Patro and Vishv Malhotra and David Johnson},
title={AN ALGORITHM TO USE FEEDBACK ON VIEWED DOCUMENTS TO IMPROVE WEB QUERY - Enabling Naïve Searchers to Search the Web Smartly},
booktitle={Proceedings of WEBIST 2006 - Second International Conference on Web Information Systems and Technologies - Volume 1: WEBIST},
year={2006},
pages={287-294},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001238502870294},
isbn={978-972-8865-46-7},
}


in EndNote Style

TY - CONF
JO - Proceedings of WEBIST 2006 - Second International Conference on Web Information Systems and Technologies - Volume 1: WEBIST
TI - AN ALGORITHM TO USE FEEDBACK ON VIEWED DOCUMENTS TO IMPROVE WEB QUERY - Enabling Naïve Searchers to Search the Web Smartly
SN - 978-972-8865-46-7
AU - Patro S.
AU - Malhotra V.
AU - Johnson D.
PY - 2006
SP - 287
EP - 294
DO - 10.5220/0001238502870294