AGGREGATION OF IMPLICIT FEEDBACKS FROM SEARCH ENGINE LOG FILES

Ashok Veilumuthu, Parthasarathy Ramachandran

2010

Abstract

The current approaches to information retrieval from the search engine depends heavily on the web linkage structure which is a form of relevance judgment by the page authors. However, to overcome spamming attempts and language semantics, it is important to also incorporate the user feedback on the documents’ relevance for a particular query. Since users can be hardly motivated to give explicit/direct feedback on search quality, it becomes necessary to consider implicit feedback that can be collected from search engine logs. Though there are number of implicit feedback measures proposed to improve the search quality, there is no standard methodology proposed yet to aggregate those implicit feedbacks meaningfully to get a final ranking of he documents. In this article, we propose an extension to the distance based ranking model to aggregate different implicit feedbacks based on their expertise in ranking the documents. The proposed approach has been tested on two implicit feedbacks, namely click sequence and time spent in reading a document from the actual log data of AlltheWeb.com. The results were found to be convincing and indicative of the possibility of expertise based fusion of implicit feedbacks to arrive at a single ranking of documents for the given query.

References

  1. Agichtein, E., Brill, E., and Dumais, S. (2006). Improving web search ranking by incorporating user behavior information. In Procs. SIGIR 7806, pages 19-26, New York, NY, USA. ACM.
  2. Brin, S. and Page, L. (1998). The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems, 30(1-7):107-117.
  3. Dwork, C., Kumar, R., Naor, M., and Sivakumar, D. (2001). Rank aggregation methods for the web. In Procs WWW 7801, pages 613-622, New York, NY, USA. ACM.
  4. Fligner, M. A. and Verducci, J. S. (1986). Distance based ranking models. Journal of the Royal Statistical Society. Series B (Methodological), 48(3):359-369.
  5. Fligner, M. A. and Verducci, J. S. (1988). Multistage ranking models. Journal of the American Statistical Association, 83(403):892-901.
  6. Jansen, B. J. and Spink, A. (2005). An analysis of web searching by european alltheweb.com users. Inf. Process. Manage., 41(2):361-381.
  7. Kelly, D. and Belkin, N. J. (2004). Display time as implicit feedback: understanding task effects. In ACM SIGIR, pages 377-384.
  8. Kim, J., Oard, D., and Romanik, K. (2000). Using implicit feedback for user modeling in internet and intranet searching. Technical report, College of Library and Information Services, University of Maryland at College Park.
  9. Kleinberg, J. M. (1999). Authoritative sources in a hyperlinked environment. J. ACM, 46(5):604-632.
  10. Klementiev, A., Roth, D., and Small, K. (2008). Unsupervised rank aggregation with distance-based models. In Procs. ICML 7808, pages 472-479, New York, NY, USA. ACM.
  11. Lebanon, G. and Lafferty, J. D. (2002). Cranking: Combining rankings using conditional probability models on permutations. In Procs. ICML 7802, pages 363-370, San Francisco, CA, USA. Morgan Kaufmann Publishers Inc.
  12. Mallows, C. L. (1957). Non-null ranking models. i. Biometrika, 44(1/2):114-130.
  13. Ramachandran, P. (2005). Discovering user preferences by using time entries in click-through data to improve search engine results. In Discovery Science, pages 383-385.
  14. Veilumuthu, A. and Ramachandran, P. (2007). Discovering implicit feedbacks from search engine log files. In Discovery Science, pages 231-242.
  15. White, R., Ruthven, I., and Jose, J. M. (2002). The use of implicit evidence for relevance feedback in web retrieval. In Proceedings of the 24th BCS-IRSG European Colloquium on IR Research, pages 93-109. Springer-Verlag.
Download


Paper Citation


in Harvard Style

Veilumuthu A. and Ramachandran P. (2010). AGGREGATION OF IMPLICIT FEEDBACKS FROM SEARCH ENGINE LOG FILES . In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2010) ISBN 978-989-8425-28-7, pages 269-274. DOI: 10.5220/0003096502690274


in Bibtex Style

@conference{kdir10,
author={Ashok Veilumuthu and Parthasarathy Ramachandran},
title={AGGREGATION OF IMPLICIT FEEDBACKS FROM SEARCH ENGINE LOG FILES},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2010)},
year={2010},
pages={269-274},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003096502690274},
isbn={978-989-8425-28-7},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2010)
TI - AGGREGATION OF IMPLICIT FEEDBACKS FROM SEARCH ENGINE LOG FILES
SN - 978-989-8425-28-7
AU - Veilumuthu A.
AU - Ramachandran P.
PY - 2010
SP - 269
EP - 274
DO - 10.5220/0003096502690274