loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Author: Silviu Cucerzan

Affiliation: Microsoft Research, United States

Keyword(s): Web search, Queries, Capitalization, Truecasing, Ranking.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Pre-Processing and Post-Processing for Data Mining ; Symbolic Systems

Abstract: We investigate the capitalization features of queries submitted to Web search engines and the relation between capitalization information, either as received from users or as hypothesized based on Web statistics, and search relevance. We observe that users tend to lowercase words in their queries significantly more often than as predicted from Web data. More importantly, we determine that document relevance is strongly correlated with the matching in capitalization between the instances of query tokens in the target document and the tokens of the truecased form of the query as obtained by using Web n-gram data.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 54.147.123.159

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Cucerzan, S. (2010). DOES CAPITALIZATION MATTER IN WEB SEARCH?. In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2010) - KDIR; ISBN 978-989-8425-28-7; ISSN 2184-3228, SciTePress, pages 302-306. DOI: 10.5220/0003102503020306

@conference{kdir10,
author={Silviu Cucerzan.},
title={DOES CAPITALIZATION MATTER IN WEB SEARCH?},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2010) - KDIR},
year={2010},
pages={302-306},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003102503020306},
isbn={978-989-8425-28-7},
issn={2184-3228},
}

TY - CONF

JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2010) - KDIR
TI - DOES CAPITALIZATION MATTER IN WEB SEARCH?
SN - 978-989-8425-28-7
IS - 2184-3228
AU - Cucerzan, S.
PY - 2010
SP - 302
EP - 306
DO - 10.5220/0003102503020306
PB - SciTePress