loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Author: Konstantin Clemens

Affiliation: Technische Universität Berlin, Germany

Keyword(s): Geocoding, Postal Address Search, Spelling Variant, Spelling Error, Document Search.

Abstract: The process of resolving names of spatial entities like postal addresses or administrative areas into their whereabouts is called geocoding. It is an error-prone process for multiple reasons: Names of postal address elements like cities, streets, or districts are often reused for historical reasons; structures of postal addresses are only coherent within countries or regions - around the globe addresses are not structured in a canonical way; human users might not adhere even to locally common format for specifying addresses; also, humans often introduce spelling mistakes when referring to a location. In this paper, a log of address searches from human users is used to model user behavior with regards to spelling mistakes. This model is used to generate spelling variants of address tokens which are indexed in addition to the proper spelling. Experiments show that augmenting the index of a geocoder with spelling variants is a valuable approach to handling queries with misspelled toke ns. It enables the system to serve more such queries correctly as compared to a geocoding system supporting edit distances: While this way the recall of such a system is improved, its precision remains on par at the same time. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.144.12.205

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Clemens, K. (2018). Enhanced Address Search with Spelling Variants. In Proceedings of the 4th International Conference on Geographical Information Systems Theory, Applications and Management - GISTAM; ISBN 978-989-758-294-3; ISSN 2184-500X, SciTePress, pages 28-35. DOI: 10.5220/0006646100280035

@conference{gistam18,
author={Konstantin Clemens.},
title={Enhanced Address Search with Spelling Variants},
booktitle={Proceedings of the 4th International Conference on Geographical Information Systems Theory, Applications and Management - GISTAM},
year={2018},
pages={28-35},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006646100280035},
isbn={978-989-758-294-3},
issn={2184-500X},
}

TY - CONF

JO - Proceedings of the 4th International Conference on Geographical Information Systems Theory, Applications and Management - GISTAM
TI - Enhanced Address Search with Spelling Variants
SN - 978-989-758-294-3
IS - 2184-500X
AU - Clemens, K.
PY - 2018
SP - 28
EP - 35
DO - 10.5220/0006646100280035
PB - SciTePress