loading
Papers

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Keerthi Koneru ; Venkata Sai Venkatesh Pulla and Cihan Varol

Affiliation: Sam Houston State University, United States

ISBN: 978-989-758-193-9

Keyword(s): Caverphone, Dmetaphone, Information Retrieval, Misspelled Words, Metaphone, NYSIIS, Phonetic Matching, Soundex.

Related Ontology Subjects/Areas/Topics: Data Engineering ; Data Management and Quality ; Information Quality

Abstract: Researchers confront major problems while searching for various kinds of data in the large imprecise database, as they are not spelled correctly or in the way they were expected to be spelled. As a result, they cannot find the word they sought. Over the years of struggle, pronunciation of words was considered to be one of the practices to solve the problem effectively. The technique used to acquire words based on sounds is known as “Phonetic Matching”. Soundex is the first algorithm proposed and other algorithms like Metaphone, Caverphone, DMetaphone, Phonex etc., are also used for information retrieval in different environments. This paper deals with the analysis and evaluation of different phonetic matching algorithms on several datasets comprising of street names of North Carolina and English dictionary words. The analysis clearly states that there is no clear best technique for generic word lists as Metaphone has best performance for English dictionary words, while NYSIIS has bett er performance for datasets having street names. Though Soundex has high accuracy in correcting the exact words compared to other algorithms, it has lower precision due to more noise in the considered arena. The experimental results paved way for introducing some suggestions that would aid to make databases more concrete and achieve higher data quality. (More)

PDF ImageFull Text

Download
CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.85.245.126

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Koneru, K.; Pulla, V. and Varol, C. (2016). Performance Evaluation of Phonetic Matching Algorithms on English Words and Street Names - Comparison and Correlation.In Proceedings of the 5th International Conference on Data Management Technologies and Applications - Volume 1: DATA, ISBN 978-989-758-193-9, pages 57-64. DOI: 10.5220/0005926300570064

@conference{data16,
author={Keerthi Koneru. and Venkata Sai Venkatesh Pulla. and Cihan Varol.},
title={Performance Evaluation of Phonetic Matching Algorithms on English Words and Street Names - Comparison and Correlation},
booktitle={Proceedings of the 5th International Conference on Data Management Technologies and Applications - Volume 1: DATA,},
year={2016},
pages={57-64},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005926300570064},
isbn={978-989-758-193-9},
}

TY - CONF

JO - Proceedings of the 5th International Conference on Data Management Technologies and Applications - Volume 1: DATA,
TI - Performance Evaluation of Phonetic Matching Algorithms on English Words and Street Names - Comparison and Correlation
SN - 978-989-758-193-9
AU - Koneru, K.
AU - Pulla, V.
AU - Varol, C.
PY - 2016
SP - 57
EP - 64
DO - 10.5220/0005926300570064

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.