loading
Documents

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Mena B. Morgan and Maurice van Keulen

Affiliation: University of Twente, Netherlands

ISBN: 978-989-8565-75-4

Keyword(s): Named Entity Disambiguation, Social Media, Twitter.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Information Extraction ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Mining Text and Semi-Structured Data ; Symbolic Systems

Abstract: Social media is a rich source of information. To make use of this information, it is sometimes required to extract and disambiguate named entities. In this paper, we focus on named entity disambiguation (NED) in twitter messages. NED in tweets is challenging in two ways. First, the limited length of Tweet makes it hard to have enough context while many disambiguation techniques depend on it. The second is that many named entities in tweets do not exist in a knowledge base (KB). We share ideas from information retrieval (IR) and NED to propose solutions for both challenges. For the first problem we make use of the gregarious nature of tweets to get enough context needed for disambiguation. For the second problem we look for an alternative home page if there is no Wikipedia page represents the entity. Given a mention, we obtain a list of Wikipedia candidates from YAGO KB in addition to top ranked pages from Google search engine. We use Support Vector Machine (SVM) to rank the candidate pages to find the best representative entities. Experiments conducted on two data sets show better disambiguation results compared with the baselines and a competitor. (More)

PDF ImageFull Text

Download
Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 34.229.76.193

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
B. Morgan, M. and van Keulen, M. (2013). A Generic Open World Named Entity Disambiguation Approach for Tweets.In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval and the International Conference on Knowledge Management and Information Sharing - Volume 1: SNAM, (IC3K 2013) ISBN 978-989-8565-75-4, pages 267-276. DOI: 10.5220/0004536302670276

@conference{snam13,
author={Mena B. Morgan. and Maurice van Keulen.},
title={A Generic Open World Named Entity Disambiguation Approach for Tweets},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval and the International Conference on Knowledge Management and Information Sharing - Volume 1: SNAM, (IC3K 2013)},
year={2013},
pages={267-276},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004536302670276},
isbn={978-989-8565-75-4},
}

TY - CONF

JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval and the International Conference on Knowledge Management and Information Sharing - Volume 1: SNAM, (IC3K 2013)
TI - A Generic Open World Named Entity Disambiguation Approach for Tweets
SN - 978-989-8565-75-4
AU - B. Morgan, M.
AU - van Keulen, M.
PY - 2013
SP - 267
EP - 276
DO - 10.5220/0004536302670276

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.