Constructing a Word Similarity Graph from Vector based Word Representation for Named Entity Recognition

Miguel Feria, Juan Paolo Balbin, Francis Michael Bautista

2018

Abstract

In this paper, we discuss a method for identifying a seed word that would best represent a class of named entities in a graphical representation of words and their similarities. Word networks, or word graphs, are representations of vectorized text where nodes are the words encountered in a corpus, and the weighted edges incident on the nodes represent how similar the words are to each other. Word networks are then divided into communities using the Louvain Method for community detection, then betweenness centrality of each node in each community is computed. The most central node in each community represents the most ideal candidate for a seed word of a named entity group which represents the community. Our results from our bilingual data set show that words with similar lexical content, from either language, belong to the same community.

Download


Paper Citation


in Harvard Style

Feria M., Balbin J. and Bautista F. (2018). Constructing a Word Similarity Graph from Vector based Word Representation for Named Entity Recognition.In Proceedings of the 14th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-989-758-324-7, pages 166-171. DOI: 10.5220/0006926201660171


in Bibtex Style

@conference{webist18,
author={Miguel Feria and Juan Paolo Balbin and Francis Michael Bautista},
title={Constructing a Word Similarity Graph from Vector based Word Representation for Named Entity Recognition},
booktitle={Proceedings of the 14th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,},
year={2018},
pages={166-171},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006926201660171},
isbn={978-989-758-324-7},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 14th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,
TI - Constructing a Word Similarity Graph from Vector based Word Representation for Named Entity Recognition
SN - 978-989-758-324-7
AU - Feria M.
AU - Balbin J.
AU - Bautista F.
PY - 2018
SP - 166
EP - 171
DO - 10.5220/0006926201660171