loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Ngoc Phuoc An Vo 1 and Octavian Popescu 2

Affiliations: 1 Xerox Research Centre Europe, France ; 2 IBM T.J.Watson Research, United States

Keyword(s): Machine Learning, Natural Language Processing (NLP), Semantic Textual Similarity (STS).

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Computational Intelligence ; Evolutionary Computing ; Information Extraction ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Machine Learning ; Soft Computing ; Symbolic Systems

Abstract: Building a system able to cope with various phenomena which falls under the umbrella of semantic similarity is far from trivial. It is almost always the case that the performances of a system do not vary consistently or predictably from corpora to corpora. We analyzed the source of this variance and found that it is related to the word-pair similarity distribution among the topics in the various corpora. Then we used this insight to construct a 4-module system that would take into consideration not only string and semantic word similarity, but also word alignment and sentence structure. The system consistently achieves an accuracy which is very close to the state of the art, or reaching a new state of the art. The system is based on a multi-layer architecture and is able to deal with heterogeneous corpora which may not have been generated by the same distribution.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 107.21.176.63

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Vo, N. and Popescu, O. (2016). A Multi-Layer System for Semantic Textual Similarity. In Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2016) - KDIR; ISBN 978-989-758-203-5; ISSN 2184-3228, SciTePress, pages 56-67. DOI: 10.5220/0006045800560067

@conference{kdir16,
author={Ngoc Phuoc An Vo. and Octavian Popescu.},
title={A Multi-Layer System for Semantic Textual Similarity},
booktitle={Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2016) - KDIR},
year={2016},
pages={56-67},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006045800560067},
isbn={978-989-758-203-5},
issn={2184-3228},
}

TY - CONF

JO - Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2016) - KDIR
TI - A Multi-Layer System for Semantic Textual Similarity
SN - 978-989-758-203-5
IS - 2184-3228
AU - Vo, N.
AU - Popescu, O.
PY - 2016
SP - 56
EP - 67
DO - 10.5220/0006045800560067
PB - SciTePress