loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Author: Zuzana Nevěřilová

Affiliation: Natural Language Processing Centre, Faculty of Informatics, Masaryk University, Botanická 68a, Brno 602 00, Czech Republic

Keyword(s): Named Entity Recognition, Named Entity Alignment, Named Entity Discovery, Named Entity Linking.

Abstract: The paper describes two experiments with named entity discovery and alignment for English-Czech parallel data. In the previous work, we enriched the Parallel Global Voices corpus with named entity recognition (NER) for both languages and named entity linking (NEL) annotations for English. The alignment experiment employs sentence transformers and cosine similarity to identify NE translations from English to Czech and possibly other languages. The discovery experiment uses the same method to find possible translations between named entities in English and Czech n-grams. The described method achieves an F1 score of 0.94 in finding alignments between recognized entities. However, the same method can also discover unknown named entities with an F1 score of 0.70. The result indicates the method can be used to recognize named entities in parallel data in cases where no NER model is available with sufficient quality.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 216.73.216.12

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Nevěřilová and Z. (2025). Named Entity Discovery and Alignment in Parallel Data. In Proceedings of the 17th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART; ISBN 978-989-758-737-5; ISSN 2184-433X, SciTePress, pages 1215-1220. DOI: 10.5220/0013311300003890

@conference{icaart25,
author={Zuzana Nevě\v{r}ilová},
title={Named Entity Discovery and Alignment in Parallel Data},
booktitle={Proceedings of the 17th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART},
year={2025},
pages={1215-1220},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013311300003890},
isbn={978-989-758-737-5},
issn={2184-433X},
}

TY - CONF

JO - Proceedings of the 17th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART
TI - Named Entity Discovery and Alignment in Parallel Data
SN - 978-989-758-737-5
IS - 2184-433X
AU - Nevěřilová, Z.
PY - 2025
SP - 1215
EP - 1220
DO - 10.5220/0013311300003890
PB - SciTePress