Authors: Ying-Chi Lin ; Phillip Hoffmann and Erhard Rahm

Affiliation: Department of Computer Science, Leipzig University, Germany

Keyword(s): Semantic Annotation, UMLS, Sentence Embedding, BERT, Medical Forms.

Abstract: Annotating documents with ontology concepts enhances data quality and interoperability. Such semantic annotations also make it easier to compare multiple studies, and even results across languages. The FDA therefore requires that all submitted medical forms be annotated. In this work, we aim to annotate medical forms in German. These standardized forms are used in health care practice and biomedical research and are translated or adapted to various languages. We focus on annotations that cover the whole question in the form, as required by the FDA. We need to map these non-English questions to English concepts because many of these concepts do not exist in other languages. Because of translation and adaptation, the non-English forms deviate syntactically from the original forms, which causes conventional string matching methods to produce low-quality annotations. Consequently, we propose a new approach that incorporates semantics into the mapping procedure. By using sentence embeddings generated by deep networks in the cross-lingual annotation process, we achieve a recall of 84.62%. This is an improvement of 134% compared to conventional string matching. Likewise, we achieve improvements of 51% in precision and 65% in F-measure.
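
The general idea of embedding-based matching described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the vectors below are hand-made stand-ins for sentence embeddings (no BERT model is loaded), the concept identifiers `C_A`, `C_B`, and `C_C` are hypothetical, and ranking by cosine similarity is an assumed choice of similarity measure. The sketch also shows how precision, recall, and F-measure can be computed from predicted and gold annotations.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank_concepts(question_vec, concept_vecs, top_k=1):
    """Return the top_k (concept_id, score) pairs, most similar first."""
    scored = [(cid, cosine(question_vec, vec)) for cid, vec in concept_vecs.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


def precision_recall_f1(predicted, gold):
    """Compare two sets of (question_id, concept_id) annotation pairs."""
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Toy data: hypothetical concept IDs with made-up 3-dimensional "embeddings".
concept_vecs = {
    "C_A": [1.0, 0.0, 0.0],
    "C_B": [0.0, 1.0, 0.0],
}
# Made-up embedding of a (translated) form question.
question_vec = [0.9, 0.1, 0.0]

best = rank_concepts(question_vec, concept_vecs, top_k=1)
print(best[0][0])

predicted = {("q1", "C_A"), ("q2", "C_B")}
gold = {("q1", "C_A"), ("q2", "C_C")}
print(precision_recall_f1(predicted, gold))
```

In a real pipeline the vectors would come from a multilingual sentence encoder, so that a German question and an English concept description with similar meaning land close together even when their surface strings differ.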

CC BY-NC-ND 4.0

Paper citation in several formats:
Lin, Y.; Hoffmann, P. and Rahm, E. (2021). Enhancing Cross-lingual Semantic Annotations using Deep Network Sentence Embeddings. In Proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2021) - HEALTHINF; ISBN 978-989-758-490-9; ISSN 2184-4305, SciTePress, pages 188-199. DOI: 10.5220/0010256801880199

@conference{healthinf21,
author={Ying{-}Chi Lin and Phillip Hoffmann and Erhard Rahm},
title={Enhancing Cross-lingual Semantic Annotations using Deep Network Sentence Embeddings},
booktitle={Proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2021) - HEALTHINF},
year={2021},
pages={188-199},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010256801880199},
isbn={978-989-758-490-9},
issn={2184-4305},
}

TY - CONF
JO - Proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2021) - HEALTHINF
TI - Enhancing Cross-lingual Semantic Annotations using Deep Network Sentence Embeddings
SN - 978-989-758-490-9
IS - 2184-4305
AU - Lin, Y.
AU - Hoffmann, P.
AU - Rahm, E.
PY - 2021
SP - 188
EP - 199
DO - 10.5220/0010256801880199
PB - SciTePress