loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Areej Jaber 1 ; 2 and Paloma Martínez 2

Affiliations: 1 Applied Computer Science Department, PTUK University, Tulkarm, Palestine ; 2 Computer Science Department, Universidad Carlos III de Madrid, Leganés, Spain

Keyword(s): Clinical Natural Language Processing, Word Sense Disambiguation, Word Embeddings, Clinical Abbreviations, Pre-trained Models.

Abstract: Abbreviations are broadly used in clinical texts and most of them have more than one meaning which makes them highly ambiguous. Determining the right sense of an abbreviation is considered a Word Sense Disambiguation (WSD) task in clinical natural language processing (NLP). Many approaches are applied to disambiguate abbreviations in clinical narrative. However, supervised machine learning approaches are studied in this field extensively and have proven a good performance at tackling this problem. We have investigated four strategies that integrate pre-trained word embedding as features to train two supervised machine learning models: Support Vector Machines (SVM) and Naive Bayes (NB). Our training features include information of the context of target abbreviation, which is applied on 500 sentences for each of the 13 abbreviations that have been extracted from public clinical notes data sets from the University of Minnesota-affiliated (UMN) Fairview Health Services in the Twin Cities . Our results showed that SVM performs better than NB in all four strategies; the highest accuracy being 97.08% using a pre-trained model trained from Wikipedia, PubMed and PMC (PubMedCentral) texts. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 52.54.103.76

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Jaber, A. and Martínez, P. (2021). Disambiguating Clinical Abbreviations using Pre-trained Word Embeddings. In Proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2021) - HEALTHINF; ISBN 978-989-758-490-9; ISSN 2184-4305, SciTePress, pages 501-508. DOI: 10.5220/0010256105010508

@conference{healthinf21,
author={Areej Jaber. and Paloma Martínez.},
title={Disambiguating Clinical Abbreviations using Pre-trained Word Embeddings},
booktitle={Proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2021) - HEALTHINF},
year={2021},
pages={501-508},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010256105010508},
isbn={978-989-758-490-9},
issn={2184-4305},
}

TY - CONF

JO - Proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2021) - HEALTHINF
TI - Disambiguating Clinical Abbreviations using Pre-trained Word Embeddings
SN - 978-989-758-490-9
IS - 2184-4305
AU - Jaber, A.
AU - Martínez, P.
PY - 2021
SP - 501
EP - 508
DO - 10.5220/0010256105010508
PB - SciTePress