De-identification of Clinical Text for Secondary Use: Research Issues

Hanna Berg, Aron Henriksson, Uno Fors, Hercules Dalianis

2021

Abstract

Privacy is challenged both by advances in AI-related technologies and by recently introduced legal regulations. The problem of privacy has been extensively studied within the privacy community, but this work has largely focused on methods for protecting and assessing the privacy of structured data. Research aiming to protect the integrity of patients based on clinical text has primarily referred to US law and relied on automatically recognising predetermined identifiers, both direct and indirect. This article discusses the challenges of re-using unstructured clinical data, in particular clinical text. It focuses on ambiguous and vague terminology, how different legislation affects the requirements for de-identification, differences between methods for unstructured and structured data, the impact of approaches based on named entity recognition and on replacing sensitive data with surrogates, and the lack of measures for usability and re-identification risk.

Paper Citation


in Harvard Style

Berg H., Henriksson A., Fors U. and Dalianis H. (2021). De-identification of Clinical Text for Secondary Use: Research Issues. In Proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2021) - Volume 5: HEALTHINF; ISBN 978-989-758-490-9, SciTePress, pages 592-599. DOI: 10.5220/0010318705920599


in Bibtex Style

@conference{healthinf21,
author={Hanna Berg and Aron Henriksson and Uno Fors and Hercules Dalianis},
title={De-identification of Clinical Text for Secondary Use: Research Issues},
booktitle={Proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2021) - Volume 5: HEALTHINF},
year={2021},
pages={592-599},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010318705920599},
isbn={978-989-758-490-9},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2021) - Volume 5: HEALTHINF
TI - De-identification of Clinical Text for Secondary Use: Research Issues
SN - 978-989-758-490-9
AU - Berg H.
AU - Henriksson A.
AU - Fors U.
AU - Dalianis H.
PY - 2021
SP - 592
EP - 599
DO - 10.5220/0010318705920599
PB - SciTePress