Paper-Based Health Records: A Case Study on the Digitization of Handwritten Clinical Records
Vincenza Carchiolo, Michele Malgeri, Lorenzo Spadaro Sapari
2025
Abstract
This paper presents a case study focused on the application of handwriting recognition to digitize historical clinical records containing significant handwritten content. The primary objective is to assess the feasibility of using commercial OCR technologies, in particular Microsoft Azure’s handwriting recognition API, for processing health documents. The study aims to determine whether these tools can support the extraction of meaningful clinical information, not only by recognizing individual characters but also by leveraging the structural layout of documents, such as forms, to infer semantic content. Our methodology includes empirical evaluation of OCR output on real-world patient records, alongside a qualitative analysis of common recognition errors. In addition, we review relevant approaches from the literature, highlighting recent advances in deep learning for document understanding. The findings indicate that general-purpose OCR systems are currently insufficient for reliable clinical data extraction in such contexts, primarily due to the complexity and variability of handwritten medical records. However, the results also suggest that structural cues present in form-based documents could be harnessed, through tailored AI-based techniques, to significantly improve recognition and downstream information retrieval.
Paper Citation
in Harvard Style
Carchiolo V., Malgeri M. and Sapari L. (2025). Paper-Based Health Records: A Case Study on the Digitization of Handwritten Clinical Records. In Proceedings of the 21st International Conference on Web Information Systems and Technologies - Volume 1: WEBIST; ISBN 978-989-758-772-6, SciTePress, pages 244-251. DOI: 10.5220/0013853900003985
in Bibtex Style
@conference{webist25,
author={Vincenza Carchiolo and Michele Malgeri and Lorenzo Sapari},
title={Paper-Based Health Records: A Case Study on the Digitization of Handwritten Clinical Records},
booktitle={Proceedings of the 21st International Conference on Web Information Systems and Technologies - Volume 1: WEBIST},
year={2025},
pages={244-251},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013853900003985},
isbn={978-989-758-772-6},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 21st International Conference on Web Information Systems and Technologies - Volume 1: WEBIST
TI - Paper-Based Health Records: A Case Study on the Digitization of Handwritten Clinical Records
SN - 978-989-758-772-6
AU - Carchiolo V.
AU - Malgeri M.
AU - Sapari L.
PY - 2025
SP - 244
EP - 251
DO - 10.5220/0013853900003985
PB - SciTePress