A Two Step Fine-tuning Approach for Text Recognition on Identity Documents

Francesco Visalli, Antonio Patrizio, Massimo Ruffolo

2021

Abstract

Manually extracting data from documents for digitization is a long, tedious, and error-prone job. In recent years, technologies capable of automating this process have gained ground and achieved surprising results. Research in this field is driven by the strong interest of organizations that have recognized that automating data entry reduces working time and speeds up business processes. Documents of interest are heterogeneous in format and content. They may be natively machine-readable, or not, as when they are images obtained by scanning paper. Documents in image format require pre-processing before information extraction can be applied. A typical pre-processing pipeline consists of two steps: text detection and text recognition. This work proposes a two-step fine-tuning approach for text recognition in Italian identity documents based on Scene Text Recognition networks. Experiments show promising results.


Paper Citation


in Harvard Style

Visalli F., Patrizio A. and Ruffolo M. (2021). A Two Step Fine-tuning Approach for Text Recognition on Identity Documents. In Proceedings of the 13th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART, ISBN 978-989-758-484-8, pages 837-844. DOI: 10.5220/0010252208370844


in Bibtex Style

@conference{icaart21,
author={Francesco Visalli and Antonio Patrizio and Massimo Ruffolo},
title={A Two Step Fine-tuning Approach for Text Recognition on Identity Documents},
booktitle={Proceedings of the 13th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART},
year={2021},
pages={837-844},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010252208370844},
isbn={978-989-758-484-8},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 13th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART
TI - A Two Step Fine-tuning Approach for Text Recognition on Identity Documents
SN - 978-989-758-484-8
AU - Visalli F.
AU - Patrizio A.
AU - Ruffolo M.
PY - 2021
SP - 837
EP - 844
DO - 10.5220/0010252208370844