Transfer Learning for Handwriting Recognition on Historical Documents

Adeline Granet, Emmanuel Morin, Harold Mouchère, Solen Quiniou, Christian Viard-Gaudin

Abstract

In this work, we investigate handwriting recognition on new historical handwritten documents using transfer learning. Establishing a manual ground truth for a new collection of handwritten documents is time-consuming, but it is needed to train and test recognition systems. We want to build a recognition system without performing this annotation step. Our research deals with transfer learning from heterogeneous datasets that have a ground truth and share common properties with a new dataset that has none. The main difficulties of transfer learning lie in changes in writing style, vocabulary, and named entities across centuries and datasets. In our experiment, we show how a CNN-BLSTM-CTC neural network behaves on the task of transcribing handwritten titles of plays of the Italian Comedy when trained on combinations of various datasets such as RIMES, George Washington, and Los Esposalles. We show that the choice of training datasets and the merging methods are decisive for the results of the transfer learning task.

Paper Citation


in Harvard Style

Granet A., Morin E., Mouchère H., Quiniou S. and Viard-Gaudin C. (2018). Transfer Learning for Handwriting Recognition on Historical Documents. In Proceedings of the 7th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM, ISBN 978-989-758-276-9, pages 432-439. DOI: 10.5220/0006598804320439


in Bibtex Style

@conference{icpram18,
author={Adeline Granet and Emmanuel Morin and Harold Mouchère and Solen Quiniou and Christian Viard-Gaudin},
title={Transfer Learning for Handwriting Recognition on Historical Documents},
booktitle={Proceedings of the 7th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM},
year={2018},
pages={432-439},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006598804320439},
isbn={978-989-758-276-9},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 7th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM
TI - Transfer Learning for Handwriting Recognition on Historical Documents
SN - 978-989-758-276-9
AU - Granet A.
AU - Morin E.
AU - Mouchère H.
AU - Quiniou S.
AU - Viard-Gaudin C.
PY - 2018
SP - 432
EP - 439
DO - 10.5220/0006598804320439