Experimental Application of Semantic Segmentation Models Fine-Tuned with Synthesized Document Images to Text Line Segmentation in a Handwritten Japanese Historical Document

Sayaka Mori, Tetsuya Suzuki

2024

Abstract

Because it is difficult even for Japanese to read handwritten Japanese historical documents, computer-assisted transcription of such documents is helpful. We plan to apply semantic segmentation to text line segmentation for handwritten Japanese historical documents. We use both synthesized document images resembling a Japanese historical document and annotations for them because it is time-consuming to manually annotate a large set of document images for training data. The purpose of this research is to evaluate the effect of fine-tuning semantic segmentation models with synthesized Japanese historical document images in text line segmentation. The experimental results show that the segmentation results produced by our method are generally satisfactory for test data consisting of synthesized document images and are also satisfactory for Japanese historical document images with straightforward formats.

Download


Paper Citation


in Harvard Style

Mori S. and Suzuki T. (2024). Experimental Application of Semantic Segmentation Models Fine-Tuned with Synthesized Document Images to Text Line Segmentation in a Handwritten Japanese Historical Document. In Proceedings of the 13th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM; ISBN 978-989-758-684-2, SciTePress, pages 826-832. DOI: 10.5220/0012433100003654


in Bibtex Style

@conference{icpram24,
author={Sayaka Mori and Tetsuya Suzuki},
title={Experimental Application of Semantic Segmentation Models Fine-Tuned with Synthesized Document Images to Text Line Segmentation in a Handwritten Japanese Historical Document},
booktitle={Proceedings of the 13th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM},
year={2024},
pages={826-832},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012433100003654},
isbn={978-989-758-684-2},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 13th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM
TI - Experimental Application of Semantic Segmentation Models Fine-Tuned with Synthesized Document Images to Text Line Segmentation in a Handwritten Japanese Historical Document
SN - 978-989-758-684-2
AU - Mori S.
AU - Suzuki T.
PY - 2024
SP - 826
EP - 832
DO - 10.5220/0012433100003654
PB - SciTePress