loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: James Philips and Nasseh Tabrizi

Affiliation: Department of Computer Science, East Carolina University, Greenville, North Carolina, U.S.A.

Keyword(s): Historical Document Processing, Archival Data, Handwriting Recognition, Optical Character Recognition, Digital Humanities.

Abstract: Historical Document Processing (HDP) is the process of digitizing written material from the past for future use by historians and other scholars. It incorporates algorithms and software tools from computer vision, document analysis and recognition, natural language processing, and machine learning to convert images of ancient manuscripts and early printed texts into a digital format usable in data mining and information retrieval systems. As libraries and other cultural heritage institutions have scanned their historical document archives, the need to transcribe the full text from these collections has become acute. Since HDP encompasses multiple sub-domains of computer science, knowledge relevant to its purpose is scattered across numerous journals and conference proceedings. This paper surveys the major phases of HDP, discussing standard algorithms, tools, and datasets and finally suggests directions for further research.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.137.185.180

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Philips, J. and Tabrizi, N. (2020). Historical Document Processing: A Survey of Techniques, Tools, and Trends. In Proceedings of the 12th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2020) - KDIR; ISBN 978-989-758-474-9; ISSN 2184-3228, SciTePress, pages 341-349. DOI: 10.5220/0010177403410349

@conference{kdir20,
author={James Philips. and Nasseh Tabrizi.},
title={Historical Document Processing: A Survey of Techniques, Tools, and Trends},
booktitle={Proceedings of the 12th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2020) - KDIR},
year={2020},
pages={341-349},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010177403410349},
isbn={978-989-758-474-9},
issn={2184-3228},
}

TY - CONF

JO - Proceedings of the 12th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2020) - KDIR
TI - Historical Document Processing: A Survey of Techniques, Tools, and Trends
SN - 978-989-758-474-9
IS - 2184-3228
AU - Philips, J.
AU - Tabrizi, N.
PY - 2020
SP - 341
EP - 349
DO - 10.5220/0010177403410349
PB - SciTePress