Flexible Table Recognition and Semantic Interpretation System

Marcin Namysl, Alexander M. Esser, Sven Behnke, Joachim Köhler

2022

Abstract

Table extraction is an important but still unsolved problem. In this paper, we introduce a flexible and modular table extraction system. We develop two rule-based algorithms that perform the complete table recognition process, including table detection and segmentation, and support the most frequent table formats. Moreover, to incorporate the extraction of semantic information, we develop a graph-based table interpretation method. We conduct extensive experiments on the challenging table recognition benchmarks ICDAR 2013 and ICDAR 2019, achieving results competitive with state-of-the-art approaches. Our complete information extraction system exhibited a high F1 score of 0.7380. To support future research on information extraction from documents, we make the resources (ground-truth annotations, evaluation scripts, algorithm parameters) from our table interpretation experiment publicly available.

Paper Citation


in Harvard Style

Namysl M., Esser A., Behnke S. and Köhler J. (2022). Flexible Table Recognition and Semantic Interpretation System. In Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2022) - Volume 4: VISAPP; ISBN 978-989-758-555-5, SciTePress, pages 27-37. DOI: 10.5220/0010767600003124


in Bibtex Style

@conference{visapp22,
author={Marcin Namysl and Alexander M. Esser and Sven Behnke and Joachim Köhler},
title={Flexible Table Recognition and Semantic Interpretation System},
booktitle={Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2022) - Volume 4: VISAPP},
year={2022},
pages={27-37},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010767600003124},
isbn={978-989-758-555-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2022) - Volume 4: VISAPP
TI - Flexible Table Recognition and Semantic Interpretation System
SN - 978-989-758-555-5
AU - Namysl M.
AU - Esser A.
AU - Behnke S.
AU - Köhler J.
PY - 2022
SP - 27
EP - 37
DO - 10.5220/0010767600003124
PB - SciTePress