Rethinking Image-Based Table Recognition Using Weakly Supervised Methods

Nam Ly; Atsuhiro Takasu; Phuc Nguyen; Hideaki Takeda

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

Rethinking Image-Based Table Recognition Using Weakly Supervised Methods

Topics: Deep Learning and Neural Networks; Document Analysis; Image and Video Analysis and Understanding

In Proceedings of the 12th International Conference on Pattern Recognition Applications and Methods ICPRAM - Volume 1, 872-880, 2023 , Lisbon, Portugal

Authors: Nam Ly ; Atsuhiro Takasu ; Phuc Nguyen and Hideaki Takeda

Affiliation: National Institute of Informatics, Tokyo, Japan

Keyword(s): Table Recognition, End-to-End, Self-Attention, Weakly Supervised Learning, WikiTableSet.

Abstract: Most of the previous methods for table recognition rely on training datasets containing many richly annotated table images. Detailed table image annotation, e.g., cell or text bounding box annotation, however, is costly and often subjective. In this paper, we propose a weakly supervised model named WSTabNet for table recognition that relies only on HTML (or LaTeX) code-level annotations of table images. The proposed model consists of three main parts: an encoder for feature extraction, a structure decoder for generating table structure, and a cell decoder for predicting the content of each cell in the table. Our system is trained end-to-end by stochastic gradient descent algorithms, requiring only table images and their ground-truth HTML (or LaTeX) representations. To facilitate table recognition with deep learning, we create and release WikiTableSet, the largest publicly available image-based table recognition dataset built from Wikipedia. WikiTableSet contains nearly 4 million Engl ish table images, 590K Japanese table images, and 640k French table images with corresponding HTML representation and cell bounding boxes. The extensive experiments on WikiTableSet and two large-scale datasets: FinTabNet and PubTabNet demonstrate that the proposed weakly supervised model achieves better, or similar accuracies compared to the state-of-the-art models on all benchmark datasets. (More)

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 216.73.216.139

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Ly, N., Takasu, A., Nguyen, P. and Takeda, H. (2023). Rethinking Image-Based Table Recognition Using Weakly Supervised Methods. In Proceedings of the 12th International Conference on Pattern Recognition Applications and Methods - ICPRAM; ISBN 978-989-758-626-2; ISSN 2184-4313, SciTePress, pages 872-880. DOI: 10.5220/0011682600003411

@conference{icpram23,
author={Nam Ly and Atsuhiro Takasu and Phuc Nguyen and Hideaki Takeda},
title={Rethinking Image-Based Table Recognition Using Weakly Supervised Methods},
booktitle={Proceedings of the 12th International Conference on Pattern Recognition Applications and Methods - ICPRAM},
year={2023},
pages={872-880},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0011682600003411},
isbn={978-989-758-626-2},
issn={2184-4313},
}

TY - CONF

JO - Proceedings of the 12th International Conference on Pattern Recognition Applications and Methods - ICPRAM
TI - Rethinking Image-Based Table Recognition Using Weakly Supervised Methods
SN - 978-989-758-626-2
IS - 2184-4313
AU - Ly, N.
AU - Takasu, A.
AU - Nguyen, P.
AU - Takeda, H.
PY - 2023
SP - 872
EP - 880
DO - 10.5220/0011682600003411
PB - SciTePress