SCAN: Sequence-character Aware Network for Text Recognition

Heba Hassan, Marwan Torki, Mohamed Hussein

Abstract

Text recognition remains a challenging problem in natural-scene text reading. Given the sequential nature of text, the problem is usually posed as sequence prediction from a whole-word image. Alternatively, it can be posed as a character prediction problem, an approach that is typically more robust to challenging word shapes. To attain the best of both approaches, we propose the Sequence-Character Aware Network (SCAN). SCAN first locates and recognizes the characters, and then generates the word using a sequence-based approach. It comprises two modules: a semantic-segmentation-based character prediction module and an encoder-decoder network for word generation. Training is done in two stages. In the first stage, we adopt a multi-task training technique with both character-level and word-level losses and trainable loss weighting. In the second stage, the character-level loss is removed, enabling the use of data with only word-level annotations. Experiments on several datasets covering both regular and irregular text show state-of-the-art performance for the proposed approach. They also show that the proposed approach is robust to noisy word detection.
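
As a rough illustration of the two-stage objective described above, the following minimal sketch combines a character-level and a word-level loss with learnable weights in stage one and keeps only the word-level loss in stage two. The abstract does not specify the form of the trainable weighting; the uncertainty-style form used here (each loss scaled by exp(-s) plus a regularizing term s) is an assumption, and all function and parameter names are hypothetical.

```python
import math


def stage_one_loss(char_loss, word_loss, log_var_char, log_var_word):
    """Multi-task loss with learnable weights.

    ASSUMPTION: uncertainty-style weighting, exp(-s) * loss + s, where s is a
    trainable scalar per task. The abstract only says the weighting is
    trainable; the actual form may differ.
    """
    weighted_char = math.exp(-log_var_char) * char_loss + log_var_char
    weighted_word = math.exp(-log_var_word) * word_loss + log_var_word
    return weighted_char + weighted_word


def stage_two_loss(word_loss):
    """Stage two drops the character-level term, so only word labels are needed."""
    return word_loss


def total_loss(stage, char_loss, word_loss, log_var_char=0.0, log_var_word=0.0):
    """Select the objective for the given training stage (1 or 2)."""
    if stage == 1:
        return stage_one_loss(char_loss, word_loss, log_var_char, log_var_word)
    if stage == 2:
        return stage_two_loss(word_loss)
    raise ValueError("stage must be 1 or 2")
```

In a real training loop the two log-variance scalars would be parameters updated by the optimizer alongside the network weights; here they are plain floats so the arithmetic stays easy to follow. In stage two, `char_loss` can be passed as `None`, since samples with only word-level annotations have no character-level term.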


Paper Citation


in Harvard Style

Hassan H., Torki M. and Hussein M. (2021). SCAN: Sequence-character Aware Network for Text Recognition. In Proceedings of the 16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP, ISBN 978-989-758-488-6, pages 602-609. DOI: 10.5220/0010321106020609


in Bibtex Style

@conference{visapp21,
author={Heba Hassan and Marwan Torki and Mohamed Hussein},
title={SCAN: Sequence-character Aware Network for Text Recognition},
booktitle={Proceedings of the 16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP},
year={2021},
pages={602-609},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010321106020609},
isbn={978-989-758-488-6},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP
TI - SCAN: Sequence-character Aware Network for Text Recognition
SN - 978-989-758-488-6
AU - Hassan H.
AU - Torki M.
AU - Hussein M.
PY - 2021
SP - 602
EP - 609
DO - 10.5220/0010321106020609