Authors: Zhi-Chen Yan 1 and Stephanie A. Yu 2

Affiliations: 1 Facebook Research, 1 Hacker Way, Menlo Park, CA 94025, U.S.A.; 2 West Island School, 250 Victoria Road, Pokfulam, Hong Kong, Republic of China

Keyword(s): Attention, Convolution, Deep Learning, LSTM, Text Recognition.

Abstract: Recognizing text in real-world scenes is an important research topic in computer vision. Many deep-learning-based techniques have been proposed. Such techniques typically follow an encoder-decoder architecture and use a sequence of feature vectors as the intermediate representation. In this approach, useful 2D spatial information in the input image may be lost due to the vector-based encoding. In this paper, we formulate scene text recognition as a spatiotemporal sequence translation problem and introduce a novel attention-based spatiotemporal decoding framework. We first encode an image as a spatiotemporal sequence, which the decoder then translates into a sequence of output characters. Our encoding and decoding stages are integrated to form an end-to-end trainable deep network. Experimental results on multiple benchmarks, including IIIT5k, SVT, ICDAR, and RCTW-17, indicate that our method can significantly outperform conventional attention frameworks.

CC BY-NC-ND 4.0

Paper citation in several formats:
Yan, Z. and Yu, S. (2020). Attention-based Text Recognition in the Wild. In Proceedings of the 1st International Conference on Deep Learning Theory and Applications - DeLTA; ISBN 978-989-758-441-1, SciTePress, pages 42-49. DOI: 10.5220/0009970200420049

@conference{delta20,
author={Zhi{-}Chen Yan and Stephanie A. Yu},
title={Attention-based Text Recognition in the Wild},
booktitle={Proceedings of the 1st International Conference on Deep Learning Theory and Applications - DeLTA},
year={2020},
pages={42--49},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0009970200420049},
isbn={978-989-758-441-1},
}

TY - CONF
JO - Proceedings of the 1st International Conference on Deep Learning Theory and Applications - DeLTA
TI - Attention-based Text Recognition in the Wild
SN - 978-989-758-441-1
AU - Yan, Z.
AU - Yu, S.
PY - 2020
SP - 42
EP - 49
DO - 10.5220/0009970200420049
PB - SciTePress