loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Gökhan Yildirim ; Radhakrishna Achanta and Sabine Süsstrunk

Affiliation: École Polytechnique Fédérale de Lausanne, Switzerland

Keyword(s): Text Detection and Recognition, Hough Forests, Feature Selection, Natural Images.

Abstract: Text detection and recognition in natural images are popular yet unsolved problems in computer vision. In this paper, we propose a technique that attempts to detect and recognize text in a unified manner by searching for words directly without reducing the image into text regions or individual characters. We present three contributions. First, we modify an object detection framework called Hough Forests (Gall et al., 2011) by introducing “Cross-Scale Binary Features” that compares the information between the same image patch at different scales. We use this modified technique to produce likelihood maps for every text character. Second, our word-formation cost function and computed likelihood maps are used to detect and recognize the text in natural images. We test our technique with the Street View House Numbers (Netzer et al., 2011) and the ICDAR 2003† (Lucas et al., 2003) datasets. For the SVHN dataset, our algorithm outperforms recent methods and has comparable performance using fewer training samples. We also exceed the state-of-the-art word recognition performance for ICDAR 2003 dataset by 4%. Our final contribution is a realistic dataset generation code for text characters. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 44.222.82.133

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Yildirim, G.; Achanta, R. and Süsstrunk, S. (2013). Text Recognition in Natural Images using Multiclass Hough Forests. In Proceedings of the International Conference on Computer Vision Theory and Applications (VISIGRAPP 2013) - Volume 1: VISAPP; ISBN 978-989-8565-47-1; ISSN 2184-4321, SciTePress, pages 737-741. DOI: 10.5220/0004197407370741

@conference{visapp13,
author={Gökhan Yildirim. and Radhakrishna Achanta. and Sabine Süsstrunk.},
title={Text Recognition in Natural Images using Multiclass Hough Forests},
booktitle={Proceedings of the International Conference on Computer Vision Theory and Applications (VISIGRAPP 2013) - Volume 1: VISAPP},
year={2013},
pages={737-741},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004197407370741},
isbn={978-989-8565-47-1},
issn={2184-4321},
}

TY - CONF

JO - Proceedings of the International Conference on Computer Vision Theory and Applications (VISIGRAPP 2013) - Volume 1: VISAPP
TI - Text Recognition in Natural Images using Multiclass Hough Forests
SN - 978-989-8565-47-1
IS - 2184-4321
AU - Yildirim, G.
AU - Achanta, R.
AU - Süsstrunk, S.
PY - 2013
SP - 737
EP - 741
DO - 10.5220/0004197407370741
PB - SciTePress