Text Line Aggregation

Christopher Beck, Alan Broun, Majid Mirmehdi, Tony Pipe, Chris Melhuish

2014

Abstract

We present a new approach to text line aggregation that can work as both a line formation stage for a myriad of text segmentation methods (over all orientations) and as an extra level of filtering to remove false text candidates. The proposed method is centred on the processing of candidate text components based on local and global measures. We use orientation histograms to build an understanding of paragraphs, and filter noise and construct lines based on the discovery of prominent orientations. Paragraphs are then reduced to seed components and lines are reconstructed around these components. We demonstrate results for text aggregation on the ICDAR 2003 Robust Reading Competition data, and also present results on our own more complex data set.

References

  1. Chen, H., Tsai, S., Schroth, G., Chen, D., Grzeszczuk, R., and Girod, B. (2011). Robust text detection in natural images with edge-enhanced maximally stable extremal regions. In ICIP, pages 2609 - 2612.
  2. Chen, X., Yang, J., Zhang, J., and Waibel, A. (2004). Automatic detection and recognition of signs from natural scenes. In ICIP.
  3. Chen, X. and Yuille, A. (2004). Detecting and reading text in natural scenes. In CVPR, volume 2, pages II - 366.
  4. Epshtein, B., Ofek, E., and Wexler, Y. (2010). Detecting text in natural scenes with stroke width transform. In CVPR, pages 2963-2970.
  5. Ezaki, N., Bulacu, M., and Schomaker, L. (2004). Text detection from natural scene images: towards a system for visually impaired persons. In ICPR, volume 2, pages 683 - 686.
  6. Fu, L., Wang, W., and Zhan, Y. (2005). A robust text segmentation approach in complex background based on multiple constraints. In AMIP-PMC, pages 594 - 605.
  7. Jung, C., Liu, Q., and Kim, J. (2009). A stroke filter and its application to text localization. In PRL, volume 30, pages 114 - 122.
  8. León, M., Mallo, S., and Gasull, A. (2005). A tree structured-based caption text detection approach. In ICVIPP, page 220.
  9. Lintern, J. (2008). Recognizing text in Google Street View images. Statistics, 6.
  10. Liu, X. and Samarabandu, J. (2005). An edge-based text region extraction algorithm for indoor mobile robot navigation. In ICMA, volume 2, pages 701 - 706.
  11. Liu, X. and Samarabandu, J. (2006). Multiscale edge-based text extraction from complex images. In ICME, pages 1721 - 1724.
  12. Liu, Y., Goto, S., and Ikenaga, T. (2005). A robust algorithm for text detection in color images. In ICDAR, pages 399 - 403.
  13. Lucas, S. M., Panaretos, A., Sosa, L., Tang, A., Wong, S., and Young, R. (2003). ICDAR 2003 robust reading competitions. In ICDAR.
  14. Merino, C. and Mirmehdi, M. (2007). A framework towards realtime detection and tracking of text. In CBDAR, pages 10 - 17.
  15. Merino-Gracia, C., Lenc, K., and Mirmehdi, M. (2011). A head-mounted device for recognizing text in natural scenes. In CBDAR, pages 29 - 41.
  16. Neumann, L. and Matas, J. (2011a). Estimating hidden parameters for text localization and recognition. In Computer Vision Winter Workshop.
  17. Neumann, L. and Matas, J. (2011b). A method for text localization and recognition in real-world images. In ACCV, pages 770 - 783.
  18. Nistér, D. and Stewénius, H. (2008). Linear time maximally stable extremal regions. In ECCV, pages 183 - 196.
  19. Pan, Y., Hou, X., and Liu, C. (2011). A hybrid approach to detect and localize texts in natural scene images. Image Processing, IEEE Transactions on, 20(3):800 - 813.
  20. Pilu, M. (2001). Extraction of illusory linear clues in perspectively skewed documents. In CVPR, volume 1, pages I - 363.
  21. Pratheeba, T., Kavitha, V., and Rajeswari, S. (2010). Morphology based text detection and extraction from complex video scene. IJET, 2(3):200 - 206.
  22. Retornaz, T. and Marcotegui, B. (2007). Scene text localization based on the ultimate opening. In ISMM, volume 1, pages 177 - 188.
  23. Sedgewick, R. (2002). Algorithms in C, Part 5: Graph Algorithms.
  24. Yi, C. and Tian, Y. (2011). Text string detection from natural scenes by structure-based partition and grouping. Image Processing, IEEE Transactions on, 20(9):2594 - 2605.
  25. Zhang, Z., Lu, T., Su, F., and Yang, R. (2010). A new text detection algorithm for content-oriented line drawing image retrieval. In Advances in Multimedia Information Processing-PCM, pages 338 - 347.
  26. Zini, L., Destrero, A., and Odone, F. (2009). A classification architecture based on connected components for text detection in unconstrained environments. In AVSS, pages 176 - 181.
Download


Paper Citation


in Harvard Style

Beck C., Broun A., Mirmehdi M., Pipe T. and Melhuish C. (2014). Text Line Aggregation . In Proceedings of the 3rd International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM, ISBN 978-989-758-018-5, pages 393-401. DOI: 10.5220/0004817903930401


in Bibtex Style

@conference{icpram14,
author={Christopher Beck and Alan Broun and Majid Mirmehdi and Tony Pipe and Chris Melhuish},
title={Text Line Aggregation},
booktitle={Proceedings of the 3rd International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,},
year={2014},
pages={393-401},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004817903930401},
isbn={978-989-758-018-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 3rd International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,
TI - Text Line Aggregation
SN - 978-989-758-018-5
AU - Beck C.
AU - Broun A.
AU - Mirmehdi M.
AU - Pipe T.
AU - Melhuish C.
PY - 2014
SP - 393
EP - 401
DO - 10.5220/0004817903930401