N. Nikolaou, E. Badekas, N. Papamarkos, C. Strouthopoulos



Abstract. A new method for text localization in color cover pages and general color document images is presented. The colors of the document image are first reduced to a small number using a color reduction technique based on a Kohonen Self-Organized Map (KSOM) neural network. Each resulting color defines a color plane, from which the connected components (CCs) are extracted. In each color plane, a CC filtering procedure is applied, followed by a local grouping procedure that builds groups of CCs. These groups are then refined by computing the Direction Of Connection (DOC) property for each CC, and the DOC property is used to classify the groups of CCs as text or non-text regions. Finally, the text regions identified in the different color planes are superimposed to obtain the text localization for the entire document. The proposed technique was extensively tested with a large number of color documents.
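As a rough illustration of the color-plane and connected-component steps described in the abstract, the following minimal sketch splits an already color-reduced image into one binary plane per color and extracts the bounding boxes of the CCs in each plane. It is not the authors' implementation: the KSOM color reduction is assumed to have been done beforehand (the input is a grid of palette indices), the use of 8-connectivity is an assumption, the function names `color_planes` and `connected_components` are hypothetical, and the CC filtering, grouping, and DOC stages are not shown.

```python
from collections import deque


def color_planes(indexed):
    """Split a palette-indexed image (list of rows of ints) into one binary mask per color."""
    colors = sorted({c for row in indexed for c in row})
    return {c: [[1 if px == c else 0 for px in row] for row in indexed] for c in colors}


def connected_components(mask):
    """Return bounding boxes (top, left, bottom, right) of the foreground regions.

    Assumption: 8-connectivity; the abstract does not state which connectivity is used.
    """
    h = len(mask)
    w = len(mask[0]) if h else 0
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for y in range(h):
        for x in range(w):
            if not mask[y][x] or seen[y][x]:
                continue
            # Breadth-first flood fill starting from an unvisited foreground pixel.
            seen[y][x] = True
            queue = deque([(y, x)])
            top, left, bottom, right = y, x, y, x
            while queue:
                cy, cx = queue.popleft()
                top, bottom = min(top, cy), max(bottom, cy)
                left, right = min(left, cx), max(right, cx)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = cy + dy, cx + dx
                        if 0 <= ny < h and 0 <= nx < w and mask[ny][nx] and not seen[ny][nx]:
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


# Toy color-reduced image: 0 is the background color, 1 and 2 are two "ink" colors.
image = [
    [0, 0, 0, 0, 0],
    [0, 1, 0, 2, 0],
    [0, 1, 0, 2, 0],
    [0, 0, 0, 0, 0],
]
planes = color_planes(image)
components = {color: connected_components(mask) for color, mask in planes.items()}
```

In a fuller pipeline, the boxes found in each plane would be filtered, grouped, and classified before the per-plane text regions are superimposed.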



Paper Citation

in Harvard Style

Nikolaou N., Badekas E., Papamarkos N. and Strouthopoulos C. (2006). TEXT LOCALIZATION IN COLOR DOCUMENTS. In Proceedings of the First International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, ISBN 972-8865-40-6, pages 181-188. DOI: 10.5220/0001365801810188

in Bibtex Style

author={N. Nikolaou and E. Badekas and N. Papamarkos and C. Strouthopoulos},
booktitle={Proceedings of the First International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP},

in EndNote Style

JO - Proceedings of the First International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP
SN - 972-8865-40-6
AU - Nikolaou N.
AU - Badekas E.
AU - Papamarkos N.
AU - Strouthopoulos C.
PY - 2006
SP - 181
EP - 188
DO - 10.5220/0001365801810188