COLOR SEGMENTATION OF COMPLEX DOCUMENT IMAGES

N. Nikolaou, N. Papamarkos

2006

Abstract

In this paper we present a new method for color segmentation of complex document images which can be used as a preprocessing step of a text information extraction application. From the edge map of an image, we choose a representative set of samples of the input color image and built the 3D histogram of the RGB color space. These samples are used to locate a relatively large number of proper points in the 3D color space and use them in order to initially reduce the colors. From this step an oversegmented image is produced which usually has no more than 100 colors. To extract the final result, a mean shift procedure starts from the calculated points and locates the final color clusters of the RGB color distribution. Also, to overcome noise problems, a proposed edge preserving smoothing filter is used to enhance the quality of the image. Experimental results showed the method’s capability of producing correctly segmented complex color documents while removing background noise or low contrast objects which is very desirable in text information extraction applications. Additionally, our method has the ability to cluster randomly shaped distributions.

References

  1. Y. Zhong, K. Karu, A.K. Jain, 1995. Locating text in complex color images. Pattern Recognition 28 (10), 1523-1535.
  2. W.Y. Chen and S.Y. Chen, 1998. Adaptive page segmentation for color technical journals' cover images. Image and Vision Computing 16, 855-877.
  3. K. Sobottka et al, 2000. Text Extraction from Colored Book and Journal Covers. International Journal on Document Analysis and Recognition, vol. 2, No. 4, pp. 163-176.
  4. H. Hase, T. Shinokawa, M. Yoneda, C.Y. Suen, 2001. Character string extraction from color documents. Pattern Recognition 34 (7), 1349-1365.
  5. C. Strouthopoulos, N. Papamarkos and A. Atsalakis, 2002. Text extraction in complex color documents. Pattern Recognition, Vol. 35, Issue 8, pp. 1743-1758.
  6. Hiroyuki Hase, Masaaki Yoneda, Shogo Tokai, Jien Kato and Ching Y. Suen, 2003. Color segmentation for text extraction. International Journal on Document Analysis and Recognition 6(4): 271-284.
  7. Bin Wang, Xiang-Feng Li, Feng Liu and Fu-Qiao Hu, 2005. Color text image binarization based on binary texture analysis. Pattern Recognition Letters, Volume 26, Issue 11, Pages 1650-1657.
  8. Roerdink, J.B.T.M., Meijster, A, 2000. The watershed transform: Definitions, algorithms and parallelization strategies. Fundamenta Informaticae 41, 187-228 P. Perona, J. Malik, 1990. Scale-Space and Edge Detection Using Anisotropic Diffusion. IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 12, 629-639.
  9. ?. Fukunaga and L.D. Hostetler, 1975. The Estimation of the Gradient of a Density Function, with Applications in Pattern Recognition. IEEE Trans. Information Theory, vol. 21, pp. 32-40.
  10. Y. Cheng, 1995. Mean Shift, Mode Seeking, and Clustering. IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 17, no. 8, pp. 790-799.
  11. D. Comaniciu and P. Meer, 2002. Mean Shift: A Robust Approach Toward Feature Space Analysis. IEEE Trans. Pattern Analysis and Machine Intelligence, vol.
  12. 24, no. 5, pp. 603-619.
Download


Paper Citation


in Harvard Style

Nikolaou N. and Papamarkos N. (2006). COLOR SEGMENTATION OF COMPLEX DOCUMENT IMAGES . In Proceedings of the First International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, ISBN 972-8865-40-6, pages 220-227. DOI: 10.5220/0001366202200227


in Bibtex Style

@conference{visapp06,
author={N. Nikolaou and N. Papamarkos},
title={COLOR SEGMENTATION OF COMPLEX DOCUMENT IMAGES},
booktitle={Proceedings of the First International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP,},
year={2006},
pages={220-227},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001366202200227},
isbn={972-8865-40-6},
}


in EndNote Style

TY - CONF
JO - Proceedings of the First International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP,
TI - COLOR SEGMENTATION OF COMPLEX DOCUMENT IMAGES
SN - 972-8865-40-6
AU - Nikolaou N.
AU - Papamarkos N.
PY - 2006
SP - 220
EP - 227
DO - 10.5220/0001366202200227