ICR DETECTION IN FILLED FORM & FORM REMOVAL

Abhishek Agarwal, Pramod Kumar, Sorabh Kumar

2006

Abstract

This paper presents methods to enhance accuracy rates of ICR detection in structured form processing. Forms are printed at different vendors using a variety of printers and at different settings. Every printer has its own scaling algorithm, so the final printed forms though visibly similar to naked eyes, contains considerable shift, expansion or shrinkage. This poses problems when data zones are close together as the template reference points refer to the neighbouring identical zones, impeding data extraction accuracy. Moreover, these transformational defects result in inaccurate form removal leaving behind line residues and noise that further deteriorates the extraction accuracy. Our proposed algorithm works on filled forms thereby eliminating the problem of difference between template and actual form. Template data can also be provided as an input to our algorithm to increase speed and accuracy. The algorithm has been tested on a variety of forms and the results have been very promising.

References

  1. Liu J., Ding X., Wu Y. 1995. Description and recognition of form and automated form data entry. In ICDAR'95, Third International Conference on Document Analysis and Recognition - Volume 2,pp. 579-582.
  2. Mathur, A., Gur, N.H., 1999. High Performance Form Analysis and Data Extraction. In ICDAR'99, Fifth International Conference on Document Analysis and Recognition.
  3. Pitas,I. and Venetsanopoulos, A. N. , 1990. Nonlinear DigitalFilters. Boston: Kluwer Academic, 1990.
  4. Chih-Hong, K., Hon-Son, D., 2005. Skew Detection of Document Images Using Line Structural Information, In ICITA'05, Third International Conference on Information Technology and Applications Volume 1.
  5. Shi, Z., Govindaraju, V., 2003. Skew Detection for Complex Document Images Using Fuzzy Runlength. In ICDAR'03, Seventh International Conference on Document Analysis and Recognition - Volume 2, 715- 719.
  6. Le D. X., Thoma G.R., Wechsler H.1996. Automated border detection and adaptive segmentation for binary document images. In Proceedings of ICPR 7896.
  7. Illingworth, J., Kittler J., H.1998. A survey of the Hough transform. CVGIP, Vol. 44.87-116, 1998.
  8. Rosito Jung C., Schramm R. 2004. Rectangle detection based on a windowed Hough transform. In SIBGRAPI'04, Computer Graphics and Image Processing, XVII Brazilian Symposium on , 113-120.
  9. Zheng Y., Li H., Doermann D. 2003. Background line detection with a stochastic model. In 2003 Conference on Computer Vision and Pattern Recognition Workshop - Volume 3.
  10. Gattani, A., Mukerji, M. and Gur, H., 2003. A Fast Multifunctional Approach for Document Image Analysis. In Proceedings of the Seventh ICDAR, 2003, 1178-1182.
  11. Yoo J., Kim M., Yong Han S. 1995. Line Removal and restoration of Handwritten Characters on the Form Documents. In ICDAR'97, Fourth International Conference Document Analysis and Recognition , 128-131.
  12. Dillencourt, M.B., Samet,H., and Tammininen,M., 1992. General approach to Connected-Component Labeling for Arbitrary Image Representations, In J.ACM Vol 39, No.2, 1992, pp. 253-280.
Download


Paper Citation


in Harvard Style

Agarwal A., Kumar P. and Kumar S. (2006). ICR DETECTION IN FILLED FORM & FORM REMOVAL . In Proceedings of the First International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, ISBN 972-8865-40-6, pages 271-276. DOI: 10.5220/0001371202710276


in Bibtex Style

@conference{visapp06,
author={Abhishek Agarwal and Pramod Kumar and Sorabh Kumar},
title={ICR DETECTION IN FILLED FORM & FORM REMOVAL},
booktitle={Proceedings of the First International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP,},
year={2006},
pages={271-276},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001371202710276},
isbn={972-8865-40-6},
}


in EndNote Style

TY - CONF
JO - Proceedings of the First International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP,
TI - ICR DETECTION IN FILLED FORM & FORM REMOVAL
SN - 972-8865-40-6
AU - Agarwal A.
AU - Kumar P.
AU - Kumar S.
PY - 2006
SP - 271
EP - 276
DO - 10.5220/0001371202710276