Actuation-based Shape Representation Applied to Engineering Document Analysis

Thomas C. Henderson, Narong Boonsiribunsum, Anshul Joshi

2016

Abstract

We propose that human generated drawings (including text and graphics) can be represented in terms of actuation processes required to produce them in addition to the visual or geometric properties. The basic theoretical tool is the wreath product introduced by Leyton (Leyton, 2001) (a special form of the semi-direct product from group theory which expresses the action of a control group on a fiber group) which can be used to describe the basic strokes used to form characters and other elements of the drawing. This captures both the geometry (points in the plane) of a shape as well as a generative model (actuation sequences on a kinematic structure). We show that this representation offers several advantages with respect to robust and effective semantic analysis of CAD drawings in terms of classification rates. Document analysis methods have been studied for several decades and much progress has been made; see (Henderson, 2014) for an overview. However, there are many classes of document images which still pose serious problems for effective semantic analysis. Of particular interest here are CAD drawings, and more specifically sets of scanned drawings for which either the electronic CAD no longer exists, or which were produced by hand. We demonstrate results on a set of CAD-generated drawings for automotive parts.

References

  1. Harley, A., Ufkes, A., and Derpanis, K. G. (2015). Evaluation of Deep Convolutional Nets for Document Image Classification and Retrieval. In International Conference on Document Analysis and Recognition (ICDAR), Nancy, France.
  2. Henderson, T. (2014). Analysis of Engineering Drawings and Raster Map Images. Springer Verlag, New York, NY.
  3. Henderson, T. C., Boonsirisumpun, N., and Joshi, A. (2015). Actuation in Perception: Character Classification in Engineering Drawings. In Proceedings IEEE Conference on Multisensor Fusion and Integration for Intelligent Systems, San Diego. IEEE.
  4. Joshi, A., Henderson, T., and Wang, W. (2014). Robot Cognition using Bayesian Symmetry Networks. In Proceedings of the Internaitonal Conference on Agents and Artificial Intelligence, Angers, France. IEEE.
  5. Kang, L., Ye, P., Li, Y., and D.Doermann (2014). A Deep Learning Approach to Document Image Quality Assessment. In Proceedings of the IEEE International Conference on Image Processing, pages 2570-2574, Paris, France. IEEE.
  6. Leyton, M. (2001). Springer, Berlin.
  7. Marinai, S. (2008). Introduction to Document Analysis and Recognition. In Studies in Computational Intelligence, pages 1-20, Berlin, Germany. Springer.
  8. Noƫ, A. (2004). Action in Perception. MIT Press, Cambrideg, MA.
  9. Plamondon, R. (1995a). A Kinematic Theory of Rapid Human Movements. I: Movement Representation and Generation. Bio. Cybernetics, 72(4):295-307.
  10. Plamondon, R. (1995b). A Kinematic Theory of Rapid Human Movements. II: Movement Time and Control. Bio. Cybernetics, 72(4):309-320.
  11. Plamondon, R. (1998). A Kinematic Theory of Rapid Human Movements. III: Kinematic Outcomes. Bio. Cybernetics, 78(2):133-145.
  12. Plamondon, R., O'Reilly, C., Galbslly, J., Almaksour, A., and Anquetil, E. (2014). Recent Developments in the Study of Rapid Human Movements with the Kinematic Theory: Applications to Handwriting and Signature Synthesis. Pattern Recognition Letters, 35(1):225-235.
  13. Viana, M. (2008). Symmetry Studies. Cambridge University Press, Cambridge, UK.
  14. Weyl, H. (1952). Symmetry. Princeton University Press, Princeton, NJ.
Download


Paper Citation


in EndNote Style

TY - CONF
JO - Proceedings of the 8th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART,
TI - Actuation-based Shape Representation Applied to Engineering Document Analysis
SN - 978-989-758-172-4
AU - Henderson T.
AU - Boonsiribunsum N.
AU - Joshi A.
PY - 2016
SP - 500
EP - 505
DO - 10.5220/0005818805000505


in Harvard Style

Henderson T., Boonsiribunsum N. and Joshi A. (2016). Actuation-based Shape Representation Applied to Engineering Document Analysis . In Proceedings of the 8th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART, ISBN 978-989-758-172-4, pages 500-505. DOI: 10.5220/0005818805000505


in Bibtex Style

@conference{icaart16,
author={Thomas C. Henderson and Narong Boonsiribunsum and Anshul Joshi},
title={Actuation-based Shape Representation Applied to Engineering Document Analysis},
booktitle={Proceedings of the 8th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART,},
year={2016},
pages={500-505},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005818805000505},
isbn={978-989-758-172-4},
}