A SPATIAL QUERY LANGUAGE FOR PRESENTATION-ORIENTED DOCUMENTS

Ermelinda Oro, Francesco Riccetti, Massimo Ruffolo

Abstract

In last years the huge relevance of accessing and acquiring information made available byWeb (HTML) pages and business (PDF) documents has grown much further. In this paper we present a textual query language, named ViQueL, whose main feature is to identify and extract relevant information from HTML and PDF documents on the base of their visual appearance by using easy-to-write queries. The proposed language is founded on spatial grammars, i.e. context free grammars extended by spatial constructs. Despite a considerable expressive power, combined complexity of ViQueL is in P-Time. Moreover, experiments show that ViQueL is reasonably efficient for real-life extraction tasks.

References

  1. Adali, S., Sapino, M. L., and Subrahmanian, V. S. (2000). An algebra for creating and querying multimedia presentations. Multimedia Syst., 8(3):212-230.
  2. Kong, J., Zhang, K., and Zeng, X. (2006). Spatial graph grammars for graphical user interfaces. ACM Trans. Comput.-Hum. Interact., 13(2):268-307.
  3. Lee, T., Sheng, L., Bozkaya, T., Balkir, N. H., O zsoyoglu, Z. M., and O zsoyoglu, G. (1999). Querying multimedia presentations based on content. IEEE Trans. on Knowl. and Data Eng., 11(3):361-385.
  4. Navarrete, I. and Sciavicco, G. (2006). Spatial reasoning with rectangular cardinal direction relations. In ECAI, pages 1-9.
Download


Paper Citation


in Harvard Style

Oro E., Riccetti F. and Ruffolo M. (2011). A SPATIAL QUERY LANGUAGE FOR PRESENTATION-ORIENTED DOCUMENTS . In Proceedings of the 3rd International Conference on Agents and Artificial Intelligence - Volume 1: ICAART, ISBN 978-989-8425-40-9, pages 306-312. DOI: 10.5220/0003177603060312


in Bibtex Style

@conference{icaart11,
author={Ermelinda Oro and Francesco Riccetti and Massimo Ruffolo},
title={A SPATIAL QUERY LANGUAGE FOR PRESENTATION-ORIENTED DOCUMENTS},
booktitle={Proceedings of the 3rd International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,},
year={2011},
pages={306-312},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003177603060312},
isbn={978-989-8425-40-9},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 3rd International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,
TI - A SPATIAL QUERY LANGUAGE FOR PRESENTATION-ORIENTED DOCUMENTS
SN - 978-989-8425-40-9
AU - Oro E.
AU - Riccetti F.
AU - Ruffolo M.
PY - 2011
SP - 306
EP - 312
DO - 10.5220/0003177603060312