Understanding Query Interfaces: Automatic Extraction of Data from Domain-specific Deep Web based on Ontology

Li Dong, Zhang Huan, Yu Zitong

Abstract

The resources of many Web-accessible databases, which are a very large portion of the structured data on the Web, are only available through query interfaces but are invisible to the traditional search engines. Many methods, which discovery these resources automatically, rely on the different structures of Web pages and various designing modes of databases. However, some semantic meanings and relations are ignored. Here we introduce a Web information retrieval system that obtains the knowledge from multiple databases automatically by using common ontology WordNet. Also, deep Web query results are post-processed based on domain ontology. That is, given an integrated interface, after inputting a query, our system offers an ordered list of data records to users. We have conducted an extensive experimental evaluation of the Web information retrieval system over real documents. Also, we test our system with hundreds of databases on different topics. Experiments show that our system has low cost and achieves high discovering accuracy across multiple databases.

Download


Paper Citation


in Harvard Style

Dong L., Huan Z. and Zitong Y. (2020). Understanding Query Interfaces: Automatic Extraction of Data from Domain-specific Deep Web based on Ontology.In Proceedings of the 22nd International Conference on Enterprise Information Systems - Volume 1: ICEIS, ISBN 978-989-758-423-7, pages 241-248. DOI: 10.5220/0009514202410248


in Bibtex Style

@conference{iceis20,
author={Li Dong and Zhang Huan and Yu Zitong},
title={Understanding Query Interfaces: Automatic Extraction of Data from Domain-specific Deep Web based on Ontology},
booktitle={Proceedings of the 22nd International Conference on Enterprise Information Systems - Volume 1: ICEIS,},
year={2020},
pages={241-248},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0009514202410248},
isbn={978-989-758-423-7},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 22nd International Conference on Enterprise Information Systems - Volume 1: ICEIS,
TI - Understanding Query Interfaces: Automatic Extraction of Data from Domain-specific Deep Web based on Ontology
SN - 978-989-758-423-7
AU - Dong L.
AU - Huan Z.
AU - Zitong Y.
PY - 2020
SP - 241
EP - 248
DO - 10.5220/0009514202410248