KEYMANTIC: A KEYWORD-BASED SEARCH ENGINE USING STRUCTURAL KNOWLEDGE

Francesco Guerra, Sonia Bergamaschi, Mirko Orsini, Antonio Sala, Claudio Sartori

2009

Abstract

Traditional techniques for query formulation need the knowledge of the database contents, i.e. which data are stored in the data source and how they are represented. In this paper, we discuss the development of a keyword-based search engine for structured data sources. The idea is to couple the ease of use and flexibility of keyword-based search with metadata extracted from data schemata and extensional knowledge which constitute a semantic network of knowledge. Translating keywords into SQL statements, we will develop a search engine that is effective, semantic-based, and applicable also when instance are not continuously available, such as in integrated data sources or in data sources extracted from the deep web.

References

  1. Aditya, B., Bhalotia, G., Chakrabarti, S., Hulgeri, A., Nakhe, C., Parag, and Sudarshan, S. (2002). Banks: Browsing and keyword searching in relational databases. In VLDB 2002, Proceedings of 28th International Conference on Very Large Data Bases, August 20-23, 2002, Hong Kong, China, pages 1083- 1086. Morgan Kaufmann.
  2. Agrawal, S., Chaudhuri, S., and Das, G. (2002). Dbxplorer: A system for keyword-based search over relational databases. In ICDE, pages 5-16. IEEE Computer Society.
  3. Beneventano, D., Bergamaschi, S., Guerra, F., and Vincini, M. (2001). The momis approach to information integration. In ICEIS (1), pages 194-198.
  4. Bergamaschi, S., Castano, S., Vincini, M., and Beneventano, D. (2001). Semantic integration of heterogeneous information sources. Data Knowl. Eng., 36(3):215-249.
  5. Bergamaschi, S., Sartori, C., Guerra, F., and Orsini, M. (2007). Extracting relevant attribute values for improved search. IEEE Internet Computing, 11(5):26- 35.
  6. Chan, C. Y., Ooi, B. C., and Zhou, A., editors (2007). Proceedings of the ACM SIGMOD International Conference on Management of Data, Beijing, China, June 12-14, 2007. ACM.
  7. Doan, A. and Halevy, A. Y. (2005). Semantic Integration Research in the Database Community: A Brief Survey. AI Magazine, 26(1):83-94.
  8. Giunchiglia, F., Yatskevich, M., and Shvaiko, P. (2007). Semantic matching: Algorithms and implementation. J. Data Semantics, 9:1-38.
  9. Guha, R. V., McCool, R., and Miller, E. (2003). Semantic search. In WWW, pages 700-709.
  10. Hristidis, V. and Papakonstantinou, Y. (2002). Discover: Keyword search in relational databases. In VLDB 2002, Proceedings of 28th International Conference on Very Large Data Bases, August 20-23, 2002, Hong Kong, China, pages 670-681. Morgan Kaufmann.
  11. Katifori, A., Halatsis, C., Lepouras, G., Vassilakis, C., and Giannopoulou, E. G. (2007). Ontology visualization methods - a survey. ACM Comput. Surv., 39(4).
  12. Lenzerini, M. (2002). Data integration: A theoretical perspective. In Popa, L., editor, PODS, pages 233-246. ACM.
  13. Li, X., Meng, W., and Meng, X. (2007). Easyquerier: A keyword based interface for web database integration system. In Ramamohanarao, K., Krishna, P. R., Mohania, M. K., and Nantajeewarawat, E., editors, DASFAA, volume 4443 of Lecture Notes in Computer Science, pages 936-942. Springer.
  14. Liu, F., Yu, C. T., Meng, W., and Chowdhury, A. (2006). Effective keyword search in relational databases. In Chaudhuri, S., Hristidis, V., and Polyzotis, N., editors, SIGMOD Conference, pages 563-574. ACM.
  15. Madhavan, J., Cohen, S., Dong, X. L., Halevy, A. Y., Jeffery, S. R., Ko, D., and Yu, C. (2007). Web-scale data integration: You can afford to pay as you go. In CIDR, pages 342-350. www.crdrdb.org.
  16. Madhavan, J., Halevy, A. Y., Cohen, S., Dong, X. L., Jeffery, S. R., Ko, D., and Yu, C. (2006). Structured data meets the web: A few observations. IEEE Data Eng. Bull., 29(4):19-26.
  17. Naumann, F., Bilke, A., Bleiholder, J., and Weis, M. (2006). Data fusion in three steps: Resolving schema, tuple, and value inconsistencies. IEEE Data Eng. Bull., 29(2):21-31.
  18. Navarro, G. (2001). A guided tour to approximate string matching. ACM Comput. Surv., 33(1):31-88.
  19. Sattler, K.-U., Geist, I., and Schallehn, E. (2005). Conceptbased querying in mediator systems. VLDB J., 14(1):97-111.
  20. Sayyadian, M., LeKhac, H., Doan, A., and Gravano, L. (2007). Efficient keyword search across heterogeneous relational databases. In ICDE, pages 346-355. IEEE.
  21. Simitsis, A., Koutrika, G., and Ioannidis, Y. E. (2008). Précis: from unstructured keywords as queries to structured databases as answers. VLDB J., 17(1):117- 149.
  22. Weikum, G. (2007). Db&ir: both sides now. In (Chan et al., 2007), pages 25-30.
  23. Wright, A. (2008). Searching the deep web. Commun. ACM, 51(10):14-15.
  24. Yu, B., Li, G., Sollins, K. R., and Tung, A. K. H. (2007). Effective keyword-based selection of relational databases. In (Chan et al., 2007), pages 139- 150.
  25. Zloof, M. M. (1975). Query-by-example: the invocation and definition of tables and forms. In Kerr, D. S., editor, VLDB, pages 1-24. ACM.
Download


Paper Citation


in Harvard Style

Guerra F., Bergamaschi S., Orsini M., Sala A. and Sartori C. (2009). KEYMANTIC: A KEYWORD-BASED SEARCH ENGINE USING STRUCTURAL KNOWLEDGE . In Proceedings of the 11th International Conference on Enterprise Information Systems - Volume 1: ICEIS, ISBN 978-989-8111-84-5, pages 241-246. DOI: 10.5220/0002155802410246


in Bibtex Style

@conference{iceis09,
author={Francesco Guerra and Sonia Bergamaschi and Mirko Orsini and Antonio Sala and Claudio Sartori},
title={KEYMANTIC: A KEYWORD-BASED SEARCH ENGINE USING STRUCTURAL KNOWLEDGE},
booktitle={Proceedings of the 11th International Conference on Enterprise Information Systems - Volume 1: ICEIS,},
year={2009},
pages={241-246},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002155802410246},
isbn={978-989-8111-84-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 11th International Conference on Enterprise Information Systems - Volume 1: ICEIS,
TI - KEYMANTIC: A KEYWORD-BASED SEARCH ENGINE USING STRUCTURAL KNOWLEDGE
SN - 978-989-8111-84-5
AU - Guerra F.
AU - Bergamaschi S.
AU - Orsini M.
AU - Sala A.
AU - Sartori C.
PY - 2009
SP - 241
EP - 246
DO - 10.5220/0002155802410246