Towards Efficient Reorganisation Algorithms of Hybrid Index Structures Supporting Multimedia Search Conditions

Carsten Kropf

2014

Abstract

This paper presents the optimization of reorganisation algorithms for hybrid index structures supporting multimedia search conditions. Multimedia in this case refers, on the one hand, to the support of high-dimensional feature spaces and, on the other, to the mix of data of multiple types. We use an approach typically found in geographic information retrieval (GIR) systems, which combines two-dimensional geographical points with textual data; the dimensionality of the points may, however, be set arbitrarily. Currently, most access methods of this kind implemented for database-centric application domains are validated in simulation-based environments: the structures and experiments rely on synthetic validation in an artificial setup, and the tests focus on retrieval efficiency. We implemented such an indexing method in a realistic database management system and observed unacceptable runtime behaviour of the reorganisation algorithms. Hence, a structured and iterative optimization procedure is set up to make hybrid index structures suitable for real-world application scenarios. The final outcome is a set of algorithms providing efficient approaches for the reorganisation of access methods for hybrid data spaces.



Paper Citation


in Harvard Style

Kropf C. (2014). Towards Efficient Reorganisation Algorithms of Hybrid Index Structures Supporting Multimedia Search Conditions. In Proceedings of 3rd International Conference on Data Management Technologies and Applications - Volume 1: DATA, ISBN 978-989-758-035-2, pages 231-242. DOI: 10.5220/0004996302310242


in Bibtex Style

@conference{data14,
author={Carsten Kropf},
title={Towards Efficient Reorganisation Algorithms of Hybrid Index Structures Supporting Multimedia Search Conditions},
booktitle={Proceedings of 3rd International Conference on Data Management Technologies and Applications - Volume 1: DATA},
year={2014},
pages={231-242},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004996302310242},
isbn={978-989-758-035-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of 3rd International Conference on Data Management Technologies and Applications - Volume 1: DATA
TI - Towards Efficient Reorganisation Algorithms of Hybrid Index Structures Supporting Multimedia Search Conditions
SN - 978-989-758-035-2
AU - Kropf C.
PY - 2014
SP - 231
EP - 242
DO - 10.5220/0004996302310242