Authors:
Panagiotis Antonellis
;
Stavros Kontopoulos
;
Christos Makris
;
Yannis Plegas
and
Nikos Tsirakis
Affiliation:
University of Patras, Greece
Keyword(s):
XML Filtering, Distributed Processing, Peer-to-Peer Networks, Bloom Filters, Semantic Filtering, Word Sense Disambiguation.
Related
Ontology
Subjects/Areas/Topics:
Distributed and Parallel Applications
;
Internet Technology
;
Web Information Systems and Technologies
;
XML and Data Management
Abstract:
Information filtering systems constitute a critical component in modern information seeking applications. As the number of users grows and the information available becomes even bigger it is imperative to employ scalable and efficient representation and filtering techniques. Typically the use of XML representation entails the profile representation with the use of the XPath query language and the employment of efficient heuristic techniques for constraining the complexity of the filtering mechanism. However, as the number of XML documents exchanged daily grows rapidly, the need for distributed management is becoming vital. In this paper we introduce the Distributed Bloom Filters and we propose a new distributed XML filtering system for peer-to-peer (P2P) networks. The major advantage of Distributed Bloom Filters, in comparison to the classical structure is their space efficiency and improved performance. The proposed system efficiently filters the incoming XML documents using a virtu
al index created on top of the network. In addition, the proposed system supports semantic disambiguation of both the stored user profiles and the XML documents, thus providing better matching results.
(More)