
 
using the Ant Colony Optimization (ACO) 
heuristic. We evaluated the quality and complexities 
of our location method. As ACO consists of ants or 
agents that explore the solution space and cyclically 
improve the results, we determined the number of 
agents that produces the most efficient exploration. 
Each round consists of two phases, traversal and 
prizing. We also devised a prizing mechanism that 
accelerates convergence. For the instances of GE 
addressed here, we were able to propose a model 
that predicts the total number of rounds as a linear 
function of each of the parameters under study.  
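
As an illustration of the round structure summarized above, the following is a minimal sketch of an ACO loop in which every round has a traversal phase and a prizing phase. It is not the procedure evaluated in this paper: the toy problem (assigning items to nodes at minimum cost), the function name `aco_assign`, the evaporation-plus-reward update and all parameter names and default values are assumptions made only for this example.

```python
import random


def aco_assign(cost, n_ants=8, n_rounds=50, evaporation=0.1, prize=1.0, seed=0):
    """Toy ACO: assign each item i to one node j, minimising sum(cost[i][j]).

    Hypothetical helper for illustration only; not the paper's algorithm.
    """
    rng = random.Random(seed)
    n_items, n_nodes = len(cost), len(cost[0])
    # One pheromone value per (item, node) pair, initially uniform.
    pheromone = [[1.0] * n_nodes for _ in range(n_items)]
    best, best_cost = None, float("inf")

    for _ in range(n_rounds):
        # Phase 1, traversal: each ant builds a complete assignment,
        # picking a node for every item with probability proportional
        # to the pheromone on that (item, node) pair.
        for _ in range(n_ants):
            solution = [
                rng.choices(range(n_nodes), weights=pheromone[i])[0]
                for i in range(n_items)
            ]
            total = sum(cost[i][j] for i, j in enumerate(solution))
            if total < best_cost:
                best, best_cost = solution, total

        # Phase 2, prizing: evaporate all pheromone, then reward the
        # choices made by the best assignment found so far.
        for i in range(n_items):
            for j in range(n_nodes):
                pheromone[i][j] *= 1.0 - evaporation
            pheromone[i][best[i]] += prize

    return best, best_cost
```

In this sketch `n_ants` plays the role of the number of agents and `n_rounds` the number of rounds; a stronger `prize` concentrates the search sooner, at the risk of settling on a poor assignment.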
Some important issues remain as directions for 
further work. For the sake of simplicity we assumed 
that each document in the CSW is labelled with a 
single index. What should we do with multi-labelled 
or highly nested contents? How should we deal with 
the growth of the CSW? Preliminary work suggests that 
multi-labelled documents can fit well in our look-up 
table by means of linking mechanisms 
subordinating all the indices of a document to a 
couple of main concepts that define the actual 
location of the corresponding file. As for the CSW 
dynamics, we consider that spare storage capacity must 
be reserved to accommodate medium-term growth. 
In the long term, the whole CSW partitioning, i.e. 
its granularity, may have to be redefined and a new 
allocation procedure invoked. It is also possible 
that whenever a small subset of related concepts 
shows rapid growth in the size of its documents, 
the entire collection might migrate to a new store 
node.  
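
The linking idea for multi-labelled documents mentioned above could look roughly like the sketch below. It is only an assumption about how such a table might be organized: the class `LookupTable`, its methods, and the restriction to a single main concept per document (the text speaks of a couple of main concepts) are hypothetical simplifications.

```python
from collections import defaultdict


class LookupTable:
    """Hypothetical look-up table: every index of a document links to the
    main concept, and only the main concept determines the store node."""

    def __init__(self, node_of_concept):
        # main concept -> store node holding its documents
        self.node_of_concept = dict(node_of_concept)
        # index -> set of (document id, main concept) links
        self.links = defaultdict(set)

    def add(self, doc_id, indices, main_concept):
        if main_concept not in self.node_of_concept:
            raise KeyError(f"no store node assigned to {main_concept!r}")
        # Subordinate every index of the document to its main concept.
        for index in set(indices) | {main_concept}:
            self.links[index].add((doc_id, main_concept))

    def locate(self, index):
        """Return {document id: store node} for documents carrying index."""
        return {
            doc_id: self.node_of_concept[main]
            for doc_id, main in self.links.get(index, ())
        }

    def migrate(self, main_concept, new_node):
        """Move the whole collection of a main concept to another node."""
        if main_concept not in self.node_of_concept:
            raise KeyError(f"no store node assigned to {main_concept!r}")
        self.node_of_concept[main_concept] = new_node
```

Because secondary indices point to a main concept rather than to a node, migrating a fast-growing collection in this sketch is a single update of the concept-to-node map; the per-document links are left untouched.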
Distributed storage is driving many R&D efforts. 
From the users’ point of view, it may turn into the 
basic mechanism able to unleash the potential 
benefits of knowledge management. Health sciences, 
agriculture and geomatics are only a few examples of 
the many domains that may dramatically improve 
their operations with the adoption of this new trend.  