ARCO: A LONG-TERM DIGITAL LIBRARY STORAGE SYSTEM BASED ON GRID COMPUTATIONAL INFRASTRUCTURE

Han Fei, Paulo Trezentos, Nuno Almeida, Miguel Lourenço, José Borbinha, João Neves

2005

Abstract

Over the past several years, large-scale digital library services have grown enormously in popularity. The ARCO project is a digital library storage project at the Portuguese National Library. A digital library storage system such as ARCO faces several challenges, including the availability of peta-scale storage, seamless spanning of storage clusters, administration and utilization of distributed storage and computing resources, safe and stable data transfer, scalability of the whole system, and automatic discovery and monitoring of metadata. Grid computing is an effective technology for coupling geographically distributed resources to solve large-scale problems over wide-area or local-area networks. The ARCO system has been developed on a Grid computational infrastructure and builds on various other toolkits, such as PostgreSQL, LDAP, and the Apache HTTP server. The main development languages are C, PHP, and Perl. In this paper, we discuss the logical structure of the ARCO digital library system, its resource organization, metadata discovery and usage, the system's operation in detail with some example operations, and a solution to the large file transfer problem in the Globus grid toolkit.



Paper Citation


in Harvard Style

Fei H., Trezentos P., Almeida N., Lourenço M., Borbinha J. and Neves J. (2005). ARCO: A LONG-TERM DIGITAL LIBRARY STORAGE SYSTEM BASED ON GRID COMPUTATIONAL INFRASTRUCTURE. In Proceedings of the Seventh International Conference on Enterprise Information Systems - Volume 1: ICEIS, ISBN 972-8865-19-8, pages 44-51. DOI: 10.5220/0002534000440051


in Bibtex Style

@conference{iceis05,
author={Han Fei and Paulo Trezentos and Nuno Almeida and Miguel Lourenço and José Borbinha and João Neves},
title={ARCO: A LONG-TERM DIGITAL LIBRARY STORAGE SYSTEM BASED ON GRID COMPUTATIONAL INFRASTRUCTURE},
booktitle={Proceedings of the Seventh International Conference on Enterprise Information Systems - Volume 1: ICEIS},
year={2005},
pages={44-51},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002534000440051},
isbn={972-8865-19-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Seventh International Conference on Enterprise Information Systems - Volume 1: ICEIS
TI - ARCO: A LONG-TERM DIGITAL LIBRARY STORAGE SYSTEM BASED ON GRID COMPUTATIONAL INFRASTRUCTURE
SN - 972-8865-19-8
AU - Fei H.
AU - Trezentos P.
AU - Almeida N.
AU - Lourenço M.
AU - Borbinha J.
AU - Neves J.
PY - 2005
SP - 44
EP - 51
DO - 10.5220/0002534000440051