USING CLOUDS FOR SCIENCE, IS IT JUST KICKING THE CAN DOWN THE ROAD?

Ewa Deelman, Gideon Juve, G. Bruce Berriman

2012

Abstract

In this paper we describe issues related to the execution of scientific workflows on clouds, giving particular emphasis to the challenges faced by scientists when using grids and clouds for workflows. We also mention some existing solutions and identify areas requiring additional work.

References

  1. D. Kranzlmüller, J. M. Lucas, et al, "The European Grid Initiative (EGI)," Remote Instrumentation and Virtual Laboratories, pp. 61-66, 2010.
  2. E. Deelman, G. Singh, et al, "The Cost of Doing Science on the Cloud: The Montage Example," SC'08 Austin, TX, 2008.
  3. R. D. Stevens, A. J. Robinson, and C. A. Goble, "myGrid: personalised bioinformatics on the information grid," Bioinformatics vol. 19, 2003.
  4. S. Callaghan, P. Maechling, et al, "Reducing Time-toSolution Using Distributed High-Throughput MegaWorkflows - Experiences from SCEC CyberShake," eScience, Indianapolis, 2008.
  5. D. A. Brown, P. R. Brady, et al, "A Case Study on the Use of Workflow Technologies for Scientific Analysis: Gravitational Wave Data Analysis," in Workflows for e-Science, I. Taylor, et al, Eds., Springer, 2006.
  6. A. S. Bland, R. A. Kendall, et al, "Jaguar: The world's most powerful computer," Memory (TB), vol. 300, p. 362, 2009.
  7. A. Gara, M. A. Blumrich, et al, "Overview of the Blue Gene/L system architecture," IBM Journal of Research and Development, vol. 49, 2005.
  8. I. Foster, "Globus Toolkit Version 4: Software for Service-Oriented Systems," 2006.
  9. M. Litzkow, M. Livny, and M. Mutka, "Condor - A Hunter of Idle Workstations," in Proc. 8th Intl Conf. on Distributed Computing Systems, ed, 1988.
  10. K. Czajkowski, I. Foster, et al, "A Resource Management Architecture for Metacomputing Systems," in 4th Workshop on Job Scheduling Strategies for Parallel Processing, 1998, pp. 62-82.
  11. A. Bayucan, R. L. Henderson, et al, "Portable Batch System: External reference specification," ed, 1999.
  12. W. Allcock, J. Bester, et al, "Data Management and Transfer in High-Performance Computational Grid Environments," Parallel Computing, 2001.
  13. Amazon Elastic Compute Cloud. http://aws.amazon.com/ ec2/
  14. (2010). FutureGrid. http://www.futuregrid.org/
  15. G. B. Berriman, E. Deelman, et al, "Montage: A Grid Enabled Engine for Delivering Custom Science-Grade Mosaics On Demand," in SPIE Conference 5487: Astronomical Telescopes, 2004.
  16. R. W. G., Paul G. Somerville, et al, "Ground motion environment of the Los Angeles region," The Structural Design of Tall and Special Buildings, vol. 15, pp. 483-494, 2006.
  17. J. Dean and S. Ghemawat, "MapReduce: Simplified data processing on large clusters," Communications of the ACM, vol. 51, pp. 107-113, 2008.
  18. E. Deelman, J. Blythe, et al, "Pegasus : Mapping Scientific Workflows onto the Grid," in 2nd European Across Grids Conference, Cyprus, 2004.
  19. E. Deelman, G. Mehta, et al, "Pegasus: Mapping LargeScale Workflows to Distributed Resources," in Workflows in e-Science, I. Taylor, E. Deelman, D. Gannon, and M. Shields, Eds., ed: Springer, 2006.
  20. A. Ramakrishnan, G. Singh, et al, "Scheduling Data - Intensive Workflows onto Storage-Constrained Distributed Resources," in CCGrid 2007.
  21. G. Singh, K. Vahi, et al, "Optimizing Workflow Data Footprint " Scientific Programming Journal, Special issue on Dynamic Computational Workflows, vol. 15, 2007
  22. S. Miles, E. Deelman, et al, "Connecting Scientific Data to Scientific Experiments with Provenance " Third IEEE e-Science 2007, India. , 2007.
  23. S. Miles, P. Groth, et al, "Provenance: The bridge between experiments and data," Computing in Science & Engineering, vol. 10, pp. 38-46, 2008.
  24. P. Groth, E. Deelman, et al, "Pipeline-Centric Provenance Model, "The 4th Workshop on Workflows in Support of Large-Scale Science, Portland, OR, 2009.
  25. E. Deelman, "Grids and Clouds: Making Workflow Applications Work in Heterogeneous Distributed Environments," International Journal of High Performance Computing Applications, 2009.
  26. G. Juve and E. Deelman, "Automating Application Deployment in Infrastructure Clouds," CloudCom 2011,
  27. G. Juve and E. Deelman, "Wrangler: Virtual Cluster Provisioning for the Cloud (short paper), HPDC'11, 2011.
  28. E. Deelman, D. Gannon, et al, "Workflows and e-Science: An overview of workflow system features and capabilities," Future Generation Computer Systems, vol. 25, pp. 528-540, 2009.
  29. I. Krsul, A. Ganguly, et al, "Vmplants: Providing and managing virtual machine execution environments for grid computing," 2004, pp. 7-7.
  30. M. A. Murphy, B. Kagey, et al, "Dynamic provisioning of virtual organization clusters," 2009,
  31. J.-S. Vöckler, G. Juve, et al, "Experiences Using Cloud Computing for A Scientific Workflow Application," ScienceCloud, 2011.
  32. M. Burgess, "A site configuration engine," USENIX Computing Systems, vol. 8, 1995.
  33. L. Kanies, "Puppet: Next Generation Configuration Management," Login, vol. 31, 2006.
  34. C. Sapuntzakis, D. Brumley, et al, "Virtual Appliances for Deploying and Maintaining Software," USENIX 2003.
  35. K. Keahey, R. Figueiredo, et al, "Science clouds: Early experiences in cloud computing for scientific applications," Cloud Computing and Applications, 2008.
  36. J. Bresnahan, T. Freeman, et al, "Managing Appliance Launches in Infrastructure Clouds," Teragrid Conference, 2011.
Download


Paper Citation


in Harvard Style

Deelman E., Juve G. and Bruce Berriman G. (2012). USING CLOUDS FOR SCIENCE, IS IT JUST KICKING THE CAN DOWN THE ROAD? . In Proceedings of the 2nd International Conference on Cloud Computing and Services Science - Volume 1: CLOSER, ISBN 978-989-8565-05-1, pages 127-134. DOI: 10.5220/0003958901270134


in Bibtex Style

@conference{closer12,
author={Ewa Deelman and Gideon Juve and G. Bruce Berriman},
title={USING CLOUDS FOR SCIENCE, IS IT JUST KICKING THE CAN DOWN THE ROAD?},
booktitle={Proceedings of the 2nd International Conference on Cloud Computing and Services Science - Volume 1: CLOSER,},
year={2012},
pages={127-134},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003958901270134},
isbn={978-989-8565-05-1},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 2nd International Conference on Cloud Computing and Services Science - Volume 1: CLOSER,
TI - USING CLOUDS FOR SCIENCE, IS IT JUST KICKING THE CAN DOWN THE ROAD?
SN - 978-989-8565-05-1
AU - Deelman E.
AU - Juve G.
AU - Bruce Berriman G.
PY - 2012
SP - 127
EP - 134
DO - 10.5220/0003958901270134