TOWARDS THE EVOLUTION OF LEGACY APPLICATIONS TO MULTICORE SYSTEMS - Experiences Parallelizing R

Gonzalo Vera, Remo Suppi

2010

Abstract

Current innovations in processor performance, focused to keep the growth rate of the last years, are mainly based on providing several processing units within the same chip. With new underlying multicore processors, traditional sequential applications have to be adapted with parallel programming techniques to take advantage of the new processing capabilities. There exists a great variety of libraries, middlewares, and frameworks to assist the parallelization of such applications. However, in many cases, specially with classical scientific applications, due to several limitations ranging from technical incompatibilities to simply lack of knowledge, this evolution cannot always be achieved. We here present our experiences providing an alternative for two situations where former contributions could not provide a satisfactory solution to our needs: adapting a mature non-thread-safe C coded application, the R language interpreter, and providing support for the automatic parallelization of R scripts in multicore systems.

References

  1. Alberts, R., Vera, G., and Jansen, R. C. (2008). affyGG: computational protocols for genetical genomics with Affymetrix arrays. Bioinformatics, 24(3):433-434.
  2. Bridges, M. J., Vachharajani, N., Zhang, Y., Jablin, T., and August, D. I. (2008). Revisiting the sequential programming model for the multicore era. IEEE Micro, 28(1):12-20.
  3. Burns, G., Daoud, R., and Vaigl, J. (1994). LAM: An open cluster environment for MPI. In Proceedings of Supercomputing Symposium, pages 379-386.
  4. Bystrykh, L., Weersing, E., Dontje, B., Sutton, S., Pletcher, M. T., Wiltshire, T., Su, A. I., Vellenga, E., Wang, J., Manly, K. F., Lu, L., Chesler, E. J., Alberts, R., Jansen, R. C., Williams, R. W., Cooke, M. P., and de Haan, G. (2005). Uncovering regulatory pathways that affect hematopoietic stem cell function using 'genetical genomics'. Nature Genetics, 37(3):225-232.
  5. Dagum, L. and Menon, R. (1998). OpenMP: An industrystandard API for shared-memory programming. IEEE Computing in Science and Engineering, 5(1):46-55.
  6. Gabriel, E., Fagg, G. E., Bosilca, G., Angskun, T., Dongarra, J. J., Squyres, J. M., Sahay, V., Kambadur, P., Barrett, B., Lumsdaine, A., Castain, R. H., Daniel, D. J., Graham, R. L., and Woodall, T. S. (2004). Open MPI: Goals, concept, and design of a next generation MPI implementation. In In Proceedings, 11th European PVM/MPI Users Group Meeting, pages 97-104.
  7. Ihaka, R. and Gentleman, R. (1996). R: A language for data analysis and graphics. Journal of Computational and Graphical Statistics, 5(3):299-314.
  8. GNU Software Foundation, Inc. (2007). GNU general public licence. http://www.gnu.org/licenses/gpl.html.
  9. MPI Forum (1993). MPI: A Message Passing Interface. In Proc. of Supercomputing 93, pages 878-883.
  10. The Perl Foundation (2002). Perl 5.8.0 release announcement. http://dev.perl.org/perl5/news/2002/07/18/ 580ann/.
  11. Sutter, H. and Larus, J. (2005). Software and the Concurrency Revolution. ACM Queue, 3(7):54-62.
  12. Vera, G., Jansen, R., and Suppi, R. (2008). R/parallel - speeding up bioinformatics analysis with R. BMC Bioinformatics, 9(1):390.
  13. Yu, H. (2009). Rmpi: Interface (wrapper) to MPI (message-passing interface). http://www.stats.uwo.ca /faculty/yu/Rmpi.
Download


Paper Citation


in Harvard Style

Vera G. and Suppi R. (2010). TOWARDS THE EVOLUTION OF LEGACY APPLICATIONS TO MULTICORE SYSTEMS - Experiences Parallelizing R . In Proceedings of the First International Conference on Bioinformatics - Volume 1: BIOINFORMATICS, (BIOSTEC 2010) ISBN 978-989-674-019-1, pages 250-256. DOI: 10.5220/0002746302500256


in Bibtex Style

@conference{bioinformatics10,
author={Gonzalo Vera and Remo Suppi},
title={TOWARDS THE EVOLUTION OF LEGACY APPLICATIONS TO MULTICORE SYSTEMS - Experiences Parallelizing R},
booktitle={Proceedings of the First International Conference on Bioinformatics - Volume 1: BIOINFORMATICS, (BIOSTEC 2010)},
year={2010},
pages={250-256},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002746302500256},
isbn={978-989-674-019-1},
}


in EndNote Style

TY - CONF
JO - Proceedings of the First International Conference on Bioinformatics - Volume 1: BIOINFORMATICS, (BIOSTEC 2010)
TI - TOWARDS THE EVOLUTION OF LEGACY APPLICATIONS TO MULTICORE SYSTEMS - Experiences Parallelizing R
SN - 978-989-674-019-1
AU - Vera G.
AU - Suppi R.
PY - 2010
SP - 250
EP - 256
DO - 10.5220/0002746302500256