Big Data Scalability of BayesPhylogenies on Harvard’s Ozone 12k Cores

M. Manjunathaiah, M. Manjunathaiah, A. Meade, R. Thavarajan, P. Protopapas, R. Dave

2019

Abstract

Computational Phylogenetics is classed as a grand challenge data driven problem in the fourth paradigm of scientific discovery due to the exponential growth in genomic data, the computational challenge and the potential for vast impact on data driven biosciences. Petascale and Exascale computing offer the prospect of scaling Phylogenetics to big data levels. However the computational complexity of even approximate Bayesian methods for phylogenetic inference requires scalable analysis for big data applications. There is limited study on the scalability characteristics of existing computational models for petascale class massively parallel computers. In this paper we present strong and weak scaling performance analysis of BayesPhylogenies on Harvard’s Ozone 12k cores. We perform evaluations on multiple data sizes to infer the scaling complexity and find that strong scaling techniques along with novel methods for communication reduction are necessary if computational models are to overcome limitations on emerging complex parallel architectures with multiple levels of concurrency. The results of this study can guide the design and implementation of scalable MCMC based computational models for Bayesian inference on emerging petascale and exascale systems.

Download


Paper Citation


in Harvard Style

Manjunathaiah M., Meade A., Thavarajan R., Protopapas P. and Dave R. (2019). Big Data Scalability of BayesPhylogenies on Harvard’s Ozone 12k Cores. In Proceedings of the 12th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2019) - Volume 3: BIOINFORMATICS; ISBN 978-989-758-353-7, SciTePress, pages 143-148. DOI: 10.5220/0007249601430148


in Bibtex Style

@conference{bioinformatics19,
author={M. Manjunathaiah and A. Meade and R. Thavarajan and P. Protopapas and R. Dave},
title={Big Data Scalability of BayesPhylogenies on Harvard’s Ozone 12k Cores},
booktitle={Proceedings of the 12th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2019) - Volume 3: BIOINFORMATICS},
year={2019},
pages={143-148},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007249601430148},
isbn={978-989-758-353-7},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 12th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2019) - Volume 3: BIOINFORMATICS
TI - Big Data Scalability of BayesPhylogenies on Harvard’s Ozone 12k Cores
SN - 978-989-758-353-7
AU - Manjunathaiah M.
AU - Meade A.
AU - Thavarajan R.
AU - Protopapas P.
AU - Dave R.
PY - 2019
SP - 143
EP - 148
DO - 10.5220/0007249601430148
PB - SciTePress