loading
Documents

Research.Publish.Connect.

Paper

Paper Unlock

Authors: H. V. Byelas ; M. Dijkstra and M. A. Swertz

Affiliation: University Medical Center Groningen and University of Groningen, Netherlands

ISBN: 978-989-8425-90-4

Keyword(s): Bioinformatics, Workflow Management system, Data provenance, High performance computing.

Related Ontology Subjects/Areas/Topics: Algorithms and Software Tools ; Artificial Intelligence ; Bioinformatics ; Biomedical Engineering ; Computational Intelligence ; Databases and Data Management ; Next Generation Sequencing ; Soft Computing ; Web Services in Bioinformatics

Abstract: Running bioinformatics analyses in a distributed computational environment and monitoring their executions has become a huge challenge due to the size of data and complexity of analysis workflows. Some attempts have been made to combine computational and data management in a single solution using the MOLGENIS software generator. However, it was not clear how to explicitly specify output data for a particular research, evaluate its quality or possibly repeat the analysis depending on results. We present here a new version of a MOLGENIS computational framework for bioinformatics, which reflects lessons learnt and new requirements from end users. We have improved our initial solution in two ways. First, we propose a new data model, which describes a workflow as a graph in a relational database, where nodes are analysis operations and edges are transactions between them. Inputs and outputs of the workflow nodes are explicitly specified. Second, we have extended the executional logic to tr ace data, show how final results were created and how to handle errors in the distributed environment. We illustrate system applications on several analysis workflows for next generation sequencing. (More)

PDF ImageFull Text

Download
CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 35.175.200.4

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
V. Byelas, H.; V. Byelas, H.; Dijkstra, M.; Dijkstra, M. and A. Swertz, M. (2012). INTRODUCING DATA PROVENANCE AND ERROR HANDLING FOR NGS WORKFLOWS WITHIN THE MOLGENIS COMPUTATIONAL FRAMEWORK.In Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2012) ISBN 978-989-8425-90-4, pages 42-50. DOI: 10.5220/0003738900420050

@conference{bioinformatics12,
author={H. V. Byelas. and H. V. Byelas. and M. Dijkstra. and M. Dijkstra. and M. A. Swertz.},
title={INTRODUCING DATA PROVENANCE AND ERROR HANDLING FOR NGS WORKFLOWS WITHIN THE MOLGENIS COMPUTATIONAL FRAMEWORK},
booktitle={Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2012)},
year={2012},
pages={42-50},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003738900420050},
isbn={978-989-8425-90-4},
}

TY - CONF

JO - Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2012)
TI - INTRODUCING DATA PROVENANCE AND ERROR HANDLING FOR NGS WORKFLOWS WITHIN THE MOLGENIS COMPUTATIONAL FRAMEWORK
SN - 978-989-8425-90-4
AU - V. Byelas, H.
AU - V. Byelas, H.
AU - Dijkstra, M.
AU - Dijkstra, M.
AU - A. Swertz, M.
PY - 2012
SP - 42
EP - 50
DO - 10.5220/0003738900420050

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.