Authors: H. V. Byelas; M. Dijkstra and M. A. Swertz

Affiliation: University Medical Center Groningen and University of Groningen, Netherlands

Keyword(s): Bioinformatics, Workflow management system, Data provenance, High-performance computing.

Related Ontology Subjects/Areas/Topics: Algorithms and Software Tools; Artificial Intelligence; Bioinformatics; Biomedical Engineering; Computational Intelligence; Databases and Data Management; Next Generation Sequencing; Soft Computing; Web Services in Bioinformatics

Abstract: Running bioinformatics analyses in a distributed computational environment and monitoring their execution has become a major challenge due to the size of the data and the complexity of analysis workflows. Some attempts have been made to combine computational and data management in a single solution using the MOLGENIS software generator. However, it was not clear how to explicitly specify output data for a particular study, evaluate its quality, or repeat the analysis depending on the results. We present here a new version of the MOLGENIS computational framework for bioinformatics, which reflects lessons learnt and new requirements from end users. We have improved our initial solution in two ways. First, we propose a new data model, which describes a workflow as a graph in a relational database, where nodes are analysis operations and edges are transactions between them. Inputs and outputs of the workflow nodes are explicitly specified. Second, we have extended the execution logic to trace data, show how final results were created, and handle errors in the distributed environment. We illustrate system applications on several analysis workflows for next generation sequencing.
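
The data model outlined in the abstract can be illustrated with a minimal sketch. The Python example below uses the standard-library sqlite3 module to store a workflow as a graph in relational tables and to trace which operations contributed to a given result. The table names (operation, edge), column names, step names and file names are hypothetical and are not taken from the MOLGENIS schema; the recursive query is one possible way to trace provenance, not necessarily the framework's own.

```python
import sqlite3

# Hypothetical schema for illustration only; not the actual MOLGENIS data model.
# Nodes are analysis operations with an explicitly specified output;
# edges connect an operation to the operation that consumes its output.
SCHEMA = """
CREATE TABLE operation (
    id          INTEGER PRIMARY KEY,
    name        TEXT NOT NULL UNIQUE,
    output_file TEXT NOT NULL
);
CREATE TABLE edge (
    source_id INTEGER NOT NULL REFERENCES operation(id),
    target_id INTEGER NOT NULL REFERENCES operation(id),
    PRIMARY KEY (source_id, target_id)
);
"""


def build_example(conn):
    """Create the tables and load a small, made-up three-step NGS workflow."""
    conn.executescript(SCHEMA)
    conn.executemany(
        "INSERT INTO operation (name, output_file) VALUES (?, ?)",
        [
            ("align", "sample.bam"),
            ("sort", "sample.sorted.bam"),
            ("call_variants", "sample.vcf"),
        ],
    )
    # Resolve operation names to ids while inserting the edges.
    conn.executemany(
        """
        INSERT INTO edge (source_id, target_id)
        SELECT s.id, t.id FROM operation s, operation t
        WHERE s.name = ? AND t.name = ?
        """,
        [("align", "sort"), ("sort", "call_variants")],
    )
    conn.commit()


def upstream(conn, name):
    """Return (name, output_file) for every operation the given one depends on."""
    rows = conn.execute(
        """
        WITH RECURSIVE ancestors(id) AS (
            SELECT e.source_id
            FROM edge e JOIN operation o ON o.id = e.target_id
            WHERE o.name = ?
            UNION
            SELECT e.source_id
            FROM edge e JOIN ancestors a ON e.target_id = a.id
        )
        SELECT o.name, o.output_file
        FROM operation o JOIN ancestors a ON a.id = o.id
        ORDER BY o.id
        """,
        (name,),
    ).fetchall()
    return rows


if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")
    build_example(connection)
    for step, output in upstream(connection, "call_variants"):
        print(step, "->", output)
```

Keeping edges in their own table means the provenance of a final result can be recovered with a single query, and a failed operation can be located together with everything downstream of it by following the edges in the opposite direction.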

CC BY-NC-ND 4.0

Paper citation in several formats:
Byelas, H. V.; Dijkstra, M. and Swertz, M. A. (2012). INTRODUCING DATA PROVENANCE AND ERROR HANDLING FOR NGS WORKFLOWS WITHIN THE MOLGENIS COMPUTATIONAL FRAMEWORK. In Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms (BIOSTEC 2012) - BIOINFORMATICS; ISBN 978-989-8425-90-4; ISSN 2184-4305, SciTePress, pages 42-50. DOI: 10.5220/0003738900420050

@conference{bioinformatics12,
author={H. V. Byelas and M. Dijkstra and M. A. Swertz},
title={INTRODUCING DATA PROVENANCE AND ERROR HANDLING FOR NGS WORKFLOWS WITHIN THE MOLGENIS COMPUTATIONAL FRAMEWORK},
booktitle={Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms (BIOSTEC 2012) - BIOINFORMATICS},
year={2012},
pages={42-50},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003738900420050},
isbn={978-989-8425-90-4},
issn={2184-4305},
}

TY - CONF

JO - Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms (BIOSTEC 2012) - BIOINFORMATICS
TI - INTRODUCING DATA PROVENANCE AND ERROR HANDLING FOR NGS WORKFLOWS WITHIN THE MOLGENIS COMPUTATIONAL FRAMEWORK
SN - 978-989-8425-90-4
IS - 2184-4305
AU - Byelas, H. V.
AU - Dijkstra, M.
AU - Swertz, M. A.
PY - 2012
SP - 42
EP - 50
DO - 10.5220/0003738900420050
PB - SciTePress