Sharing Bioinformatic Data for Machine Learning: Maximizing Interoperability through License Selection

Alexander Bernier, Adrian Thorogood

2020

Abstract

Efficient machine learning in bioinformatics requires a large volume of data from different sources. Bioinformatics is shifting from a paradigm of siloed analysis of individual datasets by researchers to the aggregation and analysis of disparate sets of health and biomedical data across from academic, healthcare and commercial settings. Data generating organizations must give thought to selecting legal terms for dataset release that will promote compatibility with other datasets. In releasing bioinformatic data for open use, care must be taken to ensure that the terms of the licenses selected ensure maximum interoperability. The following technical elements should inform the choice of license: License hybridity; waivers of liability, warranties and guarantees; commercial/non-commercial use; attribution and copyleft; granular permission and bilateral or multilateral licensing. Licenses are compared to inform optimal license selection and enable data integration and analysis; consideration is given to an eventual standard license for open sharing of bioinformatic data.

Download


Paper Citation


in Harvard Style

Bernier A. and Thorogood A. (2020). Sharing Bioinformatic Data for Machine Learning: Maximizing Interoperability through License Selection. In Proceedings of the 13th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2020) - Volume 3: BIOINFORMATICS; ISBN 978-989-758-398-8, SciTePress, pages 226-232. DOI: 10.5220/0009179502260232


in Bibtex Style

@conference{bioinformatics20,
author={Alexander Bernier and Adrian Thorogood},
title={Sharing Bioinformatic Data for Machine Learning: Maximizing Interoperability through License Selection},
booktitle={Proceedings of the 13th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2020) - Volume 3: BIOINFORMATICS},
year={2020},
pages={226-232},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0009179502260232},
isbn={978-989-758-398-8},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 13th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2020) - Volume 3: BIOINFORMATICS
TI - Sharing Bioinformatic Data for Machine Learning: Maximizing Interoperability through License Selection
SN - 978-989-758-398-8
AU - Bernier A.
AU - Thorogood A.
PY - 2020
SP - 226
EP - 232
DO - 10.5220/0009179502260232
PB - SciTePress