Reproducibility, Transparency and Evaluation of Machine Learning in Health Applications

Janusz Wojtusiak

Abstract

This paper argues for the importance of detailed reporting of results of machine learning modeling applied in medical, healthcare and health applications. It describes ten criteria under which results of modeling should be reported. The ten proposed criteria are experimental design, statistical model evaluation, model calibration, top predictors, global sensitivity analysis, decision curve analysis, global model explanation, local prediction explanation, programming interface and source code. The criteria are discussed and illustrated in the context of existing models. The goal of the reporting is to ensure that results are reproducible, and models gain trust of end users. A brief checklist is provided to help facilitate model evaluation.

Download


Paper Citation


in Harvard Style

Wojtusiak J. (2021). Reproducibility, Transparency and Evaluation of Machine Learning in Health Applications.In Proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 4: HEALTHINF, ISBN 978-989-758-490-9, pages 685-692. DOI: 10.5220/0010348306850692


in Bibtex Style

@conference{healthinf21,
author={Janusz Wojtusiak},
title={Reproducibility, Transparency and Evaluation of Machine Learning in Health Applications},
booktitle={Proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 4: HEALTHINF,},
year={2021},
pages={685-692},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010348306850692},
isbn={978-989-758-490-9},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 4: HEALTHINF,
TI - Reproducibility, Transparency and Evaluation of Machine Learning in Health Applications
SN - 978-989-758-490-9
AU - Wojtusiak J.
PY - 2021
SP - 685
EP - 692
DO - 10.5220/0010348306850692