Developing a Machine Learning Workflow to Explain Black-box Models for Alzheimer’s Disease Classification

Louise Bloch, Louise Bloch, Christoph M. Friedrich, Christoph M. Friedrich

2021

Abstract

Many research articles used difficult-to-interpret black-box Machine Learning (ML) models to classify Alzheimer’s disease (AD) without examining their biological relevance. In this article, an ML workflow was developed to interpret black-box models based on Shapley values. This workflow enabled the model-agnostic visualization of complex relationships between model features and predictions and also the explanation of individual predictions, which is important in clinical practice. To demonstrate this workflow, eXtreme Gradient Boosting (XGBoost) and Random Forest (RF) classifiers were trained for AD classification. All models were trained on the Alzheimer’s Disease Neuroimaging Initiative (ADNI) or Australian Imaging and Lifestyle flagship study of Ageing (AIBL) dataset and were validated for independent test datasets of both cohorts. The results showed improved performances for black-box models in comparison to simple Classification and Regression Trees (CARTs). For the classification of Mild Cognitive Impairment (MCI) conversion and the ADNI training dataset, the best model achieved a classification accuracy of 71.03 % for the ADNI test dataset and 67.65 % for the entire AIBL dataset. This RF used a logical long-term memory test, the count of Apolipoprotein E ε4 (ApoEε4) alleles and the volume of the left hippocampus as the most important features.

Download


Paper Citation


in Harvard Style

Bloch L. and Friedrich C. (2021). Developing a Machine Learning Workflow to Explain Black-box Models for Alzheimer’s Disease Classification. In Proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2021) - Volume 5: HEALTHINF; ISBN 978-989-758-490-9, SciTePress, pages 87-99. DOI: 10.5220/0010211300870099


in Bibtex Style

@conference{healthinf21,
author={Louise Bloch and Christoph M. Friedrich},
title={Developing a Machine Learning Workflow to Explain Black-box Models for Alzheimer’s Disease Classification},
booktitle={Proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2021) - Volume 5: HEALTHINF},
year={2021},
pages={87-99},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010211300870099},
isbn={978-989-758-490-9},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2021) - Volume 5: HEALTHINF
TI - Developing a Machine Learning Workflow to Explain Black-box Models for Alzheimer’s Disease Classification
SN - 978-989-758-490-9
AU - Bloch L.
AU - Friedrich C.
PY - 2021
SP - 87
EP - 99
DO - 10.5220/0010211300870099
PB - SciTePress