Authors: Carly A. Bobak 1 ; Alexander J. Titus 2 and Jane E. Hill 1

Affiliations: 1 Dartmouth School of Graduate and Advanced Studies, United States ; 2 Dartmouth School of Graduate and Advanced Studies and Dartmouth Geisel School of Medicine, United States

Keyword(s): Tuberculosis, Random Forest, Machine Learning, Transcriptional Signatures, Data Integration.

Abstract: There has been increasing concern in the scientific community about a reproducibility crisis, particularly in the field of bioinformatics. Often, published research results do not correlate with clinical success. One theory explaining this phenomenon is that findings from homogeneous cohort studies do not generalize to an inherently heterogeneous population. In this work, we integrate data from 4 distinct tuberculosis (TB) cohorts, for a total of 1164 samples, to find common differentially regulated genes that may be used to distinguish active TB from latent TB, treated TB, other diseases, and healthy controls. We selected 25 genes using random forest, obtaining an AUC of 0.89 in our training data and 0.86 in our test data. A total of 18 of the 25 genes had previously been associated with TB in independent studies, suggesting that integrating data may be an important tool for increasing the reproducibility of microarray research.
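The abstract reports performance as AUC (area under the ROC curve) on training and test data. As a minimal sketch of what that metric measures, and not the authors' evaluation code, the function below computes AUC as the probability that a randomly chosen positive sample receives a higher classifier score than a randomly chosen negative one. The function name `roc_auc` and the toy labels and scores are hypothetical, introduced only for illustration.

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random positive outscores a random negative.

    Ties between a positive and a negative count as half a win
    (the Mann-Whitney U formulation of the area under the ROC curve).
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative sample")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical toy data (not from the paper): 1 = active TB, 0 = any other group.
toy_labels = [1, 1, 1, 0, 0, 0, 0]
toy_scores = [0.9, 0.7, 0.4, 0.6, 0.3, 0.2, 0.1]
toy_auc = roc_auc(toy_labels, toy_scores)
```

An AUC of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation, so the reported 0.89 (training) and 0.86 (test) indicate that a selected-gene classifier ranks active TB samples above other samples most of the time.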

CC BY-NC-ND 4.0


Paper citation in several formats:
Bobak, C.; Titus, A. and Hill, J. (2018). Investigating Random Forest Classification on Publicly Available Tuberculosis Data to Uncover Robust Transcriptional Biomarkers. In Proceedings of the 11th International Joint Conference on Biomedical Engineering Systems and Technologies - AI4Health; ISBN 978-989-758-281-3; ISSN 2184-4305, SciTePress, pages 695-701. DOI: 10.5220/0006752406950701

@conference{ai4health18,
author={Carly A. Bobak and Alexander J. Titus and Jane E. Hill},
title={Investigating Random Forest Classification on Publicly Available Tuberculosis Data to Uncover Robust Transcriptional Biomarkers},
booktitle={Proceedings of the 11th International Joint Conference on Biomedical Engineering Systems and Technologies - AI4Health},
year={2018},
pages={695-701},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006752406950701},
isbn={978-989-758-281-3},
issn={2184-4305},
}

TY - CONF

JO - Proceedings of the 11th International Joint Conference on Biomedical Engineering Systems and Technologies - AI4Health
TI - Investigating Random Forest Classification on Publicly Available Tuberculosis Data to Uncover Robust Transcriptional Biomarkers
SN - 978-989-758-281-3
IS - 2184-4305
AU - Bobak, C.
AU - Titus, A.
AU - Hill, J.
PY - 2018
SP - 695
EP - 701
DO - 10.5220/0006752406950701
PB - SciTePress