MOT: A Multi-Omics Transformer for Multiclass Classification Tumour Types Predictions

Mazid Osseni, Prudencio Tossou, François Laviolette, Jacques Corbeil, Jacques Corbeil

2023

Abstract

Motivation: Breakthroughs in high-throughput technologies and machine learning methods have enabled the shift towards multi-omics modelling as the preferred means to understand the mechanisms underlying biological processes. Machine learning enables and improves complex disease prognosis in clinical settings. However, most multi-omic studies primarily use transcriptomics and epigenomics due to their over-representation in databases and their early technical maturity compared to others omics. For complex phenotypes and mechanisms, not leveraging all the omics despite their varying degree of availability can lead to a failure to understand the underlying biological mechanisms and leads to less robust classifications and predictions. Results: We proposed MOT (Multi-Omic Transformer), a deep learning based model using the transformer architecture, that discriminates complex phenotypes (herein cancer types) based on five omics data types: transcriptomics (mRNA and miRNA), epigenomics (DNA methylation), copy number variations (CNVs), and proteomics. This model achieves an F1-score of 98:37% among 33 tumour types on a test set without missing omics views and an F1-score of 96:74% on a test set with missing omics views. It also identifies the required omic type for the best prediction for each phenotype and therefore could guide clinical decision-making when acquiring data to confirm a diagnostic. The newly introduced model can integrate and analyze five or more omics data types even with missing omics views and can also identify the essential omics data for the tumour multiclass classification tasks. It confirms the importance of each omic view. Combined, omics views allow a better differentiation rate between most cancer diseases. Our study emphasized the importance of multi-omic data to obtain a better multiclass cancer classification. Availability and implementation: MOT source code is available at https://github.com/dizam92/multiomic predictions.

Download


Paper Citation


in Harvard Style

Osseni M., Tossou P., Laviolette F. and Corbeil J. (2023). MOT: A Multi-Omics Transformer for Multiclass Classification Tumour Types Predictions. In Proceedings of the 16th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2023) - Volume 3: BIOINFORMATICS; ISBN 978-989-758-631-6, SciTePress, pages 252-261. DOI: 10.5220/0011780100003414


in Bibtex Style

@conference{bioinformatics23,
author={Mazid Osseni and Prudencio Tossou and François Laviolette and Jacques Corbeil},
title={MOT: A Multi-Omics Transformer for Multiclass Classification Tumour Types Predictions},
booktitle={Proceedings of the 16th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2023) - Volume 3: BIOINFORMATICS},
year={2023},
pages={252-261},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0011780100003414},
isbn={978-989-758-631-6},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 16th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2023) - Volume 3: BIOINFORMATICS
TI - MOT: A Multi-Omics Transformer for Multiclass Classification Tumour Types Predictions
SN - 978-989-758-631-6
AU - Osseni M.
AU - Tossou P.
AU - Laviolette F.
AU - Corbeil J.
PY - 2023
SP - 252
EP - 261
DO - 10.5220/0011780100003414
PB - SciTePress