Metagenomic Clustering in Search of Common Origin

Jolanta Kawulok, Michal Kawulok

2020

Abstract

Analysis of metagenomic samples aims to extract relevant information about them, including their composition and origin. To determine where a sample comes from, it is commonly compared with a set of reference samples collected from known locations. However, if such reference samples are unavailable, or if the origins of the investigated samples are not covered by the reference set, it may be helpful to identify groups of similar samples that may share a common origin. In this paper, we tackle this problem by applying hierarchical clustering to a matrix of mutual similarities obtained using the Mash program and our CoMeta program. We report initial, yet encouraging, results of an experimental study performed on metagenomic data extracted from two large metropolises and downloaded from the Sequence Read Archive repository. The results indicate that the proposed approach is effective, which justifies further exploration of the topic using more extensive data.
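To make the idea of clustering a matrix of mutual similarities concrete, the following is a minimal sketch in Python. It is an illustration under stated assumptions, not the procedure used in the paper: the abstract does not name a linkage criterion, so average linkage is assumed here; the conversion of similarity to distance as 1 - similarity is also an assumption; and the matrix values are invented toy numbers, not output of Mash or CoMeta. The function name `average_linkage` is hypothetical.

```python
from itertools import combinations


def average_linkage(dist, n_clusters):
    """Agglomerative clustering: repeatedly merge the two clusters with the
    smallest average pairwise distance until n_clusters remain.
    Average linkage is an assumption; the abstract does not specify one."""
    clusters = [[i] for i in range(len(dist))]

    def avg(a, b):
        # Mean distance over all pairs (one sample from each cluster).
        return sum(dist[i][j] for i in a for j in b) / (len(a) * len(b))

    while len(clusters) > n_clusters:
        # Indices (a, b) with a < b of the closest pair of clusters.
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: avg(clusters[p[0]], clusters[p[1]]),
        )
        clusters[a] = sorted(clusters[a] + clusters[b])
        del clusters[b]
    return sorted(clusters)


# Toy symmetric similarity matrix for five samples (invented values,
# not produced by Mash or CoMeta).
similarity = [
    [1.00, 0.90, 0.85, 0.20, 0.15],
    [0.90, 1.00, 0.80, 0.25, 0.10],
    [0.85, 0.80, 1.00, 0.30, 0.20],
    [0.20, 0.25, 0.30, 1.00, 0.95],
    [0.15, 0.10, 0.20, 0.95, 1.00],
]

# Assumed conversion: higher similarity means smaller distance.
distance = [[1.0 - s for s in row] for row in similarity]

groups = average_linkage(distance, n_clusters=2)
print(groups)  # [[0, 1, 2], [3, 4]]
```

In this toy case, samples 0 to 2 and samples 3 to 4 end up in separate groups, which is the kind of grouping one would read as a candidate common origin. In practice the number of groups is usually not known in advance, so the merge sequence would be inspected as a dendrogram rather than cut at a fixed count.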


Paper Citation


in Harvard Style

Kawulok J. and Kawulok M. (2020). Metagenomic Clustering in Search of Common Origin. In Proceedings of the 13th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2020) - Volume 3: BIOINFORMATICS; ISBN 978-989-758-398-8, SciTePress, pages 218-225. DOI: 10.5220/0009177702180225


in Bibtex Style

@conference{bioinformatics20,
author={Jolanta Kawulok and Michal Kawulok},
title={Metagenomic Clustering in Search of Common Origin},
booktitle={Proceedings of the 13th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2020) - Volume 3: BIOINFORMATICS},
year={2020},
pages={218-225},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0009177702180225},
isbn={978-989-758-398-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 13th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2020) - Volume 3: BIOINFORMATICS
TI - Metagenomic Clustering in Search of Common Origin
SN - 978-989-758-398-8
AU - Kawulok J.
AU - Kawulok M.
PY - 2020
SP - 218
EP - 225
DO - 10.5220/0009177702180225
PB - SciTePress