Reconstruction of Mitochondrial Genotypes from Diverse next Generation Sequencing Datasets

Peter Ulz, Michael R. Speicher, Gerhard G. Thallinger

Abstract

The exponential growth of sequence databases in recent years opens up a lot of possibilities for reanalysis of public datasets. Here, we reanalyzed sequencing data from various experimental procedures to reconstruct the mitochondrial genome from sequence data of human samples. In a first step eight human cell lines were used to validate the approach and to ensure consistent genotype information across different library preparation techniques. Subsequently, 19,337 sequencing datasets were downloaded and checked for single-nucleotide variants and insertion or deletion events. We show that the mitochondrial genome can be inferred from many different library preparation techniques. We also generated reference mitochondrial genomes for eight cell lines. This approach may be used for sample identification as well as a general approach to study the mitochondrial genome from public sequencing data.

References

  1. Sequence Read Archive (2015) Available from: http://www.ncbi.nlm.nih.gov/Traces/sra/sra.cgi. [19 July 2015].
  2. Mamanova L, Coffey AJ, Scott CE et al. (2010): "Targetenrichment strategies for next-generation sequencing". In: Nature Methods. 7 (2), S. 111-118, DOI: 10.1038/nmeth.1419.
  3. Diroma MA, Calabrese C, Simone D et al. (2014): "Extraction and annotation of human mitochondrial genomes from 1000 Genomes Whole Exome Sequencing data". In: BMC Genomics. 15 (Suppl 3), S. S2, DOI: 10.1186/1471-2164-15-s3-s2.
  4. Smith DR. (2013): "RNA-Seq data: a goldmine for organelle research". In: Briefings in Functional Genomics. 12 (5), S. 454-456, DOI: 10.1093/bfgp/els066.
  5. Taylor RW.Turnbull DM. (2005): "Mitochondrial DNA mutations in human disease". In: Nature Reviews Genetics. 6 (5), S. 389-402, DOI: 10.1038/nrg1606.
  6. Cann RL, Stoneking M, Wilson, AC. (1987): "Mitochondrial DNA and human evolution". In: Nature. 325 (6099), S. 31-36, DOI: 10.1038/325031a0.
  7. Herrnstadt C, Preston G, Andrews R et al. (2002): "A high frequency of mtDNA polymorphisms in HeLa cell sublines". In: Mutation Research/Fundamental and Molecular Mechanisms of Mutagenesis. 501 (1-2), S. 19-28, DOI: 10.1016/s0027-5107(01)00304-9.
  8. Lott M, Leipzig JN, Derbeneva O et al. (2013): "mtDNA Variation and Analysis Using MITOMAP and MITOMASTER". In: Current Proctocols in Bioinformatics 44, pp. 1.23.1{1.23.26. doi: 10.1002/0471250953.bi0123s44.
  9. Picardi E, Pesole G (2012): "Mitochondrial genomes gleaned from human whole-exome sequencing". In: Nature Methods. 9 (6), S. 523-524, DOI: 10.1038/nmeth.2029.
  10. Nakamura K, Oshima T, Morimoto T et al. (2011): "Sequence-specific error profile of Illumina sequencers". In: Nucleic Acids Research. 39 (13), S. e90-e90, DOI: 10.1093/nar/gkr344.
  11. Li H, Handsaker B, Wysoker A et al. (2009): "The Sequence Alignment/Map format and SAMtools". In: Bioinformatics. 25 (16), S. 2078-2079, DOI: 10.1093/bioinformatics/btp352.
  12. Andrews RM, Kubacka I, Chinnery PF et al. (1999): "Reanalysis and revision of the Cambridge reference sequence for human mitochondrial DNA". In: Nature Genetics 23 p. 147.
  13. Li H, Durbin R. (2009): "Fast and accurate short read alignment with Burrows-Wheeler transform". In: Bioinformatics. 25 (14), S. 1754-1760, DOI: 10.1093/bioinformatics/btp324.
  14. R Development Core Team. (2008): "R: A Language and Environment for Statistical Computing. ISBN 3- 900051-07-0. R Foundation for Statistical Computing". Vienna, Austria, 2008. url: http://www.R-project.org.
  15. MitoMap Haplogroup markers.(2015) Available from: http://www.mitomap.org/bin/view.pl/MITOMAP/Hapl ogroupMarkers. [19-July-2015].
Download


Paper Citation


in Harvard Style

Ulz P., Speicher M. and Thallinger G. (2017). Reconstruction of Mitochondrial Genotypes from Diverse next Generation Sequencing Datasets . In Proceedings of the 10th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 3: BIOINFORMATICS, (BIOSTEC 2017) ISBN 978-989-758-214-1, pages 29-36. DOI: 10.5220/0006110200290036


in Bibtex Style

@conference{bioinformatics17,
author={Peter Ulz and Michael R. Speicher and Gerhard G. Thallinger},
title={Reconstruction of Mitochondrial Genotypes from Diverse next Generation Sequencing Datasets},
booktitle={Proceedings of the 10th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 3: BIOINFORMATICS, (BIOSTEC 2017)},
year={2017},
pages={29-36},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006110200290036},
isbn={978-989-758-214-1},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 10th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 3: BIOINFORMATICS, (BIOSTEC 2017)
TI - Reconstruction of Mitochondrial Genotypes from Diverse next Generation Sequencing Datasets
SN - 978-989-758-214-1
AU - Ulz P.
AU - Speicher M.
AU - Thallinger G.
PY - 2017
SP - 29
EP - 36
DO - 10.5220/0006110200290036