DNA Analysis: Principles and Sequencing Algorithms

Veronika Abramova, Bruno Cabral, Jorge Bernardino

Abstract

DNA discovery has put humans one step closer to deciphering their own structure stored as biological data. Such data could provide us with a huge amount of information, necessary for studying ourselves and learn all the variants that pre-determine one’s characteristics. Although, these days, we are able to extract DNA from our cells and transform it into sequences, there is still a long road ahead since DNA has not been easy to process or even extract in one go. Over the past years, bioinformatics has been evolving more and more, constantly aiding biologists on the attempts to “break” the code. In this paper, we present some of the most relevant algorithms and principles applied on the analysis of our DNA. We attempt to provide basic genome overview but, moreover, the focus of our study is on assembly, one of the main phases of DNA analysis.

References

  1. Alberts, B., Johnson, A., Lewis,J., Raff, M., Roberts, K., Walters, P. 2002. Molecular Biology of the Cell; Fourth Edition. New York and London: Garland Science. ISBN 0-8153-3218-1.
  2. Barnett, J. H. 2009. Early Writings on Graph Theory: Euler Circuits and The Königsberg Bridge Problem.
  3. Compeau, P. E. C., Pevzner, P. A. and Tesler, G. 2011. How to apply de Bruijn graphs to genome assembly.
  4. Domingues, F.S., Lackner, P., Andreeva, A., Sippl, M.J. 2000. Structure-based evaluation of squence coparison and fold recognition alignment accuracy. J Mol Biol. 2000;297:1003-1013.
  5. El-Metwally, S., Taher, H., Magdi, Z. and Helmy, M. 2013. Next-Generation Sequence Assembly: Four Stages of Data Processing and Computational Challenges. PLoS Comput Biol. 2013 Dec; 9(12): e1003345.
  6. Era7. 2016. https://era7bioinformatics.com/en/page.cfm? id=1500 retrieved on 20.01.2016.
  7. Fleury, M. 1883. Deux problèmes de Géométrie de situation. Journal de mathématiques élémentaires, 2nd ser. (in French) 2: 257-261.
  8. Gilles, A., Meglécz, E., Pech, N., Ferreira, S., Malausa, T., Martin, J.F. Accuracy and quality assessment of 454 GS-FLX Titanium pyrosequencing, 2011, 12:245.
  9. Gregory, S. 2005. Contig Assembly. Encyclopedia of Life Sciences.
  10. Held, M. and Karp, R. M. 1962. A dynamic programming approach to sequencing problems. J. Siam 10 (1): 196-210.
  11. Hert, D.G., Fredlake, C.P., Barron, A.E. 2008. Advantages and limitations of next-generation sequencing technologies: a comparison of electrophoresis and non-electrophoresis methods. Electrophoresis. 29(23): 4618-26.
  12. Hopkins, B, and Wilson, R. The Truth about Königsberg. College Mathematics Journal (2004), 35, 198-207.
  13. Illumina, Inc. 2010. De Novo Assembly Using Illumina Reads. Nature 171:737-738.
  14. Mount, DM. 2004. Bioinformatics: Sequence and Genome Analysis (2nd ed.). Cold Spring Harbor Laboratory Press: Cold Spring Harbor, NY. ISBN 0-87969-608-7.
  15. Munib, A., Ishfaq, A., Mohammad S. A. 2015. A survey of genome sequence assembly techniques and algorithms using high-performance computing. The Journal of Supercomputing. Vol. 71 (1), pp 293-339.
  16. Jones, N.C. and Pevzner, P.A. 2004. An Introduction to Bioinformatics Algorithms. © 2004 Massachusetts Institute of Technology.
  17. Niedringhaus, T.P., Milanova, D., Kerby, M.B., Snyder, M.P., Barron, A.E. 2011 Landscape of next-generation sequencing technologies. Anal Chem 83: 4327- 4341.
  18. Paoletti, T. 2011. Leonard Euler's Solution to the Konigsberg Bridge Problem. Convergence 2011.
  19. Polyanovsky, V. O., Roytberg, M. A., Tumanyan, V. G. 2011. Comparative analysis of the quality of a global algorithm and a local algorithm for alignment of two sequences. Algorithms for Molecular Biology 6.
  20. Polyanovsky, V., Roytberg, M.A., Tumanyan, V.G. 2008. Reconstruction of genuine pair-wise sequence alignment. J Comp Biol. 2008;15:379-391.
  21. Posada, D. 2009. Bioinformatics for DNA Sequence Analysis. (Ed.). ISBN 978-1-59745-251-9.
  22. Rubin, F. 1974. A Search Procedure for Hamilton Paths and Circuits. Journal of the ACM 21 (4): 576-80.
  23. Saenger, W. 1984. Principles of Nucleic Acid Structure. New York: Springer-Verlag. ISBN 0-387-90762-9.
  24. Salzberg, S. L., Phillippy, A.M., Zimin, A., Puiu1, D., Magoc, T., Koren, S., Treangen, TJ., Schatz, M.C., Delcher, A.L., Roberts, M., Marçais, G., Pop, M. and Yorke, J.A. 2011. GAGE: A critical evaluation of genome assemblies and assembly algorithms.
  25. Sunyaev, S.R., Bogopolsky, G.A., Oleynikova, N.V., Vlasov, P.K., Finkelstein, A.V., Roytberg, M.A. 2004. From analysis of protein structural alignments toward a novel approach to align protein sequences. Proteins: Structure, Function and Bioinforrmatics. 2004; 54:569-582.
  26. Vazirani, U. V. 2001. Algorithms.
  27. Voelkerding, K.V., Dames, S.A., Durtschi, J.D. 2009. Next-generation sequencing: from basic research to diagnostics. Clin Chem 55: 641-658.
  28. Watson J, Crick. 1953. Molecular structure of nucleic acids: a structure for deoxyribose nucleic acid.
  29. Wheeler, DA, Srinivasan, M., Egholm, M., Shen, Y., Chen, L., McGuire, A. 2008. The complete genome of an individual by massively parallel DNA sequencing. Nature 2008; 452:872-876.
  30. Zhou, X., Ren, L., Meng, Q., Li, Y., Yu, Y. 2010. The next-generation sequencing technology and application. Protein Cell 1: 520-536.
Download


Paper Citation


in Harvard Style

Abramova V., Cabral B. and Bernardino J. (2016). DNA Analysis: Principles and Sequencing Algorithms . In Proceedings of the 8th International Joint Conference on Computational Intelligence - Volume 1: ECTA, (IJCCI 2016) ISBN 978-989-758-201-1, pages 245-250. DOI: 10.5220/0006084102450250


in Bibtex Style

@conference{ecta16,
author={Veronika Abramova and Bruno Cabral and Jorge Bernardino},
title={DNA Analysis: Principles and Sequencing Algorithms},
booktitle={Proceedings of the 8th International Joint Conference on Computational Intelligence - Volume 1: ECTA, (IJCCI 2016)},
year={2016},
pages={245-250},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006084102450250},
isbn={978-989-758-201-1},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 8th International Joint Conference on Computational Intelligence - Volume 1: ECTA, (IJCCI 2016)
TI - DNA Analysis: Principles and Sequencing Algorithms
SN - 978-989-758-201-1
AU - Abramova V.
AU - Cabral B.
AU - Bernardino J.
PY - 2016
SP - 245
EP - 250
DO - 10.5220/0006084102450250