APPLYING CONCEPTUAL MODELING TO ALIGNMENT TOOLS ONE STEP TOWARDS THE AUTOMATION OF DNA SEQUENCE ANALYSIS

Maria José Villanueva, Francisco Valverde, Oscar Pastor

2011

Abstract

Nowadays, the search of variations in DNA samples according to a reference sequence is performed using several bioinformatic tools. Due to the process complexity, none of these tools fulfill all the functionality required by biologists. For that reason, the definition of an integration process between these different tools becomes a mandatory requirement. One interesting issue is that bioinformatic tools do not comply with any standard format for expressing the output reports. As a consequence, the flow among tools must be manually solved. This paper proposes a conceptual model in order to formalize how the output from alignment tools must be produced. This work also provides a textual format based on this conceptual model. Thanks to both contributions, the integration is handled in the problem space and the related technological details are avoided. As a proof of concept of these ideas, the proposed format has been applied in a DNA sequence analysis process which uses two bioinformatic tools.

References

  1. Applied Biosystems (2010). Seqscape. http://www3.app liedbiosystems.com/ABHome/index.htm.
  2. Biron, P. V. and Malhotra, A., editors (2004). XML Schema Part 2: Datatypes. W3C Recommendation. W3C, 2nd edition.
  3. Brookes, A. J. et al. (2009). The Phenotype and Genotype Experiment Object Model (PaGE-OM): A Robust Data Structure for Information Related to DNA Variation. Human Mutation, 30(6):968-77.
  4. Codon Code Corporation (2010). Codon Code Aligner. http://www.codoncode.com/aligner/.
  5. Den Dunnen, J. T. and Antonarakis, S. E. (2000). Mutation Nomenclature Extensions and Suggestions to Describe Complex Mutations: A Discussion. Human Mutation, 15(1):7-12.
  6. Department Genomic Sciences (2010). http://droog.gs.washington.edu/polyphred/. Codes Corporation (2010).
  7. Gene Ontology Consortium (2004). The Gene Ontology (GO) Database and Informatics Resource. Nucleic Acids Research, 32(suppl1):D258-261.
  8. Kühne, T. (2005). What is a Model. In Language Engineering for Model-Driven Software Development, number 04101 in Dagstuhl Seminar Proceedings, pages 200- 0. IBFI, Schloss Dagstuhl, Germany.
  9. Li, H. et al. (2009). The Sequence Alignment/Map Format and SAM Tools. Bioinformatics, 25(16):2078-2079.
  10. Manaster, C. et al. (2005). InSNP: a tool for automated detection and visualization of SNPs and InDels. Human mutation, 26(1):11-19.
  11. Martinez, A. M. et al. (2010). Facing the challenges of genome information systems: a variation analysis prototype. Caise Forum.
  12. NCBI (2010). BLAST (Basic Local Alignment Search Tool). http://blast.ncbi.nlm.nih.gov/Blast.cgi.
  13. Ort, E. and Mehta, B. (2003). Java Architecture for XML Binding (JAXB). Technical Report Sun Developer Network.
  14. Paton, N. W. et al. (2000). Conceptual modelling of genomic information. Bioinformatics, 16(6):548-57.
  15. Povey, S., Lovering, R., Bruford, E., Wright, M., Lush, M., and Wain, H. (2001). The HUGO Gene Nomenclature Committee (HGNC). Human Genetics, 109(6):678- 680.
  16. Rusk, N. (2009). Focus on Next-Generation Sequencing Data Analysis. Nature Methods, 6(11s):S1.
  17. Shah, S. et al. (2005). Atlas - A Data Warehouse for Integrative Bioinformatics. BMC Bioinformatics, 6(1):34.
  18. Softgenetics (2010). Mutation Surveyor. http://www.soft genetics.com/.
Download


Paper Citation


in Harvard Style

Villanueva M., Valverde F. and Pastor O. (2011). APPLYING CONCEPTUAL MODELING TO ALIGNMENT TOOLS ONE STEP TOWARDS THE AUTOMATION OF DNA SEQUENCE ANALYSIS . In Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2011) ISBN 978-989-8425-36-2, pages 137-142. DOI: 10.5220/0003142001370142


in Bibtex Style

@conference{bioinformatics11,
author={Maria José Villanueva and Francisco Valverde and Oscar Pastor},
title={APPLYING CONCEPTUAL MODELING TO ALIGNMENT TOOLS ONE STEP TOWARDS THE AUTOMATION OF DNA SEQUENCE ANALYSIS},
booktitle={Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2011)},
year={2011},
pages={137-142},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003142001370142},
isbn={978-989-8425-36-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2011)
TI - APPLYING CONCEPTUAL MODELING TO ALIGNMENT TOOLS ONE STEP TOWARDS THE AUTOMATION OF DNA SEQUENCE ANALYSIS
SN - 978-989-8425-36-2
AU - Villanueva M.
AU - Valverde F.
AU - Pastor O.
PY - 2011
SP - 137
EP - 142
DO - 10.5220/0003142001370142