Authors: Diogo Pratas, Armando J. Pinho and Sara P. Garcia

Affiliation: University of Aveiro, Portugal

ISBN: 978-989-8425-90-4

Keyword(s): Normalized-compression distance, Finite-context models, Human chromosomal similarity.

Related Ontology Subjects/Areas/Topics: Algorithms and Software Tools; Bioinformatics; Biomedical Engineering; Sequence Analysis

Abstract: A compression-based similarity measure assesses the similarity between two objects using the number of bits needed to describe one of them when a description of the other is available. To be effective, these measures have to rely on “normal” compression algorithms, roughly meaning that they have to be able to build an internal model of the data being compressed. Often, good “normal” compression methods are slow, and those that are fast do not provide acceptable results. In this paper, we propose a method for measuring the similarity of DNA sequences that balances these two goals. The method relies on a mixture of finite-context models and is compared with other methods, including XM, the state-of-the-art DNA compression technique. Moreover, we present a comprehensive study of the inter-chromosomal similarity of the human genome.
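To make the idea in the abstract concrete, the following is a minimal Python sketch of the normalized compression distance, NCD(x, y) = (C(xy) - min(C(x), C(y))) / max(C(x), C(y)), where C(.) is a code length in bits. It is not the authors' implementation: the page does not describe how the mixture is built, so the sketch assumes a single adaptive finite-context model as the cost function. The names `fcm_bits` and `ncd`, the context order of 12 and the smoothing parameter `alpha` are all illustrative assumptions.

```python
import math
import random
from collections import defaultdict

ALPHABET = "ACGT"


def fcm_bits(seq, order=12, alpha=1 / 16):
    """Estimate the code length of seq, in bits, under ONE adaptive
    finite-context model (assumption: the paper mixes several models;
    that mixture is not reproduced here)."""
    # counts[context][symbol] = times symbol followed context so far
    counts = defaultdict(lambda: defaultdict(int))
    bits = 0.0
    for i, sym in enumerate(seq):
        ctx = seq[max(0, i - order):i]
        table = counts[ctx]
        total = sum(table.values())
        # Additive smoothing keeps every probability strictly positive.
        p = (table[sym] + alpha) / (total + alpha * len(ALPHABET))
        bits -= math.log2(p)
        table[sym] += 1  # update the model after coding the symbol
    return bits


def ncd(x, y, cost=fcm_bits):
    """Normalized compression distance using `cost` as the compressor."""
    cx, cy, cxy = cost(x), cost(y), cost(x + y)
    return (cxy - min(cx, cy)) / max(cx, cy)


if __name__ == "__main__":
    rng = random.Random(0)
    x = "".join(rng.choices(ALPHABET, k=2000))
    y = "".join(rng.choices(ALPHABET, k=2000))
    print("NCD(x, x) =", round(ncd(x, x), 3))  # expected to be small
    print("NCD(x, y) =", round(ncd(x, y), 3))  # expected to be near 1
```

In this sketch, concatenating x with itself costs few extra bits because the model has already seen every context of the second copy, whereas an unrelated sequence y gains almost nothing from the model built on x. Any other function returning a code length in bits could be passed as `cost`.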

Paper citation in several formats:
Pratas, D.; Pinho, A. J. and Garcia, S. P. (2012). COMPUTATION OF THE NORMALIZED COMPRESSION DISTANCE OF DNA SEQUENCES USING A MIXTURE OF FINITE-CONTEXT MODELS. In Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2012), ISBN 978-989-8425-90-4, pages 308-311. DOI: 10.5220/0003780203080311

@conference{bioinformatics12,
author={Diogo Pratas and Armando J. Pinho and Sara P. Garcia},
title={COMPUTATION OF THE NORMALIZED COMPRESSION DISTANCE OF DNA SEQUENCES USING A MIXTURE OF FINITE-CONTEXT MODELS},
booktitle={Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2012)},
year={2012},
pages={308-311},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003780203080311},
isbn={978-989-8425-90-4},
}

TY - CONF

JO - Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2012)
TI - COMPUTATION OF THE NORMALIZED COMPRESSION DISTANCE OF DNA SEQUENCES USING A MIXTURE OF FINITE-CONTEXT MODELS
SN - 978-989-8425-90-4
AU - Pratas, D.
AU - Pinho, A. J.
AU - Garcia, S. P.
PY - 2012
SP - 308
EP - 311
DO - 10.5220/0003780203080311
