loading
Documents

Research.Publish.Connect.

Paper

Paper Unlock

Authors: S. García-López 1 ; J. A. Jaramillo-Garzón 2 ; L. Duque-Muñoz 2 and C. G. Castellanos-Domínguez 1

Affiliations: 1 Universidad Nacional de Colombia, Colombia ; 2 Universidad Nacional de Colombia and Instituto Tecnológico Metropolitano, Colombia

ISBN: 978-989-8565-35-8

Keyword(s): Molecular Functions Prediction, Proteins, Cuckoo Search, Cost Sensitive Learning, Class Imbalance.

Related Ontology Subjects/Areas/Topics: Algorithms and Software Tools ; Artificial Intelligence ; Bioinformatics ; Biomedical Engineering ; Computational Intelligence ; Genomics and Proteomics ; Pattern Recognition, Clustering and Classification ; Sequence Analysis ; Soft Computing

Abstract: Due to the large amount of data generated by genomics and proteomics research, the use of computational methods has been a great support tool for this purpose. However, tools based on machine learning, face several problems associated to the nature of the data, one of them is the class-imabalance problem. Several balancing techniques exist to obtain an improvement in prediction performance, such as boosting and resampling, but they have multiple weaknesses in difficult data spaces. On the other hand, cost sensitive learning is an alternative solution, yet, the obtention of appropriate cost matrix to induce a good prediction model is complex, and still remains an open problem. In this paper, a methodology to obtain an optimal cost matrix to train models based on cost sensitive learning is proposed. The results show that cost sensitive learning with a proper cost can be very competitive, and even outperform many class-balance strategies in the state of the art. Tests were applied to pre diction of molecular functions in Embryophyta plants. (More)

PDF ImageFull Text

Download
Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 54.224.220.72

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
García-López S., A. Jaramillo-Garzón J., Duque-Muñoz L. and G. Castellanos-Domínguez C. (2013). A Methodology for Optimizing the Cost Matrix in Cost Sensitive Learning Models applied to Prediction of Molecular Functions in Embryophyta Plants.In Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2013) ISBN 978-989-8565-35-8, pages 71-80. DOI: 10.5220/0004250900710080

@conference{bioinformatics13,
author={S. García-López and J. A. Jaramillo-Garzón and L. Duque-Muñoz and C. G. Castellanos-Domínguez},
title={A Methodology for Optimizing the Cost Matrix in Cost Sensitive Learning Models applied to Prediction of Molecular Functions in Embryophyta Plants},
booktitle={Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2013)},
year={2013},
pages={71-80},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004250900710080},
isbn={978-989-8565-35-8},
}

TY - CONF

JO - Proceedings of the International Conference on Bioinformatics Models, Methods and Algorithms - Volume 1: BIOINFORMATICS, (BIOSTEC 2013)
TI - A Methodology for Optimizing the Cost Matrix in Cost Sensitive Learning Models applied to Prediction of Molecular Functions in Embryophyta Plants
SN - 978-989-8565-35-8
AU - García-López S.
AU - A. Jaramillo-Garzón J.
AU - Duque-Muñoz L.
AU - G. Castellanos-Domínguez C.
PY - 2013
SP - 71
EP - 80
DO - 10.5220/0004250900710080

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.