loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Amirhossein Tavanaei 1 ; Alireza Ghasemi 2 ; Mohammad Tavanaei 3 ; Hossein Sameti 1 and Mohammad T. Manzuri 1

Affiliations: 1 Sharif University of Technology, Iran, Islamic Republic of ; 2 École Polytechnique Fédérale de Lausanne, Switzerland ; 3 SAIPA Company, Iran, Islamic Republic of

Keyword(s): Speech recognition, Machine learning, Pattern recognition, Mel frequency discrete wavelet transform, One-class learning, Support vector data description.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Biomedical Engineering ; Biomedical Signal Processing ; Data Manipulation ; Health Engineering and Technology Applications ; Human-Computer Interaction ; Methodologies and Methods ; Neurocomputing ; Neurotechnology, Electronics and Informatics ; Pattern Recognition ; Physiological Computing Systems ; Sensor Networks ; Soft Computing ; Speech Recognition ; Wavelet Transform

Abstract: A classifier based on Support Vector Data Description (SVDD) is proposed for spoken digit recognition. We use the Mel Frequency Discrete Wavelet Coefficients (MFDWC) and the Mel Frequency cepstral Coefficients (MFCC) as the feature vectors. The proposed classifier is compared to the HMM and results are promising and we show the HMM and SVDD classifiers have equal accuracy rates. The performance of the proposed features and SVDD classifier with several kernel functions are evaluated and compared in clean and noisy speech. Because of multi resolution and localization of the Wavelet Transform (WT) and using SVDD, experiments on the spoken digit recognition systems based on MFDWC features and SVDD with weighted polynomial kernel function give better results than the other methods.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 54.147.102.111

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Tavanaei, A.; Ghasemi, A.; Tavanaei, M.; Sameti, H. and T. Manzuri, M. (2012). SUPPORT VECTOR DATA DESCRIPTION FOR SPOKEN DIGIT RECOGNITION. In Proceedings of the International Conference on Bio-inspired Systems and Signal Processing (BIOSTEC 2012) - BIOSIGNALS; ISBN 978-989-8425-89-8; ISSN 2184-4305, SciTePress, pages 32-37. DOI: 10.5220/0003764400320037

@conference{biosignals12,
author={Amirhossein Tavanaei. and Alireza Ghasemi. and Mohammad Tavanaei. and Hossein Sameti. and Mohammad {T. Manzuri}.},
title={SUPPORT VECTOR DATA DESCRIPTION FOR SPOKEN DIGIT RECOGNITION},
booktitle={Proceedings of the International Conference on Bio-inspired Systems and Signal Processing (BIOSTEC 2012) - BIOSIGNALS},
year={2012},
pages={32-37},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003764400320037},
isbn={978-989-8425-89-8},
issn={2184-4305},
}

TY - CONF

JO - Proceedings of the International Conference on Bio-inspired Systems and Signal Processing (BIOSTEC 2012) - BIOSIGNALS
TI - SUPPORT VECTOR DATA DESCRIPTION FOR SPOKEN DIGIT RECOGNITION
SN - 978-989-8425-89-8
IS - 2184-4305
AU - Tavanaei, A.
AU - Ghasemi, A.
AU - Tavanaei, M.
AU - Sameti, H.
AU - T. Manzuri, M.
PY - 2012
SP - 32
EP - 37
DO - 10.5220/0003764400320037
PB - SciTePress