VISUAL SPEECH RECOGNITION USING WAVELET TRANSFORM AND MOMENT BASED FEATURES

Wai C. Yau; Dinesh K. Kumar; Sridhar P. Arjunan; Sanjay Kumar

Topics: Feature Extraction; Image Processing; Speech Recognition; Vision, Recognition and Reconstruction

In Proceedings of the Third International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO, 340-345, 2006, Setúbal, Portugal

Authors: Wai C. Yau; Dinesh K. Kumar; Sridhar P. Arjunan and Sanjay Kumar

Affiliation: School of Electrical and Computer Engineering, RMIT University, Australia

Keyword(s): Visual Speech Recognition, Motion History Image, Discrete Stationary Wavelet Transform, Image Moments, Artificial Neural Network.

Related Ontology Subjects/Areas/Topics: Computer Vision, Visualization and Computer Graphics; Feature Extraction; Image and Video Analysis; Image Processing; Informatics in Control, Automation and Robotics; Robotics and Automation; Signal Processing, Sensors, Systems Modeling and Control; Speech Recognition; Vision, Recognition and Reconstruction

Abstract: This paper presents a novel vision-based approach to identifying utterances consisting of consonants. A view-based method represents the 3-D image sequence of the mouth movement in 2-D space as a grayscale image called a motion history image (MHI). The MHI is produced by applying accumulative image differencing to the image sequence, which implicitly captures the temporal information of the mouth movement. The proposed technique combines the discrete stationary wavelet transform (SWT) and image moments to classify the MHI. A level-1 2-D SWT decomposes the MHI into one approximation sub-image and three detail sub-images. The paper reports the classification accuracy of three moment-based features, namely Zernike moments, geometric moments and Hu moments, computed from the approximation sub-image of the MHI. A supervised feed-forward multilayer perceptron (MLP) artificial neural network (ANN), trained with the back-propagation learning algorithm, classifies the moment-based features. The performance and image representation ability of the three moment features are compared. Preliminary results show that all three moments achieve a high recognition rate when classifying 3 consonants.
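
The front end of the pipeline described in the abstract can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not give the differencing threshold, the wavelet family, or the moment orders, so the threshold value, the Haar filter, the periodic boundary handling, and all function names below are hypothetical choices. Zernike moments and the MLP classifier are omitted.

```python
import numpy as np


def motion_history_image(frames, threshold=0.05):
    """Collapse a (T, H, W) grayscale sequence into one 2-D image.

    Assumed formulation: a pixel that changes by more than `threshold`
    between frames t-1 and t is stamped with the value t, so more recent
    motion ends up brighter. The result is scaled to [0, 1].
    """
    frames = np.asarray(frames, dtype=float)
    n = frames.shape[0]
    mhi = np.zeros(frames.shape[1:], dtype=float)
    for t in range(1, n):
        moving = np.abs(frames[t] - frames[t - 1]) > threshold
        mhi = np.where(moving, float(t), mhi)  # later motion overwrites earlier
    return mhi / (n - 1)


def swt_haar_approximation(image):
    """Level-1 approximation sub-image of an undecimated (stationary) transform.

    Assumption: Haar low-pass filter with periodic extension. No
    downsampling is applied, so the output has the same shape as the input.
    """
    image = np.asarray(image, dtype=float)
    rows = (image + np.roll(image, -1, axis=0)) / np.sqrt(2)
    return (rows + np.roll(rows, -1, axis=1)) / np.sqrt(2)


def hu_first_two(image):
    """First two Hu invariants, built from normalized central moments."""
    image = np.asarray(image, dtype=float)
    y, x = np.mgrid[: image.shape[0], : image.shape[1]].astype(float)
    m00 = image.sum()
    xc = (x * image).sum() / m00
    yc = (y * image).sum() / m00

    def eta(p, q):
        # Central moment mu_pq, normalized for scale invariance.
        mu = ((x - xc) ** p * (y - yc) ** q * image).sum()
        return mu / m00 ** (1 + (p + q) / 2)

    phi1 = eta(2, 0) + eta(0, 2)
    phi2 = (eta(2, 0) - eta(0, 2)) ** 2 + 4 * eta(1, 1) ** 2
    return phi1, phi2


# Synthetic stand-in for a mouth-region video: 10 frames of 32x32 noise.
rng = np.random.default_rng(0)
video = rng.random((10, 32, 32))
features = hu_first_two(swt_haar_approximation(motion_history_image(video)))
```

In this sketch the `features` tuple plays the role of the moment-based feature vector that the abstract feeds to the neural network; a real system would use more moments and real mouth-region frames.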

CC BY-NC-ND 4.0

Paper citation in several formats:

Yau, W. C.; Kumar, D. K.; Arjunan, S. P. and Kumar, S. (2006). VISUAL SPEECH RECOGNITION USING WAVELET TRANSFORM AND MOMENT BASED FEATURES. In Proceedings of the Third International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO; ISBN 978-972-8865-60-3; ISSN 2184-2809, SciTePress, pages 340-345. DOI: 10.5220/0001209903400345

@conference{icinco06,
author={Wai C. Yau and Dinesh K. Kumar and Sridhar P. Arjunan and Sanjay Kumar},
title={VISUAL SPEECH RECOGNITION USING WAVELET TRANSFORM AND MOMENT BASED FEATURES},
booktitle={Proceedings of the Third International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO},
year={2006},
pages={340--345},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001209903400345},
isbn={978-972-8865-60-3},
issn={2184-2809},
}

TY  - CONF
JO  - Proceedings of the Third International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO
TI  - VISUAL SPEECH RECOGNITION USING WAVELET TRANSFORM AND MOMENT BASED FEATURES
SN  - 978-972-8865-60-3
IS  - 2184-2809
AU  - Yau, W. C.
AU  - Kumar, D. K.
AU  - Arjunan, S. P.
AU  - Kumar, S.
PY  - 2006
SP  - 340
EP  - 345
DO  - 10.5220/0001209903400345
PB  - SciTePress
ER  - 