Speech Emotion Recognition using MFCC and Hybrid Neural Networks

Youakim Badr; Partha Mukherjee; Sindhu Thumati

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

Speech Emotion Recognition using MFCC and Hybrid Neural Networks

Topics: Convolutional Neural Networks; Deep Learning

In Proceedings of the 13th International Joint Conference on Computational Intelligence - Volume 1: NCTA, 366-373, 2021

Authors: Youakim Badr ; Partha Mukherjee and Sindhu Thumati

Affiliation: The Pennsylvania State University, Great Valley, U.S.A.

Keyword(s): Hybrid Neural Network, Speech Emotion Recognition, MFCC, ConvLSTM, RAVDESS Data.

Abstract: Speech emotion recognition is a challenging task and feature extraction plays an important role in effectively classifying speech into different emotions. In this paper, we apply traditional feature extraction methods like MFCC for feature extraction from audio files. Instead of using traditional machine learning approaches like SVM to classify audio files, we investigate different neural network architectures. Our baseline model implemented as a convolutional neural network results in 60% classification accuracy. We propose a hybrid neural network architecture based on Convolutional and Long Short-Term Memory (ConvLSTM) networks to capture spatial and sequential information of audio files. Our experimental results show that our ComvLSTM model has achieved an accuracy of 59%. We improved our model with data augmentation techniques and re-trained it with augmented dataset. The classification accuracy achieves 91% for multi-class classification of RAVDESS dataset outperforming the accu racy of state-of-the-art multi-class classification models that used the similar data. (More)

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 3.144.93.73

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Badr, Y.; Mukherjee, P. and Thumati, S. (2021). Speech Emotion Recognition using MFCC and Hybrid Neural Networks. In Proceedings of the 13th International Joint Conference on Computational Intelligence (IJCCI 2021) - NCTA; ISBN 978-989-758-534-0; ISSN 2184-3236, SciTePress, pages 366-373. DOI: 10.5220/0010707400003063

@conference{ncta21,
author={Youakim Badr. and Partha Mukherjee. and Sindhu Thumati.},
title={Speech Emotion Recognition using MFCC and Hybrid Neural Networks},
booktitle={Proceedings of the 13th International Joint Conference on Computational Intelligence (IJCCI 2021) - NCTA},
year={2021},
pages={366-373},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010707400003063},
isbn={978-989-758-534-0},
issn={2184-3236},
}

TY - CONF

JO - Proceedings of the 13th International Joint Conference on Computational Intelligence (IJCCI 2021) - NCTA
TI - Speech Emotion Recognition using MFCC and Hybrid Neural Networks
SN - 978-989-758-534-0
IS - 2184-3236
AU - Badr, Y.
AU - Mukherjee, P.
AU - Thumati, S.
PY - 2021
SP - 366
EP - 373
DO - 10.5220/0010707400003063
PB - SciTePress