An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics

Gražina Korvel; Olga Kurasova; Bożena Kostek

Topics: Multidimensional Signal Processing; Multimodal Signal Processing; Music, Speech and Audio Processing; Perceptual/Human Audiovisual System Modeling

In Proceedings of the 16th International Joint Conference on e-Business and Telecommunications - Volume 1: SIGMAP, pages 280-289, 2019, Prague, Czech Republic

Authors: Gražina Korvel ¹, Olga Kurasova ¹ and Bożena Kostek ²

Affiliations: ¹ Institute of Data Science and Digital Technologies, Vilnius University, Akademijos str. 4, LT-04812, Vilnius, Lithuania; ² Audio Acoustics Laboratory, Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology, G. Narutowicza 11/12, 80-233 Gdańsk, Poland

Keyword(s): Speech Analysis and Synthesis, Lombard Effect, SISO (Single-Input and Single-Output) System, Sinusoidal Model.

Related Ontology Subjects/Areas/Topics: Multidimensional Signal Processing; Multimedia; Multimedia Signal Processing; Multimodal Signal Processing; Perceptual/Human Audiovisual System Modeling; Telecommunications

Abstract: Speech produced under the Lombard effect has been extensively studied in the context of speech recognition or speech enhancement. However, few studies have investigated the Lombard effect in the context of speech synthesis. The aim of this paper is to create mathematical models that allow for retaining the Lombard effect. These models could be used as the basis of a formant speech synthesizer. The proposed models are based on dividing the speech signal into harmonics and modeling them as the output of a SISO system whose transfer function poles are multiple and whose inputs vary in time. An analysis of the Lombard effect in the synthesized signal is performed on the noise residual. The synthesized signal residual is described by vectors of acoustic parameters related to the Lombard effect. To test the performance of the created models in various noise conditions, two classifiers are employed, namely kNN and Naive Bayes. For comparison of results, we created models of sinusoids based on frequency tracks. The results show that a model based on the residual sinewave sum demonstrates the possibility of retaining the Lombard effect. Finally, future work directions are outlined in the conclusions.
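
The abstract mentions splitting a signal into harmonics and analyzing what is left over as a residual. The following is a minimal sketch of that general idea only, using a plain sum-of-sinusoids fit. It is not the authors' SISO model, and every name and value in it (`fit_harmonics`, `synthesize`, `residual`, the sampling rate, the fundamental frequency, the test signal) is a hypothetical choice made for illustration. It also assumes the frame covers a whole number of fundamental periods, so that simple projection onto each harmonic is exact.

```python
import math

def fit_harmonics(frame, f0, n_harmonics, fs):
    """Estimate (cosine, sine) weights of each harmonic of f0 by projection.

    Assumption: the frame spans a whole number of periods of f0, so the
    harmonics are mutually orthogonal over the frame.
    """
    n = len(frame)
    coeffs = []
    for k in range(1, n_harmonics + 1):
        w = 2.0 * math.pi * k * f0 / fs
        a = 2.0 / n * sum(x * math.cos(w * i) for i, x in enumerate(frame))
        b = 2.0 / n * sum(x * math.sin(w * i) for i, x in enumerate(frame))
        coeffs.append((a, b))
    return coeffs

def synthesize(coeffs, f0, fs, n):
    """Rebuild the harmonic part of the signal as a sum of sinusoids."""
    out = []
    for i in range(n):
        s = 0.0
        for k, (a, b) in enumerate(coeffs, start=1):
            w = 2.0 * math.pi * k * f0 / fs
            s += a * math.cos(w * i) + b * math.sin(w * i)
        out.append(s)
    return out

def residual(frame, synth):
    """Residual: what the harmonic model does not explain."""
    return [x - y for x, y in zip(frame, synth)]

# Hypothetical test signal: two harmonics of 200 Hz plus a weak
# non-harmonic 300 Hz component standing in for the residual.
FS = 8000    # sampling rate in Hz (assumed)
F0 = 200.0   # fundamental frequency in Hz (assumed)
N = 400      # 10 whole periods of F0 at FS

frame = [
    math.sin(2.0 * math.pi * F0 * i / FS)
    + 0.5 * math.cos(2.0 * math.pi * 2.0 * F0 * i / FS)
    + 0.01 * math.sin(2.0 * math.pi * 300.0 * i / FS)
    for i in range(N)
]

coeffs = fit_harmonics(frame, F0, 3, FS)
synth = synthesize(coeffs, F0, FS, N)
res = residual(frame, synth)
residual_energy = sum(r * r for r in res)
```

In this toy setting the residual contains only the 300 Hz component, since it is orthogonal to the fitted harmonics over the frame. Real speech would need frame-wise pitch estimation and time-varying parameters, which this sketch does not attempt.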

CC BY-NC-ND 4.0

Paper citation in several formats:

Korvel, G., Kurasova, O. and Kostek, B. (2019). An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics. In Proceedings of the 16th International Joint Conference on e-Business and Telecommunications - SIGMAP; ISBN 978-989-758-378-0; ISSN 2184-3236, SciTePress, pages 280-289. DOI: 10.5220/0007854302800289

@conference{sigmap19,
author={Gražina Korvel and Olga Kurasova and Bożena Kostek},
title={An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics},
booktitle={Proceedings of the 16th International Joint Conference on e-Business and Telecommunications - SIGMAP},
year={2019},
pages={280-289},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007854302800289},
isbn={978-989-758-378-0},
issn={2184-3236},
}

TY - CONF

JO - Proceedings of the 16th International Joint Conference on e-Business and Telecommunications - SIGMAP
TI - An Attempt to Create Speech Synthesis Model That Retains Lombard Effect Characteristics
SN - 978-989-758-378-0
IS - 2184-3236
AU - Korvel, G.
AU - Kurasova, O.
AU - Kostek, B.
PY - 2019
SP - 280
EP - 289
DO - 10.5220/0007854302800289
PB - SciTePress