Model Smoothing using Virtual Adversarial Training for Speech Emotion Estimation using Spontaneity

Toyoaki Kuwahara, Ryohei Orihara, Yuichi Sei, Yasuyuki Tahara, Akihiko Ohsuga

Abstract

Speech-based emotion estimation increases accuracy through the development of deep learning. However, most emotion estimation using deep learning requires supervised learning, and it is difficult to obtain large datasets used for training. In addition, if the training data environment and the actual data environment are significantly different, the problem is that the accuracy of emotion estimation is reduced. Therefore, in this study, to solve these problems, we propose a emotion estimation model using virtual adversarial training (VAT), a semi-supervised learning method that improves the robustness of the model. Furthermore, research on the spontaneity of speech has progressed year by year, and recent studies have shown that the accuracy of emotion classification is improved when spontaneity is taken into account. We would like to investigate the effect of the spontaneity in a cross-language situation. First, VAT hyperparameters were first set by a preliminary experiment using a single corpus. Next, the robustness of the model generated by the evaluation experiment by the cross corpus was shown. Finally, we evaluate the accuracy of emotion estimation by considering spontaneity and showed improvement in the accuracy of the model using VAT by considering spontaneity.

Download


Paper Citation


in Harvard Style

Kuwahara T., Orihara R., Sei Y., Tahara Y. and Ohsuga A. (2020). Model Smoothing using Virtual Adversarial Training for Speech Emotion Estimation using Spontaneity.In Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART, ISBN 978-989-758-395-7, pages 570-577. DOI: 10.5220/0008958405700577


in Bibtex Style

@conference{icaart20,
author={Toyoaki Kuwahara and Ryohei Orihara and Yuichi Sei and Yasuyuki Tahara and Akihiko Ohsuga},
title={Model Smoothing using Virtual Adversarial Training for Speech Emotion Estimation using Spontaneity},
booktitle={Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART,},
year={2020},
pages={570-577},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0008958405700577},
isbn={978-989-758-395-7},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 12th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART,
TI - Model Smoothing using Virtual Adversarial Training for Speech Emotion Estimation using Spontaneity
SN - 978-989-758-395-7
AU - Kuwahara T.
AU - Orihara R.
AU - Sei Y.
AU - Tahara Y.
AU - Ohsuga A.
PY - 2020
SP - 570
EP - 577
DO - 10.5220/0008958405700577