loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: George Pikramenos 1 ; 2 ; Georgios Smyrnis 3 ; Ioannis Vernikos 4 ; 2 ; Thomas Konidaris 2 ; Evaggelos Spyrou 4 ; 2 and Stavros Perantonis 2

Affiliations: 1 Department of Informatics & Telecommunications, National Kapodistrian University of Athens, Athens, Greece ; 2 Institute of Informatics & Telecommunications, NSCR-“Demokritos”, Athens, Greece ; 3 School of Electrical & Computer Engineering, National Technical University of Athens, Athens, Greece ; 4 Department of Computer Science and Telecommunications, University of Thessaly, Lamia, Greece

Keyword(s): Sentiment Analysis, Speech Analysis, Bag-of-Visual-Words.

Abstract: Monitoring and analysis of human sentiments is currently one of the hottest research topics in the field of human-computer interaction, having many applications. However, in order to become practical in daily life, sentiment recognition techniques should analyze data collected in an unobtrusive way. For this reason, analyzing audio signals of human speech (as opposed to say biometrics) is considered key to potential emotion recognition systems. In this work, we expand upon previous efforts to analyze speech signals using computer vision techniques on their spectrograms. In particular, we utilize ORB descriptors on keypoints distributed on a regular grid over the spectrogram to obtain an intermediate representation. Firstly, a technique similar to Bag-of-Visual-Words (BoVW) is used, where a visual vocabulary is created by clustering keypoint descriptors, but instead a soft candidacy score is used to construct the histogram descriptors of the signal. Furthermore, a technique which take s into account the temporal structure of the spectrograms is examined, allowing for effective model regularization. Both of these techniques are evaluated in several popular emotion recognition datasets, with results indicating an improvement over the simple BoVW method. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.222.113.111

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Pikramenos, G.; Smyrnis, G.; Vernikos, I.; Konidaris, T.; Spyrou, E. and Perantonis, S. (2020). Sentiment Analysis from Sound Spectrograms via Soft BoVW and Temporal Structure Modelling. In Proceedings of the 9th International Conference on Pattern Recognition Applications and Methods - ICPRAM; ISBN 978-989-758-397-1; ISSN 2184-4313, SciTePress, pages 361-369. DOI: 10.5220/0009174503610369

@conference{icpram20,
author={George Pikramenos. and Georgios Smyrnis. and Ioannis Vernikos. and Thomas Konidaris. and Evaggelos Spyrou. and Stavros Perantonis.},
title={Sentiment Analysis from Sound Spectrograms via Soft BoVW and Temporal Structure Modelling},
booktitle={Proceedings of the 9th International Conference on Pattern Recognition Applications and Methods - ICPRAM},
year={2020},
pages={361-369},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0009174503610369},
isbn={978-989-758-397-1},
issn={2184-4313},
}

TY - CONF

JO - Proceedings of the 9th International Conference on Pattern Recognition Applications and Methods - ICPRAM
TI - Sentiment Analysis from Sound Spectrograms via Soft BoVW and Temporal Structure Modelling
SN - 978-989-758-397-1
IS - 2184-4313
AU - Pikramenos, G.
AU - Smyrnis, G.
AU - Vernikos, I.
AU - Konidaris, T.
AU - Spyrou, E.
AU - Perantonis, S.
PY - 2020
SP - 361
EP - 369
DO - 10.5220/0009174503610369
PB - SciTePress