
Authors: Shi-wook Lee 1 ; Hiroaki Kojima 1 ; Kazuyo Tanaka 2 and Yoshiaki Itoh 3

Affiliations: 1 National Institute of Advanced Industrial Science and Technology (AIST), Japan ; 2 Tsukuba University, Japan ; 3 Iwate Prefectural University, Japan

Keyword(s): Speech Recognition, Spoken Term Detection, Probabilistic Similarity, Likelihood Ratio, Gaussian Mixture Models.

Related Ontology Subjects/Areas/Topics: Applications ; Audio and Speech Processing ; Digital Signal Processing ; Multimedia ; Multimedia Signal Processing ; Pattern Recognition ; Software Engineering ; Telecommunications

Abstract: In this paper, we investigate the use of probabilistic similarity and the likelihood ratio for spoken term detection. The objective of spoken term detection is to rank retrieved spoken terms according to their distance from a query. First, we evaluate several probabilistic similarity functions for use as a sophisticated distance measure. In particular, we investigate probabilistic similarity for Gaussian mixture models using closed-form solutions and a pseudo-sampling approximation of the Kullback–Leibler divergence. We then propose additive scoring factors based on the likelihood ratio of each individual subword. An experimental evaluation demonstrates that using probabilistic similarity functions and applying the likelihood ratio improves detection performance.
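
As a rough illustration of the two kinds of quantity the abstract mentions, the sketch below computes the closed-form Kullback–Leibler divergence between two univariate Gaussians and a sampling-based estimate of the divergence between two Gaussian mixture models. It is not the authors' implementation: the function names, the univariate restriction, the `(weight, mean, variance)` representation of a mixture, and the use of plain Monte Carlo sampling in place of the paper's pseudo-sampling approximation are all assumptions made for this example.

```python
import math
import random


def kl_gauss(mu_p, var_p, mu_q, var_q):
    """Closed-form KL(p || q) for two univariate Gaussians."""
    return 0.5 * (
        math.log(var_q / var_p) + (var_p + (mu_p - mu_q) ** 2) / var_q - 1.0
    )


def gmm_logpdf(x, gmm):
    """Log density of a univariate GMM given as [(weight, mean, variance), ...]."""
    terms = [
        math.log(w) - 0.5 * (math.log(2.0 * math.pi * var) + (x - mu) ** 2 / var)
        for w, mu, var in gmm
    ]
    # Log-sum-exp over components for numerical stability.
    m = max(terms)
    return m + math.log(sum(math.exp(t - m) for t in terms))


def gmm_sample(gmm, rng):
    """Draw one sample: pick a component by weight, then sample from it."""
    _, mu, var = rng.choices(gmm, weights=[c[0] for c in gmm])[0]
    return rng.gauss(mu, math.sqrt(var))


def kl_gmm_monte_carlo(p, q, n_samples=20000, seed=0):
    """Estimate KL(p || q) as the mean of log p(x) - log q(x) for x drawn from p.

    Plain Monte Carlo is used here as a stand-in (an assumption of this
    sketch); KL between GMMs has no closed form in general.
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_samples):
        x = gmm_sample(p, rng)
        total += gmm_logpdf(x, p) - gmm_logpdf(x, q)
    return total / n_samples
```

With single-component mixtures the estimate should approach the closed-form value, which gives a quick sanity check: `kl_gauss(0.0, 1.0, 1.0, 1.0)` is exactly 0.5, and `kl_gmm_monte_carlo([(1.0, 0.0, 1.0)], [(1.0, 1.0, 1.0)])` should land close to it.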

CC BY-NC-ND 4.0


Paper citation in several formats:
Lee, S.; Kojima, H.; Tanaka, K. and Itoh, Y. (2013). Experimental Evaluation of Probabilistic Similarity for Spoken Term Detection. In Proceedings of the 2nd International Conference on Pattern Recognition Applications and Methods - ICPRAM; ISBN 978-989-8565-41-9; ISSN 2184-4313, SciTePress, pages 441-446. DOI: 10.5220/0004264304410446

@conference{icpram13,
author={Shi{-}wook Lee and Hiroaki Kojima and Kazuyo Tanaka and Yoshiaki Itoh},
title={Experimental Evaluation of Probabilistic Similarity for Spoken Term Detection},
booktitle={Proceedings of the 2nd International Conference on Pattern Recognition Applications and Methods - ICPRAM},
year={2013},
pages={441--446},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004264304410446},
isbn={978-989-8565-41-9},
issn={2184-4313},
}

TY - CONF
JO - Proceedings of the 2nd International Conference on Pattern Recognition Applications and Methods - ICPRAM
TI - Experimental Evaluation of Probabilistic Similarity for Spoken Term Detection
SN - 978-989-8565-41-9
IS - 2184-4313
AU - Lee, S.
AU - Kojima, H.
AU - Tanaka, K.
AU - Itoh, Y.
PY - 2013
SP - 441
EP - 446
DO - 10.5220/0004264304410446
PB - SciTePress