loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Rong Fu and Ian D. Benest

Affiliation: University of York, United Kingdom

Keyword(s): Speaker Diarization, Model Complexity Selection, Universal Background Model.

Related Ontology Subjects/Areas/Topics: Multimedia ; Multimedia Databases, Indexing, Recognition and Retrieval ; Multimedia Systems and Applications ; Telecommunications

Abstract: This paper describes an automatic speaker diarization system for natural, multi-speaker meeting conversations using one central microphone. It is based on the ICSI-SRI Fall 2004 diarization system (Wooters et al., 2004), but it has a number of significant modifications. The new system is robust to different acoustic environments - it requires neither pre-training models nor development sets to initialize the parameters. It determines the model complexity automatically. It adapts the segment model from a Universal Background Model (UBM), and uses the cross-likelihood ratio (CLR) instead of the Bayesian Information Criterion (BIC) for merging. Finally it uses an intra-cluster/inter-cluster ratio as the stopping criterion. Altogether this reduces the speaker diarization error rate from 25.36% to 21.37% compared to the baseline system (Wooters et al., 2004).

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.135.190.101

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Fu, R. and D. Benest, I. (2007). IMPROVEMENTS IN SPEAKER DIARIZATION SYSTEM. In Proceedings of the Second International Conference on Signal Processing and Multimedia Applications (ICETE 2007) - SIGMAP; ISBN 978-989-8111-13-5, SciTePress, pages 313-319. DOI: 10.5220/0002140703130319

@conference{sigmap07,
author={Rong Fu. and Ian {D. Benest}.},
title={IMPROVEMENTS IN SPEAKER DIARIZATION SYSTEM},
booktitle={Proceedings of the Second International Conference on Signal Processing and Multimedia Applications (ICETE 2007) - SIGMAP},
year={2007},
pages={313-319},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002140703130319},
isbn={978-989-8111-13-5},
}

TY - CONF

JO - Proceedings of the Second International Conference on Signal Processing and Multimedia Applications (ICETE 2007) - SIGMAP
TI - IMPROVEMENTS IN SPEAKER DIARIZATION SYSTEM
SN - 978-989-8111-13-5
AU - Fu, R.
AU - D. Benest, I.
PY - 2007
SP - 313
EP - 319
DO - 10.5220/0002140703130319
PB - SciTePress