IMPROVEMENTS IN SPEAKER DIARIZATION SYSTEM

Rong Fu; Ian D. Benest

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

IMPROVEMENTS IN SPEAKER DIARIZATION SYSTEM

Topics: Multimedia Databases, Indexing, Recognition and Retrieval

In Proceedings of the Second International Conference on Signal Processing and Multimedia Applications - Volume 0ICETE, 313-319, 2007 , Barcelona, Spain

Authors: Rong Fu and Ian D. Benest

Affiliation: University of York, United Kingdom

Keyword(s): Speaker Diarization, Model Complexity Selection, Universal Background Model.

Related Ontology Subjects/Areas/Topics: Multimedia ; Multimedia Databases, Indexing, Recognition and Retrieval ; Multimedia Systems and Applications ; Telecommunications

Abstract: This paper describes an automatic speaker diarization system for natural, multi-speaker meeting conversations using one central microphone. It is based on the ICSI-SRI Fall 2004 diarization system (Wooters et al., 2004), but it has a number of significant modifications. The new system is robust to different acoustic environments - it requires neither pre-training models nor development sets to initialize the parameters. It determines the model complexity automatically. It adapts the segment model from a Universal Background Model (UBM), and uses the cross-likelihood ratio (CLR) instead of the Bayesian Information Criterion (BIC) for merging. Finally it uses an intra-cluster/inter-cluster ratio as the stopping criterion. Altogether this reduces the speaker diarization error rate from 25.36% to 21.37% compared to the baseline system (Wooters et al., 2004).

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 3.135.190.101

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Fu, R. and D. Benest, I. (2007). IMPROVEMENTS IN SPEAKER DIARIZATION SYSTEM. In Proceedings of the Second International Conference on Signal Processing and Multimedia Applications (ICETE 2007) - SIGMAP; ISBN 978-989-8111-13-5, SciTePress, pages 313-319. DOI: 10.5220/0002140703130319

@conference{sigmap07,
author={Rong Fu. and Ian {D. Benest}.},
title={IMPROVEMENTS IN SPEAKER DIARIZATION SYSTEM},
booktitle={Proceedings of the Second International Conference on Signal Processing and Multimedia Applications (ICETE 2007) - SIGMAP},
year={2007},
pages={313-319},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002140703130319},
isbn={978-989-8111-13-5},
}

TY - CONF

JO - Proceedings of the Second International Conference on Signal Processing and Multimedia Applications (ICETE 2007) - SIGMAP
TI - IMPROVEMENTS IN SPEAKER DIARIZATION SYSTEM
SN - 978-989-8111-13-5
AU - Fu, R.
AU - D. Benest, I.
PY - 2007
SP - 313
EP - 319
DO - 10.5220/0002140703130319
PB - SciTePress