Authors: Zhihao Zhang and Jinlong Lin

Affiliation: Peking University, China

Keyword(s): Voice activity detection, Pitch, Sub-band energy criteria.

Related Ontology Subjects/Areas/Topics: Applications ; Audio and Speech Processing ; Digital Signal Processing ; Multimedia ; Multimedia Signal Processing ; Pattern Recognition ; Software Engineering ; Telecommunications

Abstract: A new Voice Activity Detection (VAD) method is proposed that tracks varying background noise and is robust in both stationary and variable noise environments. Many previous VAD methods assume that the background contains only certain kinds of noise, so they cannot efficiently handle the noise encountered in practical applications. In the proposed approach, determinate speech, determinate noise, and potential speech regions are defined. The first two regions are located using extracted pitch contour information, and the ambiguous (potential speech) region is then resolved using sub-band energy thresholds that are updated from the frequency-domain representation of the determinate noise. Experiments provide an exhaustive comparison with three standard VAD methods: G.729B, ETSI AFE, and AMR. The results show that our approach is more robust than the others under real conditions.
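
The two-stage idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the decision rules, so everything specific here is an assumption made for illustration. In particular, the rule that a pitched frame is determinate speech, the rule that a run of at least `min_noise_run` unpitched frames is determinate noise, the fixed `margin` multiplier, and the `min_bands` vote are all hypothetical. The thresholds are also computed once rather than updated over time as the paper describes. Inputs are assumed to be a per-frame pitch estimate (0 when no pitch is found) and per-frame sub-band energies.

```python
from typing import List, Sequence

SPEECH, NOISE, POTENTIAL = "speech", "noise", "potential"


def label_regions(pitch_hz: Sequence[float], min_noise_run: int = 3) -> List[str]:
    """Stage 1 (assumed rule): pitched frames are determinate speech, long
    unpitched runs are determinate noise, short unpitched runs are potential speech."""
    labels: List[str] = []
    i, n = 0, len(pitch_hz)
    while i < n:
        if pitch_hz[i] > 0:
            labels.append(SPEECH)
            i += 1
            continue
        j = i
        while j < n and pitch_hz[j] <= 0:
            j += 1  # extend the run of unpitched frames
        run = j - i
        labels.extend([NOISE if run >= min_noise_run else POTENTIAL] * run)
        i = j
    return labels


def subband_thresholds(energies: Sequence[Sequence[float]],
                       labels: Sequence[str],
                       margin: float = 2.0) -> List[float]:
    """Per-band threshold = margin * mean energy over determinate-noise frames
    (hypothetical choice; the paper's update rule is not given in the abstract)."""
    noise = [e for e, lab in zip(energies, labels) if lab == NOISE]
    if not noise:
        raise ValueError("no determinate-noise frames to estimate thresholds from")
    n_bands = len(noise[0])
    return [margin * sum(f[b] for f in noise) / len(noise) for b in range(n_bands)]


def resolve_potential(energies: Sequence[Sequence[float]],
                      labels: Sequence[str],
                      thresholds: Sequence[float],
                      min_bands: int = 1) -> List[str]:
    """Stage 2: a potential frame becomes speech if at least min_bands
    sub-bands exceed their noise-derived threshold, otherwise noise."""
    out: List[str] = []
    for e, lab in zip(energies, labels):
        if lab == POTENTIAL:
            hits = sum(1 for x, t in zip(e, thresholds) if x > t)
            lab = SPEECH if hits >= min_bands else NOISE
        out.append(lab)
    return out
```

A caller would run `label_regions` on the pitch track, pass the resulting labels to `subband_thresholds`, and then call `resolve_potential` to obtain a final speech/noise decision per frame.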

CC BY-NC-ND 4.0

Paper citation in several formats:
Zhang, Z. and Lin, J. (2009). ROBUST VOICE ACTIVITY DETECTION BASED ON PITCH AND SUB-BAND ENERGY. In Proceedings of the International Conference on Signal Processing and Multimedia Applications (ICETE 2009) - SIGMAP; ISBN 978-989-674-007-8, SciTePress, pages 44-48. DOI: 10.5220/0002221000440048

@conference{sigmap09,
author={Zhihao Zhang and Jinlong Lin},
title={ROBUST VOICE ACTIVITY DETECTION BASED ON PITCH AND SUB-BAND ENERGY},
booktitle={Proceedings of the International Conference on Signal Processing and Multimedia Applications (ICETE 2009) - SIGMAP},
year={2009},
pages={44-48},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002221000440048},
isbn={978-989-674-007-8},
}

TY - CONF
JO - Proceedings of the International Conference on Signal Processing and Multimedia Applications (ICETE 2009) - SIGMAP
TI - ROBUST VOICE ACTIVITY DETECTION BASED ON PITCH AND SUB-BAND ENERGY
SN - 978-989-674-007-8
AU - Zhang, Z.
AU - Lin, J.
PY - 2009
SP - 44
EP - 48
DO - 10.5220/0002221000440048
PB - SciTePress