loading
Documents

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Quentin Labourey 1 ; Olivier Aycard 2 ; Denis Pellerin 3 and Michele Rombaut 3

Affiliations: 1 LIG and GIPSA-lab, France ; 2 LIG, France ; 3 GIPSA-lab, France

ISBN: 978-989-758-003-1

Keyword(s): Audiovisual Data Fusion, Skin Detection, Sound Source Tracking, Talking Face Tracking.

Related Ontology Subjects/Areas/Topics: Computer Vision, Visualization and Computer Graphics ; Image and Video Analysis ; Visual Attention and Image Saliency

Abstract: In this paper, a human speaker tracking method on audio and video data is presented. It is applied to conversation tracking with a robot. Audiovisual data fusion is performed in a two-steps process. Detection is performed independently on each modality: face detection based on skin color on video data and sound source localization based on the time delay of arrival on audio data. The results of those detection processes are then fused thanks to an adaptation of bayesian filter to detect the speaker. The robot is able to detect the face of the talking person and to detect a new speaker in a conversation.

PDF ImageFull Text

Download
Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 54.226.58.177

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Labourey Q., Aycard O., Pellerin D. and Rombaut M. (2014). Audiovisual Data Fusion for Successive Speakers Tracking.In Proceedings of the 9th International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2014) ISBN 978-989-758-003-1, pages 696-701. DOI: 10.5220/0004852506960701

@conference{visapp14,
author={Quentin Labourey and Olivier Aycard and Denis Pellerin and Michele Rombaut},
title={Audiovisual Data Fusion for Successive Speakers Tracking},
booktitle={Proceedings of the 9th International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2014)},
year={2014},
pages={696-701},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004852506960701},
isbn={978-989-758-003-1},
}

TY - CONF

JO - Proceedings of the 9th International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2014)
TI - Audiovisual Data Fusion for Successive Speakers Tracking
SN - 978-989-758-003-1
AU - Labourey Q.
AU - Aycard O.
AU - Pellerin D.
AU - Rombaut M.
PY - 2014
SP - 696
EP - 701
DO - 10.5220/0004852506960701

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.