Authors: Quentin Labourey 1 ; Olivier Aycard 2 ; Denis Pellerin 3 and Michele Rombaut 3

Affiliations: 1 LIG and GIPSA-lab, France ; 2 LIG, France ; 3 GIPSA-lab, France

Keyword(s): Audiovisual Data Fusion, Skin Detection, Sound Source Tracking, Talking Face Tracking.

Related Ontology Subjects/Areas/Topics: Computer Vision, Visualization and Computer Graphics ; Image and Video Analysis ; Visual Attention and Image Saliency

Abstract: This paper presents a method for tracking human speakers from audio and video data, applied to conversation tracking with a robot. Audiovisual data fusion is performed in a two-step process. First, detection is performed independently on each modality: face detection based on skin color on the video data, and sound source localization based on the time delay of arrival on the audio data. Second, the results of these detection processes are fused using an adaptation of the Bayesian filter to detect the speaker. The robot is able to detect the face of the person talking and to detect a new speaker in a conversation.
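
As an illustration of the fusion step described in the abstract, the sketch below runs a discrete Bayesian filter over a small set of azimuth bins. It is not the authors' implementation, and the abstract does not specify their adaptation of the filter. Everything here is an assumption made for illustration: the bin discretization, the names `predict`, `update`, `track`, `face_likelihood` and `gaussian_likelihood`, the `p_stay` transition model, and the likelihood values. The face bins stand in for the output of a skin-color face detector, and the audio bin stands in for a direction derived from the time delay of arrival.

```python
import math


def predict(belief, p_stay=0.9):
    """Prediction step (assumed model): the speaker keeps the floor with
    probability p_stay; otherwise the turn passes to any bin uniformly."""
    n = len(belief)
    return [p_stay * b + (1.0 - p_stay) / n for b in belief]


def face_likelihood(n_bins, face_bins, hit=0.9, miss=0.1):
    """Video likelihood: high in bins where a face was detected."""
    return [hit if i in face_bins else miss for i in range(n_bins)]


def gaussian_likelihood(n_bins, center, sigma=1.0):
    """Audio likelihood: unnormalized Gaussian around the localized bin."""
    return [math.exp(-0.5 * ((i - center) / sigma) ** 2) for i in range(n_bins)]


def update(belief, *likelihoods):
    """Correction step: multiply the belief by each likelihood, then normalize."""
    posterior = list(belief)
    for lik in likelihoods:
        posterior = [p * l for p, l in zip(posterior, lik)]
    total = sum(posterior)
    if total == 0.0:
        # Degenerate case: fall back to a uniform belief.
        return [1.0 / len(belief)] * len(belief)
    return [p / total for p in posterior]


def track(observations, n_bins=12, p_stay=0.9):
    """Return the most probable speaker bin for each frame.

    Each observation is (face_bins, audio_bin); audio_bin is None when
    no sound source was localized in that frame.
    """
    belief = [1.0 / n_bins] * n_bins
    estimates = []
    for face_bins, audio_bin in observations:
        belief = predict(belief, p_stay)
        likelihoods = [face_likelihood(n_bins, face_bins)]
        if audio_bin is not None:
            likelihoods.append(gaussian_likelihood(n_bins, audio_bin))
        belief = update(belief, *likelihoods)
        estimates.append(max(range(n_bins), key=belief.__getitem__))
    return estimates


if __name__ == "__main__":
    faces = {3, 8}  # two faces visible in every frame
    frames = [(faces, 3)] * 3 + [(faces, 8)] * 3  # the speaker changes midway
    print(track(frames))
```

In this toy setup, the uniform term in `predict` keeps a small probability mass on every bin, which is what lets the estimate move to the second face once the audio evidence shifts. A filter with no such term would stay locked on the first speaker.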

CC BY-NC-ND 4.0

Paper citation in several formats:
Labourey, Q.; Aycard, O.; Pellerin, D. and Rombaut, M. (2014). Audiovisual Data Fusion for Successive Speakers Tracking. In Proceedings of the 9th International Conference on Computer Vision Theory and Applications (VISIGRAPP 2014) - Volume 2: VISAPP; ISBN 978-989-758-003-1; ISSN 2184-4321, SciTePress, pages 696-701. DOI: 10.5220/0004852506960701

@conference{visapp14,
author={Quentin Labourey and Olivier Aycard and Denis Pellerin and Michele Rombaut},
title={Audiovisual Data Fusion for Successive Speakers Tracking},
booktitle={Proceedings of the 9th International Conference on Computer Vision Theory and Applications (VISIGRAPP 2014) - Volume 2: VISAPP},
year={2014},
pages={696-701},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004852506960701},
isbn={978-989-758-003-1},
issn={2184-4321},
}

TY - CONF
JO - Proceedings of the 9th International Conference on Computer Vision Theory and Applications (VISIGRAPP 2014) - Volume 2: VISAPP
TI - Audiovisual Data Fusion for Successive Speakers Tracking
SN - 978-989-758-003-1
IS - 2184-4321
AU - Labourey, Q.
AU - Aycard, O.
AU - Pellerin, D.
AU - Rombaut, M.
PY - 2014
SP - 696
EP - 701
DO - 10.5220/0004852506960701
PB - SciTePress