Authors: Pedro Cano ; Markus Koppenberger ; Sylvain Le Groux ; Perfecto Herrera ; Julien Ricard and Nicolas Wack

Affiliation: Universitat Pompeu Fabra, Spain

Keyword(s): MPEG7, content-based audio, sound effects, ontology management, WordNet, audio classification, audio asset management.

Related Ontology Subjects/Areas/Topics: Multimedia ; Multimedia Indexing and Retrieval ; Multimedia Signal Processing ; Telecommunications

Abstract: Sound producers create the sound that accompanies the image in cinema and video productions, as well as in spots and documentaries. Some sounds are recorded for the occasion. Many occasions, however, require the engineer to have access to massive libraries of music and sound effects. Of the three major facets of audio in post-production (music, speech and sound effects), this document focuses on sound effects (Sound FX or SFX). The main professional on-line sound-FX providers offer their collections using standard text-retrieval technologies. Library construction is an error-prone and labor-intensive task. Moreover, the ambiguity and informality of natural languages affect the quality of the search. The use of ontologies alleviates some of the ambiguity problems inherent to natural languages, yet it is very complicated to devise and maintain an ontology that accounts for the level of detail needed in a production-size sound effect management system. To address this problem we use WordNet, an ontology that organizes over 100,000 concepts of real-world knowledge; for example, it relates doors to locks, to wood and to the actions of opening, closing or knocking. However, a fundamental issue remains: sounds without captions are invisible to the users. Content-based audio tools offer perceptual ways of navigating the audio collections, such as “find similar sound”, even if unlabeled, or query-by-example, possibly restricting the search to a semantic subspace, such as “vehicles”. The proposed content-based technologies also allow semi-automatic sound annotation. We describe the integration of semantically enhanced management of metadata using WordNet together with content-based methods in a commercial sound effect management system.
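
The combination described in the abstract, a similarity search that can be restricted to a semantic subspace such as “vehicles”, can be illustrated with a minimal sketch. This is not the authors' system: the hard-coded `HYPERNYMS` table is a hypothetical stand-in for WordNet, and the file names, two-dimensional feature vectors and the functions `is_a` and `find_similar` are invented for illustration only.

```python
from math import dist

# Hypothetical toy hypernym table standing in for WordNet (child -> parent).
HYPERNYMS = {
    "car": "vehicle",
    "truck": "vehicle",
    "vehicle": "artifact",
    "door": "artifact",
}

# Hypothetical library: file name -> (caption concept or None, made-up features).
LIBRARY = {
    "car_pass.wav": ("car", (0.9, 0.1)),
    "truck_idle.wav": ("truck", (0.7, 0.3)),
    "door_slam.wav": ("door", (0.8, 0.2)),
    "untitled_07.wav": (None, (0.85, 0.15)),  # unlabeled sound
}


def is_a(concept, ancestor):
    """Walk up the hypernym chain to test whether concept falls under ancestor."""
    while concept is not None:
        if concept == ancestor:
            return True
        concept = HYPERNYMS.get(concept)
    return False


def find_similar(query_vec, subspace=None):
    """Rank sounds by Euclidean distance to query_vec.

    If subspace is given, keep only sounds whose caption concept falls
    under it; unlabeled sounds are then excluded.
    """
    candidates = [
        (name, vec)
        for name, (concept, vec) in LIBRARY.items()
        if subspace is None or is_a(concept, subspace)
    ]
    ranked = sorted(candidates, key=lambda item: dist(query_vec, item[1]))
    return [name for name, _ in ranked]
```

Called without a subspace, `find_similar((0.9, 0.1))` also ranks the unlabeled `untitled_07.wav`, which a caption-only text search would never surface. Called with `subspace="vehicles"`-style restriction, here `find_similar((0.9, 0.1), "vehicle")`, only the sounds captioned under that concept are ranked.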

CC BY-NC-ND 4.0

Paper citation in several formats:
Cano, P.; Koppenberger, M.; Le Groux, S.; Herrera, P.; Ricard, J. and Wack, N. (2004). KNOWLEDGE AND CONTENT-BASED AUDIO RETRIEVAL USING WORDNET. In Proceedings of the First International Conference on E-Business and Telecommunication Networks - Volume 3: ICETE; ISBN 972-8865-15-5; ISSN 2184-3236, SciTePress, pages 301-308. DOI: 10.5220/0001397503010308

@conference{icete04,
author={Pedro Cano and Markus Koppenberger and Sylvain {Le Groux} and Perfecto Herrera and Julien Ricard and Nicolas Wack},
title={KNOWLEDGE AND CONTENT-BASED AUDIO RETRIEVAL USING WORDNET},
booktitle={Proceedings of the First International Conference on E-Business and Telecommunication Networks - Volume 3: ICETE},
year={2004},
pages={301-308},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001397503010308},
isbn={972-8865-15-5},
issn={2184-3236},
}

TY - CONF

JO - Proceedings of the First International Conference on E-Business and Telecommunication Networks - Volume 3: ICETE
TI - KNOWLEDGE AND CONTENT-BASED AUDIO RETRIEVAL USING WORDNET
SN - 972-8865-15-5
IS - 2184-3236
AU - Cano, P.
AU - Koppenberger, M.
AU - Le Groux, S.
AU - Herrera, P.
AU - Ricard, J.
AU - Wack, N.
PY - 2004
SP - 301
EP - 308
DO - 10.5220/0001397503010308
PB - SciTePress