Authors: Balaji Thoshkahna and K. R. Ramakrishnan

Affiliation: Indian Institute of Science, India

Keyword(s): Moore’s loudness model, Psychoacoustics, Onset detection, Polyphonic audio.

Related Ontology Subjects/Areas/Topics: Applications ; Audio and Speech Processing ; Digital Signal Processing ; Multimedia ; Multimedia Signal Processing ; Pattern Recognition ; Perceptual/Human Audiovisual System Modeling ; Software Engineering ; Telecommunications

Abstract: We propose an algorithm for sound onset detection that applies principles of psychoacoustics. A popular model of loudness perception in the human auditory system is used to compute a novelty function that allows for more robust detection of onsets. The psychoacoustic paradigm also allows us to define thresholds for the novelty function that are both physically and perceptually meaningful, and hence easy to adjust according to the application. The algorithm performs well, with an overall detection accuracy of 86% for monophonic audio and 82% for polyphonic audio.

CC BY-NC-ND 4.0

Paper citation in several formats:
Thoshkahna, B. and Ramakrishnan, K. R. (2009). A PSYCHOACOUSTICALLY MOTIVATED SOUND ONSET DETECTION ALGORITHM FOR POLYPHONIC AUDIO. In Proceedings of the International Conference on Signal Processing and Multimedia Applications (ICETE 2009) - SIGMAP; ISBN 978-989-674-007-8, SciTePress, pages 94-99. DOI: 10.5220/0002238400940099

@conference{sigmap09,
author={Balaji Thoshkahna and K. R. Ramakrishnan},
title={A PSYCHOACOUSTICALLY MOTIVATED SOUND ONSET DETECTION ALGORITHM FOR POLYPHONIC AUDIO},
booktitle={Proceedings of the International Conference on Signal Processing and Multimedia Applications (ICETE 2009) - SIGMAP},
year={2009},
pages={94-99},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002238400940099},
isbn={978-989-674-007-8},
}

TY - CONF
JO - Proceedings of the International Conference on Signal Processing and Multimedia Applications (ICETE 2009) - SIGMAP
TI - A PSYCHOACOUSTICALLY MOTIVATED SOUND ONSET DETECTION ALGORITHM FOR POLYPHONIC AUDIO
SN - 978-989-674-007-8
AU - Thoshkahna, B.
AU - Ramakrishnan, K. R.
PY - 2009
SP - 94
EP - 99
DO - 10.5220/0002238400940099
PB - SciTePress