Sound Research WIKINDX

WIKINDX Resources

Martinez Hernandez, E., Adiloglu, K., Annies, R., Purwins, H., & Obermayer, K. 2007, September 28–29, Perceptual representations for classification of everyday sounds. Paper presented at Audio Mostly 2007, Röntgenbau, Ilmenau, Germany. 
Added by: Mark Grimshaw-Aagaard (08/05/2008, 07:48)   Last edited by: Mark Grimshaw-Aagaard (08/05/2008, 07:48)
Resource type: Proceedings Article
BibTeX citation key: MartinezHerbandez2007a
Email resource to friend
View all bibliographic details
Categories: General
Keywords: Perception
Creators: Adiloglu, Annies, Martinez Hernandez, Obermayer, Purwins
Publisher: Audio Mostly (Röntgenbau, Ilmenau, Germany)
Collection: Audio Mostly 2007
Views: 9/654
Abstract
In the recognition and classification of sounds, extracting perceptually and biologically relevant features yields much better results than the standard low-level methods (e.g zero-crossings, roll-off, centroid, energy, etc.). Gamma-tone filters are biologically relevant, as they simulate the motion of the basilar membrane. The representation techniques that we propose in this paper make use of the gamma-tone filters, combined with the Hilbert transform or hair cell models, to represent everyday sounds. Different combinations of these methods have been evaluated and compared in perceptual classification tasks to classify everyday sounds like doors and footsteps by using support vector classification. After calculating the features a feature integration technique is applied, in order to reduce the high dimensionality of the features. The everyday sounds are obtained from the commercial sound database “SoundIdeas”. However, perceptual labels assigned by human listeners are considered rather than the labels delivered by the actual soundsource. These perceptual classification tasks are performed to classify the everyday sounds according to their function, like classifying the door sounds as ”opening” and” closing” doors. In this paper, among the gamma-tone-based representation techniques, other spectral and psycho-acoustical representation techniques are also evaluated. The experiments show that the gamma-tone-based representation techniques are superior for perceptual classification tasks of everyday sounds. The gamma-tone filters combined with a inner hair cell model and with the Hilbert transform yield the most accurate results in classifying everyday sounds.
Added by: Mark Grimshaw-Aagaard  Last edited by: Mark Grimshaw-Aagaard
WIKINDX 6.8.2 | Total resources: 1301 | Username: -- | Bibliography: WIKINDX Master Bibliography | Style: American Psychological Association (APA)