Sound Research WIKINDX

WIKINDX Resources

Bakker, E. M., & Lewis, M. S. (2002). Semantic video retrieval using audio analysis. Lecture Notes in Computer Science, 2383, 271–277. 
Added by: sirfragalot (04/20/2005 04:19:07 PM)   
Resource type: Journal Article
BibTeX citation key: Bakker2002
View all bibliographic details
Categories: Typologies/Taxonomies
Keywords: Semantic categorization, Silence
Creators: Bakker, Lewis
Collection: Lecture Notes in Computer Science
Views: 6/678
Abstract
Semantic understanding of video is an important frontier in content based retrieval. In the research literature, significant attention has been given to the visual aspect of video, however, relatively little work directly uses audio content for video retrieval. Our paper gives an overview of our current research directions in semantic video retrieval using audio content. We discuss the effectiveness of classifying audio into semantic categories by combining both global and local audio features based in the frequency spectrum. Furthermore, we introduce two novel features called Frequency Spectrum Differentials (FSD), and
Differential Swap Rate (DSR), that both model the shape of the spectrum.
Added by: sirfragalot  
Notes
Searching video clips on the semantic properties of audio in those clips. If by semantic they intedn meaning, then their technique is not really semantic but rather one based on the means of sound production. Their categories include type of musical instrument, explosions speech etc.
Added by: sirfragalot  
WIKINDX 6.4.12 | Total resources: 1102 | Username: -- | Bibliography: WIKINDX Master Bibliography | Style: American Psychological Association (APA)


PHP execution time: 0.13442 s
SQL execution time: 0.08497 s
TPL rendering time: 0.00394 s
Total elapsed time: 0.22333 s
Peak memory usage: 9.2642 MB
Memory at close: 9.1047 MB
Database queries: 56