loading...
Generating Expressive Summaries for Speech and Musical Audio using Self-Similarity Clues
Toronto, ON, Canada July 09-July 12
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICME.2006.2626752006 IEEE International Conference on ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Mustafa Sert, Middle East Technical University, Department of Electrical and Electronics Engineering, 06531 Ankara, TURKEY; Middle East Technical University, Department of Computer Engineering, 06531 Anka
Buyurman Baykal, Middle East Technical University, Department of Electrical and Electronics Engineering, 06531 Ankara, TURKEY; Middle East Technical University, Department of Computer Engineering, 06531 Anka
Adnan Yazici, Middle East Technical University, Department of Electrical and Electronics Engineering, 06531 Ankara, TURKEY; Middle East Technical University, Department of Computer Engineering, 06531 Anka
We present a novel algorithm for structural analysis of audio to detect repetitive patterns that are suitable for content-based audio information retrieval systems, since repetitive patterns can provide valuable information about the content of audio, such as a chorus or a concept. The Audio Spectrum Flatness (ASF) feature of the MPEG-7 standard, although not having been considered as much as other feature types, has been utilized and evaluated as the underlying feature set. Expressive summaries are chosen as the longest patterns by the k-means clustering algorithm. Proposed approach is evaluated on a test bed consisting of popular song and speech clips based on the ASF feature. The well known Mel Frequency Cepstral Coefficients (MFCCs) are also considered in the experiments for the evaluation of features. Experiments show that, all the repetitive patterns and their locations are obtained with the accuracy of 93% and 78% for music and speech, respectively.
Citation:
Mustafa Sert, Buyurman Baykal, Adnan Yazici, "Generating Expressive Summaries for Speech and Musical Audio using Self-Similarity Clues," icme, pp.941-944, 2006 IEEE International Conference on Multimedia and Expo, 2006
Usage of this product signifies your acceptance of the Terms of Use.


Suggestions