loading...
Polyphonic Audio Key Finding Using the Spiral Array CEG Algorithm
Amsterdam, Netherlands July 06-July 06
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICME.2005.15213502005 IEEE International Conference on ...
 This Article 
 
PDF
HTML
 
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
null Ching-Hua Chuan, Department of Computer Science, University of Southern California Viterbi School of Engineering, Los Angeles, CA, chinghuc@usc.edu
Key finding is an integral step in content-based music indexing and retrieval. In this paper, we present an O(n) real-time algorithm for determining key from polyphonic audio. We use the standard Fast Fourier Transform with a local maximum detection scheme to extract pitches and pitch strengths from polyphonic audio. Next, we use Chew's Spiral Array Center of Effect Generator (CEG) algorithm to determine the key from pitch strength information. We test the proposed system using Mozart's Symphonies. The test data is audio generated from MIDI source. The algorithm achieves a maximum correct key recognition rate of 96% within the first fifteen seconds, and exceeds 90% within the first three seconds. Starting from the extracted pitch strength information, we compare the CEG algorithm's performance to the classic Krumhansl-Schmuckler (K-S) probe tone profile method and Temperley's modified version of the K-S method. Correct key recognition rates for the K-S and modified K-S methods remain under 50% in the first three seconds, with maximum values of 80% and 87% respectively within the first fifteen seconds for the same test set. The CEG method consistently scores higher throughout the fifteen-second selections.
Citation:
null Ching-Hua Chuan, E. Chew, "Polyphonic Audio Key Finding Using the Spiral Array CEG Algorithm," icme, pp.21-24, 2005 IEEE International Conference on Multimedia and Expo, 2005
Usage of this product signifies your acceptance of the Terms of Use.