Technical Program

Paper Detail

Paper:MSP-P2.3
Session:Multimedia Systems and Applications
Time:Friday, May 21, 13:00 - 15:00
Presentation: Poster
Topic: Multimedia Signal Processing: Multimedia Database
Title: CONTENT-BASED RETRIEVAL OF MP3 SONGS BASED ON QUERY BY SINGING
Authors: Wen-Nung Lie; National Chung Cheng University 
 Chen-Kang Su; National Chung Cheng University 
Abstract: With the growing of multimedia in Internet, content analysis of multimedia plays an important role for humanistic management. In this paper, we investigate the content-based retrieval of MP3 songs based on the interface of query by singing. In our method, the MDCT spectral coefficients were directly used to represent the tonic characteristic of a short-term sound. This spectral profile is used for detailed matching between two audio segments. Perceptual features were also computed from MDCT coefficients for audio classification. Two pre-stages based on SVM and k-means classifications were used to remove incorrect (or noisy) segment candidates and speed up following matching process. On the other hand, the schemes of exponential key-scaling and time-warping techniques were developed to overcome key difference and tempo variation between different singers. Experiments show that the retrieving probability of our design can achieve up to 76 % among the top 5 out of a total of 114 excerpts in the database.
 
           Back


Home -||- Organizing Committee -||- Technical Committee -||- Technical Program -||- Plenaries
Paper Submission -||- Special Sessions -||- ITT -||- Paper Review -||- Exhibits -||- Tutorials
Information -||- Registration -||- Travel Insurance -||- Housing -||- Workshops

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004