Technical Program

Paper Detail

Paper:AE-P6.10
Session:Audio for Multimedia and Networks
Time:Friday, May 21, 13:00 - 15:00
Presentation: Poster
Topic: Audio and Electroacoustics: Audio for Multimedia
Title: AUDIO SEGMENTATION BASED ON MULTI-SCALE AUDIO CLASSIFICATION
Authors: Yibin Zhang; Tsinghua University 
 Jie Zhou; Tsinghua University 
Abstract: Content-based audio segmentation plays an important role in multimedia applications. In order to segment accurately and on-line, most conventional algorithms are based on small-scale feature classification and always result in a high false alarm rate. Our experimental results show that large-scale audio can be more easily classified than small ones. According to this fact, we present a novel multi-scale framework for audio segmentation. First, a rough segmentation step based on large-scale classification is taken to ensure the integrality of the content of segments, which can avoid the consecutive audio belonging to the same kind being segmented into different pieces. Then a subtle segmentation step is taken to further locate the segmentation points for the boundary areas computed by the rough segmentation step. Experimental results show that a low false alarm rate can be achieved while preserving a low missing rate.
 
           Back


Home -||- Organizing Committee -||- Technical Committee -||- Technical Program -||- Plenaries
Paper Submission -||- Special Sessions -||- ITT -||- Paper Review -||- Exhibits -||- Tutorials
Information -||- Registration -||- Travel Insurance -||- Housing -||- Workshops

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004