Technical Program

Paper Detail

Paper:	SS-6.3
Session:	Convolutive Blind Source Separation for Speech and Audio Signals
Time:	Wednesday, May 19, 16:10 - 16:30
Presentation:	Special Session Lecture
Topic:	Special Sessions: Convolutive Blind Source Separation for Speech and Audio Signals
Title:	CONVOLUTIVE BLIND SOURCE SEPARATION FOR MORE THAN TWO SOURCES IN THE FREQUENCY DOMAIN
Authors:	Hiroshi Sawada; NTT Corporation
	Ryo Mukai; NTT Corporation
	Shoko Araki; NTT Corporation
	Shoji Makino; NTT Corporation
Abstract:	Blind source separation (BSS) for convolutive mixtures can be efficiently achieved in the frequency domain, where independent component analysis is performed separately in each frequency bin. However, frequency-domain BSS involves a permutation problem, which is well known as a difficult problem, especially when the number of sources is large. This paper presents a method for solving the permutation problem, which works well even for many sources. The successful solution for the permutation problem highlights another problem with frequency-domain BSS that arises from the circularity of discrete frequency representation. This paper discusses the phenomena of the problem and presents a method for solving it. With these two methods, we can separate many sources with a practical execution time. Moreover, real-time processing is currently possible for up to three sources with our implementation.

Back

Home -||- Organizing Committee -||- Technical Committee -||- Technical Program -||- Plenaries
Paper Submission -||- Special Sessions -||- ITT -||- Paper Review -||- Exhibits -||- Tutorials
Information -||- Registration -||- Travel Insurance -||- Housing -||- Workshops

©2015 Conference Management Services, Inc. -||- email: webmaster@icassp2004.org -||- Last updated Wednesday, April 07, 2004