Limitation of Frequency Domain Blind Source Separation for Convolutive Mixture of Speech

dc.contributor.author Shoko Araki en
dc.contributor.author Shoji Makino en
dc.contributor.author Tsuyoki Nishikawa en
dc.contributor.author Hiroshi Saruwatari en
dc.date.accessioned 2012-08-22T07:58:45Z en
dc.date.available 2012-08-22T07:58:45Z en
dc.date.issued 2001-04 en
dc.identifier.uri http://hdl.handle.net/10061/7995 en
dc.description HSC2001: IEEE International Workshop on Hands-Free Speech Communication, April 9-11, 2001, Kyoto, Japan. en
dc.description.abstract Despite several recent proposals for Blind Source Separation (BSS) of realistic acoustic signals, separation performance remains insufficient. In particular, performance is severely limited when the room impulse response is long. In this paper, we show that it is useless to be constrained by the condition P << T, where T is the FFT frame size and P is the length of the room impulse response. In our experiments, a frame size of 256 or 512 (32 or 64 ms at a sampling frequency of 8 kHz) is best even for long room reverberation times of TR = 150 and 300 ms. We also clarify the reason for the poor performance of BSS in highly reverberant environments: separation is achieved chiefly for the sound arriving from the direction of the jammer, because BSS cannot calculate the inverse of the room transfer function for both the target and jammer signals. en
dc.language.iso en en
dc.rights Copyright 2001 IEEE en
dc.title Limitation of Frequency Domain Blind Source Separation for Convolutive Mixture of Speech en
dc.type.nii Conference Paper en
dc.textversion Publisher en
dc.identifier.spage 91 en
dc.identifier.epage 94 en
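
The frame sizes quoted in the abstract can be checked with a little arithmetic. The sketch below is illustrative only and is not taken from the paper: the function names are hypothetical, and it assumes that the impulse response length P can be approximated as the reverberation time TR multiplied by the sampling frequency, which the record itself does not state.

```python
def frame_duration_ms(frame_size, fs_hz):
    """Duration in milliseconds of an FFT frame of `frame_size` samples."""
    return 1000.0 * frame_size / fs_hz


def reverb_length_samples(tr_ms, fs_hz):
    """Approximate impulse response length P in samples.

    Assumption (not from the record): P is taken as TR * fs.
    """
    return round(tr_ms * fs_hz / 1000.0)


FS_HZ = 8000  # sampling frequency given in the abstract

# Frame sizes reported as best in the abstract.
for frame_size in (256, 512):
    print(f"T = {frame_size} samples -> {frame_duration_ms(frame_size, FS_HZ)} ms")

# Reverberation times given in the abstract, compared with each frame size.
for tr_ms in (150, 300):
    p = reverb_length_samples(tr_ms, FS_HZ)
    for frame_size in (256, 512):
        print(f"TR = {tr_ms} ms -> P ~ {p} samples, P / T = {p / frame_size:.2f}")
```

Under that assumption, P is several times larger than T for both reported frame sizes, so the best-performing frames are far from satisfying P << T.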