Multi-Channel Speech Enhancement
Multi-Channel Speech Enhancement
Multi-Channel Speech Enhancement
enhancement
Chunjian Li
DICOM, Aalborg University
3/24/2006
Dual-channel spectral
subtraction
- Hanson and Wong, ICASSP84.
3/24/2006
The method
S ( f )
S( f )
1
a
e j ( f )
S( f )
2
N ( f ) 1
N( f )
N ( f )
a
2
cos()
1
a
N ( f )
a
3/24/2006
Comments
3/24/2006
3/24/2006
Signal model
3/24/2006
Signal estimation
The estimated signal:
s( n) y (n) d1 (n)
M 1
d1 (n) h(i )d 2 (n i )
i 0
M 1
3/24/2006
Signal estimation
The minimization can be solved by applying the orthogonality principle:
M
This can be solved in the same way as solving the normal equations.
But it is usually solved by sequential algorithms such as the LMS
algorithm. The advantages of the LMS are:
-No matrix inversion, low complexity
-Fully adaptive, suitable to non-stationary signal and noise
-Low delay
3/24/2006
LMS
-It is a sequential, gradient descent minimization method,
- The estimate of the weights is updated each time a new sample
is available:
h k h k 1 k g
Where the element of the gradient vector:
g
3/24/2006
( )
M 1
LMS
Or, in matrix form:
g 2(ryd 2 R d 2 h )
The most important trick is, in this sequential implementation, to
approximate the correlation matrix and cross-correlation vector by
The instantaneous estimates.
d dH 2
R
d2
2
ryd 2 d 2 y (n)
3/24/2006
LMS
The step size is often chosen empirically, as long as the following
condition is satisfied for stability reason:
max
3/24/2006
Comments
Eliminating cross-talk
Cross-talk: If the signal is also captured in the reference channel, the ANC
will suppress part of the signal. Cross-talk can be reduced by employing
two adaptive filter within a feedback loop.
3/24/2006
Beamforming
Planar wave
Uniform linear array
Sensors responses are identical and
LTI
Sensors are omni directional
One parameter to estimate: DOA
3/24/2006
ULA
3/24/2006
ULA
The signal model:
y (t ) a( ) s (t ) e(t )
where the array transfer vector :
a( ) 1 e
jc 2
... e
jc m T
Where m is the delay with reference to the first sensor, and c is the
center frequency of the signal. By defining the spatial frequency as:
s c
d sin
c
a( ) 1 e
3/24/2006
j s
... e
j ( m 1) s T
ULA
3/24/2006
Spatial filtering
3/24/2006
Spatial filtering
3/24/2006
3/24/2006
Restrictions to beamforming
MIMO systems
Spatial processing techniques with no
knowledge of array geometry
Invisible beam
Arbitrarily high spatial resolution
Do not depend on signal frequency
Spatial noise is not assumed to be white
Not a spatial sampling system
3/24/2006
Solutions to BSS
[2] A. Hyvarinen, J. Karhunen, and E. Oja, Independent Component Analysis, John Wiley & Sons, Inc. 2001
[3] H. Attias, Independent factor analysis, Neural Computation, 1999.
3/24/2006
Independent component
analysis (ICA)
Instantaneous mixing
The number of sensors is greater than
or equal to the number of sources
No system noise
The sources (components) are
independent of each other
The sources are non-Gaussian
processes
3/24/2006
ICA model
Cocktail party problem. Three sources, three sensors:
x1 (t ) a11 s1 (t ) a12 s2 (t ) a13 s3 (t )
x (t ) a s (t ) a s (t ) a s (t )
22 2
23 3
2 12 1
x As
Neither s nor A are known. Can not be solved by linear algebra.
If the sources are independent non-Gaussian, the A matrix can
be found by maximizing the non-Gaussianity of the sources.
3/24/2006
Contrast function
An iterative gradient method. First initialize the A matrix.
If the mixing matrix A is square and non-singular, move it to the left:
A 1x s
Calculate the non-Gaussianity of s, and find the next estimate of A that
gives a higher non-Gaussianity. Iterate until convergence.
The contrast function is the objective function to maximize or minimize.
3/24/2006
Maximizing non-Gaussianity
Non-Gaussian is independent
Measuring non-Gaussianity
- by kurtosis
- by negentropy
3/24/2006
ICA methods
3/24/2006
Extensions to ICA
Noisy ICA
ICA with non-square mixing matrix
Independent Factor Analysis
Convolutive mixture
Methods using time structure
3/24/2006
3/24/2006
BSE
D. Mandic and A. Cichocki, An Online Algorithm For Blind Extraction Of Sources With Different
Dynamical Structures.
3/24/2006