title: Estimation of Direction of Arrival Using Information Theory author: - name: Aatmaram Yadav (11004), Anupam Jakhar (11134), - name: Arpit Jangid (11150), Jayesh Kumar Gupta (11337), - name: Subhajit Mohanty (11731) date: EE301A - Term Paper Group 2 math: <script type="text/javascript" src="http://cdn.mathjax.org/mathjax/latest/MathJax.js?config=TeX-AMS-MML_HTMLorMML"></script> mainfont: Gentium sansfont: Lato monofont: Consolas fontsize: 11pt documentclass: article classoption: oneside papersize: a4 ...
Time delay estimation (TDE) algorithms are popularly employed for the estimation of Direction of Arrival (DoA) of an acoustic source. The most popular approach relies on defining the relative delay between a pair of microphones by getting a peak in the comparing function at the correct DoA of the source.
The most common method applied for TDE is the generalized cross-correlation (GCC) algorithm with its numerous variants. But in reverberant environments its practical usefulness is severely restricted.
In our framework, we use the concept of mutual information to create a new comparing function that calculates the correct TDE by maximizing information that one microphone has about the other, assuming Gaussian distributions.
In the following sections, we describe the theory and implementation of this method and show the comparative results with (GCC-PHAT) in the result section as done in [1]1.
Our model consists of a two element microphone array positioned arbitrarily in some enclosure. Distance between the microphones is
(@)
where
Using geometry we can find the following relation between DoA angle
(@)
where
The DoA is typically obtained using the original GCC function 2 or one of its variants. Here we use GCC-PHAT version, considered to be the most robust version 3
(@)
(@)
However to take the effect of reverberation into account, we use the following modal:
(@)
where
Mutual information between signals
(@mi)
where
The problem of finding the delay is equivalent to finding the value of
(@)
For large L
(@) $$C(\tau) \approx \begin{bmatrix} x_1 \ x_2(\tau) \end{bmatrix} \begin{bmatrix} x_1 \ x_2(\tau) \end{bmatrix}^T = \begin{bmatrix} C_{11} && C_{12}(\tau) \ C_{21}(\tau) && C_{22}\end{bmatrix}$$
For reverberation model we find the marginal MI, considering jointly
(@mmi)
Here
According to information theoretic criterion, when (@mmi) reaches a maximum as a function of
Simulations involved a single source and two microphone system. The relative sample delay estimate varies according to
Variable Value
Speech duration 5 s
Sampling Frequency,
Distance between source 2.23 & mid-point of receivers Actual delay (samples) 6.89
Expected delay (samples) 7
Length of
Room dimensions [5 4 1] m
Noise added 15 dB
The data was divided into 10 frames, and for each frame an estimate
(@)
The root mean-squared error (RMSE) metric is the performance measure used to evaluate the system. For a single displacement of the geometry, this is defined to be the square root of the average value of
In the following figures we present the average RMSE over all ten simulation frames. Thus lower the average RMSE, better is the performance.
We couldn't recreate the simulations as suggested in paper 1.
Footnotes
-
F. Talantzis, A.G. Constantinides, L. Polymenakos, Estimation of direction of arrival using information theory, in: IEEE Signal Processing, 12 (8), August 2005, pp. 561–564. ↩ ↩2
-
C. H. Knapp and G. C. Carter, “The generalized correlation method for estimation of time delay,” IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-24, no. 4, pp. 320–327, Aug. 1976. ↩
-
M. Brandstein and D. B. Ward, Microphone Arrays Signal Processing Techniques and Applications.New York: Springer-Verlag, 2001. ↩
-
T. M. Cover and J. A. Thomas, Elements of Information Theory.New York: Wiley, 1991. ↩ ↩2