Multi-Source Direction-of-Arrival Estimation Using Steered Response Power and Group-Sparse Optimization
Loading...
Access rights
openAccess
acceptedVersion
URL
Journal Title
Journal ISSN
Volume Title
A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä
This publication is imported from Aalto University research portal.
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Date
Major/Subject
Mcode
Degree programme
Language
en
Pages
15
Series
IEEE/ACM Transactions on Audio Speech and Language Processing, Volume 32, pp. 3517-3531
Abstract
In this paper, a method is proposed for estimating the direction of arrival (DOA) of multiple broadband sound sources. This is achieved through the solution of a group-sparse optimization problem, which models an observed broadband steered response power (SRP) map as a linear function of power spectral densities (PSDs), corresponding to a set of candidate DOAs, and forming a PSD vector. Given the assumption of spatial sparsity, the estimation of the source DOAs is then accomplished by identifying peaks in the resulting spatial power density, i.e., the estimated direction-specific PSDs integrated over frequency. The motivation behind the proposed method lies in its potential to reveal more distinct peaks in the estimated spatial power density than those directly observed in the broadband SRP map, which can be beneficial to the robustness in DOA estimation performance when multiple sources need to be distinguished under varying acoustic conditions. An implementation of the proposed method using the alternating direction method of multipliers (ADMM) is presented, and the DOA estimation performance is evaluated with both simulated and experimental data. Results show that, especially in reverberant scenarios, the proposed method presents an advantage in locating closely spaced sources when compared to the conventional SRP-PHAT, the group-sparse iterative covariance-based estimation (GSPICE) method, and the wideband MUSIC method with geometric averaging. Furthermore, it is observed that for a compact microphone array, the proposed method overall maintained its performance even when using SRP maps computed with grid resolutions that are lower than the sampling requirements of the broadband SRP function. Finally, results obtained with experimental data showed the validity and applicability of the proposed method in a practical meeting room environment.Description
Publisher Copyright: IEEE
Other note
Citation
Tengan, E, Dietzen, T, Elvander, F & Waterschoot, T V 2024, 'Multi-Source Direction-of-Arrival Estimation Using Steered Response Power and Group-Sparse Optimization', IEEE/ACM Transactions on Audio Speech and Language Processing, vol. 32, pp. 3517-3531. https://doi.org/10.1109/TASLP.2024.3419417