Direction-of-arrival and power spectral density estimation using a single directional microphone and group-sparse optimization

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorTengan, Elisaen_US
dc.contributor.authorDietzen, Thomasen_US
dc.contributor.authorElvander, Filipen_US
dc.contributor.authorvan Waterschoot, Toonen_US
dc.contributor.departmentDepartment of Information and Communications Engineeringen
dc.contributor.groupauthorStructured and Stochastic Modelingen
dc.contributor.organizationKU Leuvenen_US
dc.date.accessioned2023-10-18T06:52:40Z
dc.date.available2023-10-18T06:52:40Z
dc.date.issued2023-12en_US
dc.description.abstractIn this paper, two approaches are proposed for estimating the direction of arrival (DOA) and power spectral density (PSD) of stationary point sources by using a single, rotating, directional microphone. These approaches are based on a method previously presented by the authors, in which point source DOAs were estimated by using a broadband signal model and solving a group-sparse optimization problem, where the number of observations made by the rotating directional microphone can be lower than the number of candidate DOAs in an angular grid. The DOA estimation is followed by the estimation of the sources’ PSDs through the solution of an overdetermined least squares problem. The first approach proposed in this paper includes the use of an additional nonnegativity constraint on the residual noise term when solving the group-sparse optimization problem and is referred to as the Group Lasso Least Squares (GL-LS) approach. The second proposed approach, in addition to the new nonnegativity constraint, employs a narrowband signal model when building the linear system of equations used for formulating the group-sparse optimization problem, where the DOAs and PSDs can be jointly estimated by iterative, group-wise reweighting. This is referred to as the Group-Lasso with l1-reweighting (GL-L1) approach. Both proposed approaches are implemented using the alternating direction method of multipliers (ADMM), and their performance is evaluated through simulations in which different setup conditions are considered, ranging from different types of model mismatch to variations in the acoustic scene and microphone directivity pattern. The results obtained show that in a scenario involving a microphone response mismatch between observed data and the signal model used, having the additional nonnegativity constraint on the residual noise can improve the DOA estimation for the case of GL-LS and the PSD estimation for the case of GL-L1. Moreover, the GL-L1 approach can present an advantage over GL-LS in terms of DOA estimation performance in scenarios with low SNR or where multiple sources are closely located to each other. Finally, it is shown that having the least squares PSD re-estimation step is beneficial in most scenarios, such that GL-LS outperformed GL-L1 in terms of PSD estimation errors.en
dc.description.versionPeer revieweden
dc.format.extent20
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationTengan, E, Dietzen, T, Elvander, F & van Waterschoot, T 2023, ' Direction-of-arrival and power spectral density estimation using a single directional microphone and group-sparse optimization ', Eurasip Journal on Audio, Speech, and Music Processing, vol. 2023, no. 1, 38 . https://doi.org/10.1186/s13636-023-00304-8en
dc.identifier.doi10.1186/s13636-023-00304-8en_US
dc.identifier.issn1687-4714
dc.identifier.issn1687-4722
dc.identifier.otherPURE UUID: 1c8e871b-6948-425f-8d37-70e231358c94en_US
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/1c8e871b-6948-425f-8d37-70e231358c94en_US
dc.identifier.otherPURE LINK: http://www.scopus.com/inward/record.url?scp=85173601840&partnerID=8YFLogxKen_US
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/125041132/s13636-023-00304-8.pdfen_US
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/124161
dc.identifier.urnURN:NBN:fi:aalto-202310186510
dc.language.isoenen
dc.publisherSpringer
dc.relation.ispartofseriesEURASIP JOURNAL ON AUDIO, SPEECH, AND MUSIC PROCESSINGen
dc.rightsopenAccessen
dc.subject.keyworddirection-of-arrival estimationen_US
dc.subject.keywordpower spectral density estimationen_US
dc.subject.keywordsingle-channelen_US
dc.subject.keywordgroup-sparsityen_US
dc.titleDirection-of-arrival and power spectral density estimation using a single directional microphone and group-sparse optimizationen
dc.typeA1 Alkuperäisartikkeli tieteellisessä aikakauslehdessäfi
dc.type.versionpublishedVersion

Files