Direction-of-arrival and power spectral density estimation using a single directional microphone and group-sparse optimization

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorTengan, Elisa
dc.contributor.authorDietzen, Thomas
dc.contributor.authorElvander, Filip
dc.contributor.authorvan Waterschoot, Toon
dc.contributor.departmentKU Leuven
dc.contributor.departmentDepartment of Information and Communications Engineering
dc.date.accessioned2023-10-18T06:52:40Z
dc.date.available2023-10-18T06:52:40Z
dc.date.issued2023-12
dc.description.abstractIn this paper, two approaches are proposed for estimating the direction of arrival (DOA) and power spectral density (PSD) of stationary point sources by using a single, rotating, directional microphone. These approaches are based on a method previously presented by the authors, in which point source DOAs were estimated by using a broadband signal model and solving a group-sparse optimization problem, where the number of observations made by the rotating directional microphone can be lower than the number of candidate DOAs in an angular grid. The DOA estimation is followed by the estimation of the sources’ PSDs through the solution of an overdetermined least squares problem. The first approach proposed in this paper includes the use of an additional nonnegativity constraint on the residual noise term when solving the group-sparse optimization problem and is referred to as the Group Lasso Least Squares (GL-LS) approach. The second proposed approach, in addition to the new nonnegativity constraint, employs a narrowband signal model when building the linear system of equations used for formulating the group-sparse optimization problem, where the DOAs and PSDs can be jointly estimated by iterative, group-wise reweighting. This is referred to as the Group-Lasso with l1-reweighting (GL-L1) approach. Both proposed approaches are implemented using the alternating direction method of multipliers (ADMM), and their performance is evaluated through simulations in which different setup conditions are considered, ranging from different types of model mismatch to variations in the acoustic scene and microphone directivity pattern. The results obtained show that in a scenario involving a microphone response mismatch between observed data and the signal model used, having the additional nonnegativity constraint on the residual noise can improve the DOA estimation for the case of GL-LS and the PSD estimation for the case of GL-L1. Moreover, the GL-L1 approach can present an advantage over GL-LS in terms of DOA estimation performance in scenarios with low SNR or where multiple sources are closely located to each other. Finally, it is shown that having the least squares PSD re-estimation step is beneficial in most scenarios, such that GL-LS outperformed GL-L1 in terms of PSD estimation errors.en
dc.description.versionPeer revieweden
dc.format.extent20
dc.format.mimetypeapplication/pdf
dc.identifier.citationTengan , E , Dietzen , T , Elvander , F & van Waterschoot , T 2023 , ' Direction-of-arrival and power spectral density estimation using a single directional microphone and group-sparse optimization ' , EURASIP JOURNAL ON AUDIO, SPEECH, AND MUSIC PROCESSING , vol. 2023 , no. 1 , 38 . https://doi.org/10.1186/s13636-023-00304-8en
dc.identifier.doi10.1186/s13636-023-00304-8
dc.identifier.issn1687-4714
dc.identifier.issn1687-4722
dc.identifier.otherPURE UUID: 1c8e871b-6948-425f-8d37-70e231358c94
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/1c8e871b-6948-425f-8d37-70e231358c94
dc.identifier.otherPURE LINK: http://www.scopus.com/inward/record.url?scp=85173601840&partnerID=8YFLogxK
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/125041132/s13636-023-00304-8.pdf
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/124161
dc.identifier.urnURN:NBN:fi:aalto-202310186510
dc.language.isoenen
dc.publisherSpringer
dc.relation.ispartofseriesEURASIP JOURNAL ON AUDIO, SPEECH, AND MUSIC PROCESSINGen
dc.rightsopenAccessen
dc.subject.keyworddirection-of-arrival estimation
dc.subject.keywordpower spectral density estimation
dc.subject.keywordsingle-channel
dc.subject.keywordgroup-sparsity
dc.titleDirection-of-arrival and power spectral density estimation using a single directional microphone and group-sparse optimizationen
dc.typeA1 Alkuperäisartikkeli tieteellisessä aikakauslehdessäfi
dc.type.versionpublishedVersion
Files