Spectral Features derived from Single Frequency Filter for Multispeaker Localization
Access rights
openAccess
acceptedVersion
A4 Article in conference proceedings
This publication is imported from Aalto University research portal.
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Language
en
Series
26th National Conference on Communications, NCC 2020
Abstract
In this paper, we present a multispeaker localization method that uses time delay estimates obtained from spectral features derived from the single frequency filter (SFF) representation. The mixture signals are transformed into the SFF domain, from which the temporal envelopes are extracted at each frequency. Subsequently, spectral features such as the mean and variance of the temporal envelopes across frequencies are correlated to extract the time delay estimates. Since these features emphasize the high-SNR regions of the mixtures, correlating the corresponding features across the channels leads to robust delay estimates in real acoustic environments. We study the efficacy of the developed approach by comparing its performance with existing correlation-based time delay estimation techniques. Both a standard data set recorded in real-room acoustic environments and a simulated data set are used for evaluation. It is observed that the localization performance of the proposed algorithm closely matches that of a state-of-the-art correlation approach and outperforms the other approaches.
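The pipeline outlined in the abstract (SFF envelopes, mean and variance across frequencies, correlation across channels) can be illustrated with a rough Python sketch. This is not the authors' implementation. It assumes the commonly described SFF formulation, in which the signal is frequency-shifted so that the frequency of interest lands at half the sampling rate and is then passed through a single-pole filter with its pole at z = -r. The function names `sff_envelopes`, `spectral_features` and `estimate_delay`, the pole radius r = 0.99, the choice of frequency grid, and the use of a per-lag normalized correlation are all assumptions made for illustration. The sketch also returns a single delay, whereas a multispeaker setting would require picking several correlation peaks.

```python
import numpy as np
from scipy.signal import lfilter


def sff_envelopes(x, fs, freqs_hz, r=0.99):
    """Temporal envelopes of x at each frequency (assumed SFF formulation)."""
    n = np.arange(len(x))
    envelopes = []
    for f in freqs_hz:
        # Shift the frequency of interest to omega = pi (half the sampling rate).
        shift = np.pi - 2.0 * np.pi * f / fs
        shifted = x * np.exp(1j * shift * n)
        # Single-pole filter H(z) = 1 / (1 + r z^-1), i.e. pole at z = -r.
        y = lfilter([1.0], [1.0, r], shifted)
        envelopes.append(np.abs(y))
    return np.array(envelopes)  # shape: (num_freqs, num_samples)


def spectral_features(envelopes):
    """Mean and variance of the envelopes across frequency, per time sample."""
    return envelopes.mean(axis=0), envelopes.var(axis=0)


def estimate_delay(feat_a, feat_b, max_lag):
    """Lag (in samples) maximizing the normalized correlation of two features.

    A positive result means feat_b lags feat_a.
    """
    n = len(feat_a)
    best_lag, best_score = 0, -np.inf
    for lag in range(-max_lag, max_lag + 1):
        if lag >= 0:
            a, b = feat_a[:n - lag], feat_b[lag:]
        else:
            a, b = feat_a[-lag:], feat_b[:n + lag]
        score = np.corrcoef(a, b)[0, 1]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag
```

Given two microphone signals `x1` and `x2` sampled at `fs`, one would compute `spectral_features(sff_envelopes(x1, fs, freqs))` for each channel and pass the corresponding mean (or variance) features to `estimate_delay`. Converting the resulting delay into a direction of arrival depends on the array geometry, which the abstract does not specify.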
Citation
Thakallapalli, S, Kadiri, S & Gangashetty, S 2020, Spectral Features derived from Single Frequency Filter for Multispeaker Localization. in 26th National Conference on Communications, NCC 2020., 9056007, IEEE, National Conference on Communications, Kharagpur, India, 21/02/2020. https://doi.org/10.1109/NCC48643.2020.9056007