Spectral Features derived from Single Frequency Filter for Multispeaker Localization

Access rights

openAccess
acceptedVersion

A4 Article in conference proceedings

Language

en

Series

26th National Conference on Communications, NCC 2020

Abstract

In this paper, we present a multispeaker localization method that uses time delay estimates obtained from spectral features derived from the single frequency filter (SFF) representation. The mixture signals are transformed into the SFF domain, from which the temporal envelopes are extracted at each frequency. Subsequently, spectral features such as the mean and variance of the temporal envelopes across frequencies are correlated to extract the time delay estimates. Since these features emphasize the high-SNR regions of the mixtures, correlating the corresponding features across channels leads to robust delay estimates in real acoustic environments. We study the efficacy of the developed approach by comparing its performance with existing correlation-based time delay estimation techniques. Both a standard data set recorded in real-room acoustic environments and a simulated data set are used for evaluation. The localization performance of the proposed algorithm closely matches that of a state-of-the-art correlation approach and outperforms the other approaches.
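
The processing chain outlined in the abstract (SFF envelopes per frequency, mean or variance across frequencies, correlation across channels) can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the authors' implementation. The SFF form used here (differenced signal, frequency shift by exp(jωn) with ω = π − 2πf/fs, single-pole filter with its pole at z = −r), the value r = 0.995, the frequency grid, and the function names `sff_envelopes`, `feature_delay`, and `estimate_delay` are all assumptions chosen for illustration. The sketch returns only the single strongest delay between two channels; a multispeaker setting would presumably need several correlation peaks.

```python
import numpy as np
from scipy.signal import lfilter


def sff_envelopes(x, fs, freqs, r=0.995):
    """Temporal envelopes of x at each frequency in freqs (assumed SFF form)."""
    x = np.asarray(x, dtype=float)
    d = np.diff(x, prepend=0.0)                       # differenced signal
    n = np.arange(len(d))
    # Shift each frequency of interest to fs/2 (assumed: w = pi - 2*pi*f/fs).
    w = np.pi - 2.0 * np.pi * np.asarray(freqs, dtype=float) / fs
    shifted = d[None, :] * np.exp(1j * w[:, None] * n[None, :])
    # Single-pole filter H(z) = 1 / (1 + r z^-1), pole at z = -r.
    y = lfilter([1.0], [1.0, r], shifted, axis=1)
    return np.abs(y)                                  # shape (len(freqs), len(x))


def feature_delay(f1, f2, max_lag):
    """Lag (in samples) of f2 relative to f1 that maximizes normalized correlation."""
    n = len(f1)
    if n <= 2 * max_lag:
        raise ValueError("signal too short for the requested max_lag")
    ref = f1[max_lag:n - max_lag]
    ref = ref - ref.mean()
    best_lag, best_score = 0, -np.inf
    for lag in range(-max_lag, max_lag + 1):
        seg = f2[max_lag + lag:n - max_lag + lag]
        seg = seg - seg.mean()
        denom = np.linalg.norm(ref) * np.linalg.norm(seg)
        score = float(ref @ seg) / denom if denom > 0 else 0.0
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def estimate_delay(x1, x2, fs, freqs, max_lag, feature="mean"):
    """Delay of channel x2 relative to x1 from the mean or variance of SFF envelopes."""
    stat = np.mean if feature == "mean" else np.var
    f1 = stat(sff_envelopes(x1, fs, freqs), axis=0)   # feature across frequencies
    f2 = stat(sff_envelopes(x2, fs, freqs), axis=0)
    return feature_delay(f1, f2, max_lag)
```

A positive return value means the second channel lags the first. The normalized correlation over a fixed-length window is a design choice of this sketch, made so that every candidate lag is scored on the same number of samples.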

Citation

Thakallapalli, S, Kadiri, S & Gangashetty, S 2020, Spectral Features derived from Single Frequency Filter for Multispeaker Localization. in 26th National Conference on Communications, NCC 2020, 9056007, IEEE, National Conference on Communications, Kharagpur, India, 21/02/2020. https://doi.org/10.1109/NCC48643.2020.9056007