Spatially Selective Sound Capture Based on Aggregated Pairwise Similarity Measures

Loading...
Thumbnail Image

Access rights

openAccess
acceptedVersion

URL

Journal Title

Journal ISSN

Volume Title

A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä

Major/Subject

Mcode

Degree programme

Language

en

Pages

13

Series

Journal of the Audio Engineering Society, Volume 73, issue 11, pp. 747-759

Abstract

This paper proposes a spatial post-filter for speech enhancement in reverberant environments, such as meeting rooms and lecture halls, using widely distributed microphones. The method operates in the time-frequency domain by aggregating phase-corrected cross-spectral similarities between microphone pairs to form a unified spatial filter. This filter can be applied to omnidirectional or beamformed signals to extract a target source from a mixture while reducing the influence of spatial aliasing. The approach is evaluated in simulated multitalker scenarios and validated through objective performance metrics and subjective listening tests under varied acoustic conditions. Results demonstrate robust suppression of interference and consistent enhancement of the desired source.

Description

Keywords

Other note

Citation

Wirler, S & Pulkki, V 2025, 'Spatially Selective Sound Capture Based on Aggregated Pairwise Similarity Measures', Journal of the Audio Engineering Society, vol. 73, no. 11, pp. 747-759. https://doi.org/10.17743/jaes.2022.0228