Acoustic scene analysis by multiperspective sector-based sound field decomposition

Loading...
Thumbnail Image
Journal Title
Journal ISSN
Volume Title
Sähkötekniikan korkeakoulu | Master's thesis
Date
2021-10-19
Department
Major/Subject
Acoustics and Audio Technology
Mcode
ELEC3030
Degree programme
CCIS - Master’s Programme in Computer, Communication and Information Sciences (TS2013)
Language
en
Pages
71
Series
Abstract
In this thesis, the performance of high-order sound field analysis by spatial filtering is evaluated in the context of 3D source localization. First, various factors in ‘sector- based’ processing—including analysis order, the presence of interfering noise, and source direction—are assessed using two key sound field indicators derived from intensimetric analysis: sound field diffuseness estimation and source direction-of- arrival (DoA) estimation. This carries forward to the evaluation of a 3D source localization technique which utilizes concurrent analyses from multiple receivers. The evaluation is carried out by simulation of ideal spherical harmonic receiver signals of sound fields comprised of fundamental components—a mixture of plane wave sound sources and an isotropic diffuse field—which adhere to an underlying plane wave model that has previously been characterized analytically. This model is then challenged with two scenarios: the presence of an interfering sound source, and an anisotropic, partially-correlated reverberant field. The high-order, sector- based approach to source localization is shown to be a consistent improvement upon the (first-order) method without spatial filtering, operating primarily through the principle of increasing the direct-to-diffuse (or signal-to-noise) energy ratio. However, numerous considerations must be taken if robust localization is to be performed over a large spatial extent. These include sector orientation, the arrangement and number of receivers, and estimation filtering and culling. Informed by the system performance under the tested conditions as well as the analytical model describing the sound field under the action of spatial filtering, optimization techniques are proposed, tested, and found to be successful under specified constraints. These include employing the diffuseness metric for weighted-DoA estimation in the localization task, an iterative approach localization using the estimated source distance to weight the contribution of DoA estimates, and a spatial sweep technique which applies a diffuseness constraint for DoA estimation culling which moves toward multisource localization.
Description
Supervisor
Pulkki, Ville
Thesis advisor
McCormack, Leo
Keywords
spatial audio, sound field analysis, Ambisonics, HOA, spatial filtering
Other note
Citation