Acoustic Scattering for Spatial Audio Applications
School of Science | Doctoral thesis (article-based) | Defence date: 2022-05-24
Unless otherwise stated, all rights belong to the author. You may download, display and print this publication for Your own personal use. Commercial use is prohibited.
70 + app. 60
Aalto University publication series DOCTORAL THESES, 63/2022
AbstractModeling of sound propagation on the context of acoustic design and interactive applications have mainly focused on room acoustics as well as source and receiver modeling. In order to enrich the description and perceptual immersion of virtual sound-fields, modeling frameworks can also include the effects of scattering of bodies within the physical space. One of the main challenges in modeling the effects of scattering, is that its behaviour not only depends on the geometry of the scatterer but also the direction of arrival of the incident field. This thesis is a collection of five publications, the first two studies focus on the effects of near-field sources, and the last three studies involve the effects of scattering within spatial audio applications. The first publication explores the effects of near-field sources on High-order Ambisonics recording, processing and binaural reproduction. Results indicate that while near-field sources introduce low-frequency proximity gains in high-order microphones arrays, the regularization stages in Ambisonics recording prevents excessive gains. The second publication explores the directivity of near-field speech of 24 subjects and evaluates various repeatable speech reproduction alternatives. The third publication presents a scheme for encoding the acoustic scattering of arbitrary geometries into the spherical harmonic domain. After encoding, the scattering is represented as a multiple-input multiple-output matrix which describes the relation between the incoming and outgoing scattering modes of a geometry. This method allows for the standard transformations in the spherical harmonic domain (rotation, translation, scaling) and it is compatible with existing spatial audio frameworks such as Ambisonics and image-source methods. This method is validated using boundary element method simulations and indicates minimal synthesis error. The fourth publication presents a method to encode the space domain signals from a microphone array with arbitrary geometry and irregularly distributed sensors into Ambisonics. The algorithm relies on the array response and its enclosure's scattering properties to solve the direction of various active sources as well as the diffuse properties of the sound-field. Objective and subjective evaluations indicate that the proposed method outperforms traditional linear encoding. The fifth publication extends the method presented in the third publication by allowing sector-based encoding of acoustic scattering, optimal for geometries and surfaces which do not require entire spherical radiation. This last publication also presents a method to compress the data of the scattering matrix, allowing for more efficient memory storage. Methods proposed in the third and fifth publications can be used to introduce scattering geometries into interactive sound environments to produce more descriptive sound-fields while the fourth publication can be used to develop Ambisonic recording arrays on practical devices such as wearables and head-mounted displays.
Defence is held on 24.5.2022 12:00 – 16:00 Zoom, https://aalto.zoom.us/j/62282474989
Supervising professorLokki, Tapio, Prof., Aalto University, Department of Signal Processing and Acoustics, Finland
Thesis advisorPolitis, Archontis, Dr., Tampere University of Technology, Finland
acoustics, spatial audio, acoustic modelling, scattering
- [Publication 1]: Gonzalez, Raimundo and Politis, Archontis and Lokki, Tapio. Effects of Near-field Sources on Ambisonics Recording and Playback. In 151stConvention of the Audio Engineering Society, Las Vegas, NV, US, October 2021
- [Publication 2]: Gonzalez, Raimundo and McKenzie, Tom and Politis, Archontis and Lokki, Tapio. Near-Field Evaluation of Reproducible Speech Sources.Submitted to Journal of the Audio Engineering Society, January 19 2022
[Publication 3]: Gonzalez, Raimundo and Politis, Archontis and Lokki, Tapio. Spherical Decomposition of Arbitrary Scattering Geometries for Virtual AcousticEnvironments. In Proceedings of the 23rd International Conference on Digital Audio Effects (DAFx-20), Vienna, Austria, September 2021.
Full text in Acris/Aaltodoc: http://urn.fi/URN:NBN:fi:aalto-2021112410384
[Publication 4]: McCormack, Leo and Politis, Archontis and Gonzalez, Raimundo and Lokki, Tapio and Pulkki, Ville. Parametric Ambisonic Encoding of ArbitraryMicrophone Arrays. Submitted to IEEE Transactions on Audio, Speech and Language Processing, October 27th 2021.
DOI: 10.5281/zenodo.6401603 View at publisher
- [Publication 5]: Gonzalez, Raimundo and Hold, Christoph and Politis, Archontis and Lokki, Tapio. Sector-Based Encoding and Data Compression of VirtualAcoustic Scattering. Submitted to 30th European Signal Processing Conferenrence (EUSIPCO), February 8 2022