Acoustic Scattering for Spatial Audio Applications

dc.contributorAalto Universityen
dc.contributor.advisorPolitis, Archontis, Dr., Tampere University of Technology, Finland
dc.contributor.authorGonzalez, Raimundo
dc.contributor.departmentTietotekniikan laitosfi
dc.contributor.departmentDepartment of Computer Scienceen
dc.contributor.schoolPerustieteiden korkeakoulufi
dc.contributor.schoolSchool of Scienceen
dc.contributor.supervisorLokki, Tapio, Prof., Aalto University, Department of Signal Processing and Acoustics, Finland
dc.descriptionDefence is held on 24.5.2022 12:00 – 16:00 Zoom,
dc.description.abstractModeling of sound propagation on the context of acoustic design and interactive applications have mainly focused on room acoustics as well as source and receiver modeling. In order to enrich the description and perceptual immersion of virtual sound-fields, modeling frameworks can also include the effects of scattering of bodies within the physical space. One of the main challenges in modeling the effects of scattering, is that its behaviour not only depends on the geometry of the scatterer but also the direction of arrival of the incident field. This thesis is a collection of five publications, the first two studies focus on the effects of near-field sources, and the last three studies involve the effects of scattering within spatial audio applications. The first publication explores the effects of near-field sources on High-order Ambisonics recording, processing and binaural reproduction. Results indicate that while near-field sources introduce low-frequency proximity gains in high-order microphones arrays, the regularization stages in Ambisonics recording prevents excessive gains. The second publication explores the directivity of near-field speech of 24 subjects and evaluates various repeatable speech reproduction alternatives. The third publication presents a scheme for encoding the acoustic scattering of arbitrary geometries into the spherical harmonic domain. After encoding, the scattering is represented as a multiple-input multiple-output matrix which describes the relation between the incoming and outgoing scattering modes of a geometry. This method allows for the standard transformations in the spherical harmonic domain (rotation, translation, scaling) and it is compatible with existing spatial audio frameworks such as Ambisonics and image-source methods. This method is validated using boundary element method simulations and indicates minimal synthesis error. The fourth publication presents a method to encode the space domain signals from a microphone array with arbitrary geometry and irregularly distributed sensors into Ambisonics. The algorithm relies on the array response and its enclosure's scattering properties to solve the direction of various active sources as well as the diffuse properties of the sound-field. Objective and subjective evaluations indicate that the proposed method outperforms traditional linear encoding. The fifth publication extends the method presented in the third publication by allowing sector-based encoding of acoustic scattering, optimal for geometries and surfaces which do not require entire spherical radiation. This last publication also presents a method to compress the data of the scattering matrix, allowing for more efficient memory storage. Methods proposed in the third and fifth publications can be used to introduce scattering geometries into interactive sound environments to produce more descriptive sound-fields while the fourth publication can be used to develop Ambisonic recording arrays on practical devices such as wearables and head-mounted displays.en
dc.format.extent70 + app. 60
dc.identifier.isbn978-952-64-0792-0 (electronic)
dc.identifier.isbn978-952-64-0791-3 (printed)
dc.identifier.issn1799-4942 (electronic)
dc.identifier.issn1799-4934 (printed)
dc.identifier.issn1799-4934 (ISSN-L)
dc.opnProf. Sascha Spors, Universität Rostock, Germany
dc.publisherAalto Universityen
dc.relation.haspart[Publication 1]: Gonzalez, Raimundo and Politis, Archontis and Lokki, Tapio. Effects of Near-field Sources on Ambisonics Recording and Playback. In 151stConvention of the Audio Engineering Society, Las Vegas, NV, US, October 2021
dc.relation.haspart[Publication 2]: Gonzalez, Raimundo and McKenzie, Tom and Politis, Archontis and Lokki, Tapio. Near-Field Evaluation of Reproducible Speech Sources.Submitted to Journal of the Audio Engineering Society, January 19 2022
dc.relation.haspart[Publication 3]: Gonzalez, Raimundo and Politis, Archontis and Lokki, Tapio. Spherical Decomposition of Arbitrary Scattering Geometries for Virtual AcousticEnvironments. In Proceedings of the 23rd International Conference on Digital Audio Effects (DAFx-20), Vienna, Austria, September 2021. Full text in Acris/Aaltodoc:
dc.relation.haspart[Publication 4]: McCormack, Leo and Politis, Archontis and Gonzalez, Raimundo and Lokki, Tapio and Pulkki, Ville. Parametric Ambisonic Encoding of ArbitraryMicrophone Arrays. Submitted to IEEE Transactions on Audio, Speech and Language Processing, October 27th 2021. DOI: 10.5281/zenodo.6401603
dc.relation.haspart[Publication 5]: Gonzalez, Raimundo and Hold, Christoph and Politis, Archontis and Lokki, Tapio. Sector-Based Encoding and Data Compression of VirtualAcoustic Scattering. Submitted to 30th European Signal Processing Conferenrence (EUSIPCO), February 8 2022
dc.relation.ispartofseriesAalto University publication series DOCTORAL THESESen
dc.revRafaely, Boaz, Prof., Ben-Gurion University of the Negev, Israel
dc.revZotter, Franz, Prof., Universität für Musik und darstellende Kunst Graz, Austria
dc.subject.keywordspatial audioen
dc.subject.keywordacoustic modellingen
dc.subject.otherComputer scienceen
dc.titleAcoustic Scattering for Spatial Audio Applicationsen
dc.typeG5 Artikkeliväitöskirjafi
dc.type.ontasotDoctoral dissertation (article-based)en
dc.type.ontasotVäitöskirja (artikkeli)fi
local.aalto.acrisexportstatuschecked 2022-05-24_0852
local.aalto.infraAalto Acoustics Lab
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
5.87 MB
Adobe Portable Document Format