Object-Based Six-Degrees-of-Freedom Rendering of Sound Scenes Captured with Multiple Ambisonic Receivers

dc.contributor: Aalto-yliopisto [fi]
dc.contributor: Aalto University [en]
dc.contributor.author: McCormack, Leo [en_US]
dc.contributor.author: Politis, Archontis [en_US]
dc.contributor.author: McKenzie, Thomas [en_US]
dc.contributor.author: Hold, Christoph [en_US]
dc.contributor.author: Pulkki, Ville [en_US]
dc.contributor.department: Dept Signal Process and Acoust [en]
dc.contributor.groupauthor: Communication Acoustics: Spatial Sound and Psychoacoustics [en]
dc.date.accessioned: 2022-06-22T09:01:30Z
dc.date.available: 2022-06-22T09:01:30Z
dc.date.issued: 2022-05 [en_US]
dc.description: Funding Information: This research has received funding from the Aalto University Doctoral School of Electrical Engineering. Thomas McKenzie was supported by the Human Optimised XR (HumOR) Project. Special thanks are extended to Nils Meyer-Kahlen for assistance with the 3D room scan and Unity scene used during the evaluation and to Sebastian J. Schlecht for the insightful discussions that transpired during the course of this study. Publisher Copyright: © 2022 Audio Engineering Society. All rights reserved.
dc.description.abstract: This article proposes a system for object-based six-degrees-of-freedom (6DoF) rendering of spatial sound scenes that are captured using a distributed arrangement of multiple Ambisonic receivers. The approach is based on first identifying and tracking the positions of sound sources within the scene, followed by the isolation of their signals through the use of beamformers. These sound objects are subsequently spatialized over the target playback setup, with respect to both the head orientation and position of the listener. The diffuse ambience of the scene is rendered separately by first spatially subtracting the source signals from the receivers located nearest to the listener position. The resultant residual Ambisonic signals are then spatialized, decorrelated, and summed together with suitable interpolation weights. The proposed system is evaluated through an in situ listening test conducted in 6DoF virtual reality, whereby real-world sound sources are compared with the auralization achieved through the proposed rendering method. The results of 15 participants suggest that in comparison to a linear interpolation-based alternative, the proposed object-based approach is perceived as being more realistic. [en]
dc.description.version: Peer reviewed [en]
dc.format.extent: 18
dc.format.extent: 355-372
dc.format.mimetype: application/pdf [en_US]
dc.identifier.citation: McCormack, L, Politis, A, McKenzie, T, Hold, C & Pulkki, V 2022, 'Object-Based Six-Degrees-of-Freedom Rendering of Sound Scenes Captured with Multiple Ambisonic Receivers', AES: Journal of the Audio Engineering Society, vol. 70, no. 5, pp. 355-372. https://doi.org/10.17743/JAES.2022.0010 [en]
dc.identifier.doi: 10.17743/JAES.2022.0010 [en_US]
dc.identifier.issn: 1549-4950
dc.identifier.other: PURE UUID: 2c9b6165-6b00-46e9-8b92-0325d9d08cca [en_US]
dc.identifier.other: PURE ITEMURL: https://research.aalto.fi/en/publications/2c9b6165-6b00-46e9-8b92-0325d9d08cca [en_US]
dc.identifier.other: PURE LINK: http://www.scopus.com/inward/record.url?scp=85131732082&partnerID=8YFLogxK [en_US]
dc.identifier.other: PURE FILEURL: https://research.aalto.fi/files/84723574/21739.pdf [en_US]
dc.identifier.uri: https://aaltodoc.aalto.fi/handle/123456789/115311
dc.identifier.urn: URN:NBN:fi:aalto-202206224151
dc.language.iso: en [en]
dc.publisher: Audio Engineering Society
dc.relation.ispartofseries: AES: Journal of the Audio Engineering Society [en]
dc.relation.ispartofseries: Volume 70, issue 5 [en]
dc.rights: openAccess [en]
dc.title: Object-Based Six-Degrees-of-Freedom Rendering of Sound Scenes Captured with Multiple Ambisonic Receivers [en]
dc.type: A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä (A1 Original article in a scientific journal) [fi]
dc.type.version: publishedVersion