Transfer-Plausible Acoustics for Augmented Reality

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.advisorSchlecht, Sebastian J., Prof., Friedrich-Alexander Universität Erlangen-Nürnberg, Germany
dc.contributor.advisorRobinson, Philip, Dr., Reality Labs Research, USA
dc.contributor.authorMeyer-Kahlen, Nils
dc.contributor.departmentInformaatio- ja tietoliikennetekniikan laitosfi
dc.contributor.departmentDepartment of Information and Communications Engineeringen
dc.contributor.labVirtual Acoustics Groupen
dc.contributor.schoolSähkötekniikan korkeakoulufi
dc.contributor.schoolSchool of Electrical Engineeringen
dc.contributor.supervisorLokki, Tapio, Prof., Aalto University, Department of Information and Communications Engineering, Finland
dc.date.accessioned2024-07-11T09:00:22Z
dc.date.available2024-07-11T09:00:22Z
dc.date.defence2024-07-26
dc.date.issued2024
dc.description.abstractAugmented reality (AR) telepresence systems aim to present visual and auditory "holograms" of conversation partners via head-mounted displays and transparent headphones. These systems require binaural audio that adapts not only to the user's orientation and position but also to their acoustic environment. Many fundamental technologies for such real-time, binaural auralization systems have been developed over the years. These virtual acoustic systems were often tested in direct comparison to a high-quality reference rendering, so the implied objective for the system's development was often indistinguishability from a reference. However, differences were usually audible in such tests, at least for non-ideal, practically relevant systems. When developing future AR systems, two questions arise: "Why exactly do such discrepancies occur?" and "What are meaningful objectives and evaluation paradigms other than indistinguishability from a reference?" First, finding reasons for discrepancies involves a detailed understanding of specific rendering methods, underlying models, and their violations. Two fundamental properties of a parametric spatial room impulse response processing technique are studied as examples. Second, as an objective that leads to meaningful AR evaluation paradigms, one option is to assess if auditory illusions are evoked, i.e., whether a listener believes a virtual sound source to be real. This work introduces the transfer-plausibility paradigm, which evaluates if a virtual source creates an auditory illusion, even in the presence of other, real sound sources. In summary, Publication I and Publication II discuss fundamental properties of spatial room impulse response processing techniques: Publication I shows how direction-of-arrival estimation based on the pseudo intensity vector depends on anisotropy in the late reverberation. Publication II investigates how perceptual roughness can occur in spatial room impulse response rendering based on broadband directional assignment. Publication III and Publication IV deal with problems more closely related to AR. Publication III proposes an approach for blind spatial room impulse response estimation using a pseudo-reference signal. Publication IV demonstrates auditory modeling-based quantification of impairments caused by so-called transparent headphones used for AR. Publication V and Publication VI introduce the notion of transfer-plausibility and compare it against other paradigms. The results suggest that even non-ideal virtual acoustic renderings are comparable in transfer-plausibility tests. Publication VII presents an experiment about the inability for self-localization using position-dependent room acoustic differences. The thesis concludes by presenting opportunities for future transfer-plausibility tests and a proposed model for describing differences in experimental paradigms by their sensitivity to auditory similarity, context, and artifacts.en
dc.format.extent89 + app. 101
dc.identifier.isbn978-952-64-1913-8 (electronic)
dc.identifier.isbn978-952-64-1912-1 (printed)
dc.identifier.issn1799-4942 (electronic)
dc.identifier.issn1799-4934 (printed)
dc.identifier.issn1799-4934 (ISSN-L)
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/129514
dc.identifier.urnURN:ISBN:978-952-64-1913-8
dc.language.isoenen
dc.opnPörschmann, Christoph, Prof., Technische Hochschule Köln, Germany
dc.publisherAalto Universityen
dc.publisherAalto-yliopistofi
dc.relation.haspart[Publication 1]: Meyer-Kahlen, N., Schlecht, S.J. Directional distribution of the pseudo intensity vector in anisotropic late reverberation. The Journal of the Acoustical Society of America, 155(2), 1515–1526, February 2024. Full text in Acris/Aaltodoc: https://urn.fi/URN:NBN:fi:aalto-202403062615. DOI: 10.1121/10.0024960
dc.relation.haspart[Publication 2]: Meyer-Kahlen, N., Schlecht, S.J., Lokki, T. Perceptual roughness of spatially assigned sparse noise for rendering reverberation. The Journal of the Acoustical Society of America, 150(5), 3521–3531, November 2021. Full text in Acris/Aaltodoc: https://urn.fi/URN:NBN:fi:aalto-202204062741. DOI: 10.1121/10.0007048
dc.relation.haspart[Publication 3]: Meyer-Kahlen, N., Schlecht, S.J. Blind directional room impulse response parameterization from relative transfer functions. International Workshop on Acoustic Signal Enhancement (IWAENC), Bamberg, Germany, September 2022. Full text in Acris/Aaltodoc: https://urn.fi/URN:NBN:fi:aalto-202301181211. DOI: 10.1109/IWAENC53105.2022.9914706
dc.relation.haspart[Publication 4]: Lladó, P., McKenzie, T., Meyer-Kahlen, N., Schlecht, S.J. Predicting perceptual transparency of head-worn devices. The Journal of the Audio Engineering Society, 70 (7), 585–600, July/August 2022. Full text in Acris/Aaltodoc: https://urn.fi/URN:NBN:fi:aalto-202208174886. DOI: 10.17743/jaes.2022.0024
dc.relation.haspart[Publication 5]: Wirler, S., Meyer-Kahlen, N, Schlecht, S.J. Towards transfer-plausibility for evaluating mixed reality audio in complex scenes. In AES International Conference on Audio for Virtual and Augmented Reality, Remote, August 2020.
dc.relation.haspart[Publication 6]: Meyer-Kahlen, N, Schlecht, S.J., Amengual Garí, S., Lokki, T. Testing auditory illusions in augmented reality: plausibility, transfer-plausibility and authenticity. The Journal of the Audio Engineering Society, Submitted, 2024.
dc.relation.haspart[Publication 7]: Meyer-Kahlen, N., Schlecht, S.J., Lokki, T. Clearly audible differences rarely reveal where you are in a room. The Journal of the Acoustical Society of America, 152 (2), 877–887, August 2022.
dc.relation.ispartofseriesAalto University publication series DOCTORAL THESESen
dc.relation.ispartofseries139/2024
dc.revBrinkmann, Fabian, Dr., Technische Universität Berlin, Germany
dc.revPike, Chris, Dr., Sonos, Inc, UK
dc.subject.keywordvirtual acousticsen
dc.subject.keywordaugmented realityen
dc.subject.keywordSpatial Room Impulse Response Processingen
dc.subject.keywordroom acousticsen
dc.subject.keywordplausibilityen
dc.subject.otherAcousticsen
dc.subject.otherComputer scienceen
dc.titleTransfer-Plausible Acoustics for Augmented Realityen
dc.typeG5 Artikkeliväitöskirjafi
dc.type.dcmitypetexten
dc.type.ontasotDoctoral dissertation (article-based)en
dc.type.ontasotVäitöskirja (artikkeli)fi
local.aalto.acrisexportstatuschecked 2024-08-08_1313
local.aalto.archiveyes
local.aalto.formfolder2024_07_10_klo_14_38
local.aalto.infraAalto Acoustics Lab
local.aalto.infraMAGICS

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
isbn9789526419138.pdf
Size:
17.1 MB
Format:
Adobe Portable Document Format