Audiovisual matching of room acoustics in virtual reality

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.advisorMeyer-Kahlen, Nils
dc.contributor.authorQiu, Lisha
dc.contributor.schoolSähkötekniikan korkeakoulufi
dc.contributor.schoolSchool of Electrical Engineeringen
dc.contributor.supervisorLokki, Tapio
dc.date.accessioned2025-12-16T18:03:28Z
dc.date.available2025-12-16T18:03:28Z
dc.date.issued2025-11-24
dc.description.abstractEnsuring audio-visual consistency is crucial for building a credible virtual reality (VR) and augmented reality (AR) experience, as perceptual inconsistency can weaken the sense of presence and reality. Although V-to-A matching has been studied in VR, where users adjust auditory rendering based on visual scenes, there are still many issues that have not been fully explored. This paper systematically studies the indoor acoustic matching performance under four experimental conditions: Visual-to-Audio (V-to-A), Audiovisual-to-Audio (AV-to-V), Audio-to-Audio (A-to-A), and Audio-to-visual (A-to-V) further address the current research gaps. A high-fidelity VR experiment was conducted using the measured spatial room impulse response and binaural audio presented by the panoramic visual scene. Our results demonstrate that matching performance improves significantly when acoustic references are available. In contrast, relying solely on V-to-A information leads to substantially poorer performance, whereas both AV-to-A and A-to-A conditions provide clear benefits. Furthermore, inconsistent source vision does not significantly undermine perceptual consistency. Although slight asymmetry was observed between the A-to-V and V-to-A tasks, it was not statistically significant. In terms of analysis, we determined that reverberation and clarity are the main auditory cues (loads on PC1) for perceiving the size and distance of space, while the secondary factor (PC2) is irrelevant in perception. This work provides a fundamental framework for designing a coherent spatial audio rendering system in immersive media, emphasizing the relative importance of auditory and visual references in achieving perceptual consistency in AR/VR.en
dc.format.extent45
dc.format.mimetypeapplication/pdfen
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/141219
dc.identifier.urnURN:NBN:fi:aalto-202512169328
dc.language.isoenen
dc.locationP1fi
dc.programmeMaster's Programme in Computer, Communication and Information Sciencesen
dc.programmeTieto-, tietoliikenne- ja informaatiotekniikan maisteriohjelmafi
dc.programmeMagisterprogrammet i data-, informations- och kommunikationstekniksv
dc.programme.majorAcoustics and Audio Technologyen
dc.subject.keywordaudio-visual matchingen
dc.subject.keywordVRen
dc.subject.keywordARen
dc.subject.keywordpsychoacousticsen
dc.subject.keywordroom acousticsen
dc.subject.keywordvirtual acousticsen
dc.titleAudiovisual matching of room acoustics in virtual realityen
dc.typeG2 Pro gradu, diplomityöfi
dc.type.ontasotMaster's thesisen
dc.type.ontasotDiplomityöfi
local.aalto.electroniconlyyes
local.aalto.openaccessno

Files