Deep Room Impulse Response Completion -- A Multiexponential Decay Approach
No Thumbnail Available
URL
Journal Title
Journal ISSN
Volume Title
Sähkötekniikan korkeakoulu |
Master's thesis
Authors
Date
2024-05-20
Department
Major/Subject
Acoustics and Audio Technology
Mcode
ELEC3030
Degree programme
CCIS - Master’s Programme in Computer, Communication and Information Sciences (TS2013)
Language
en
Pages
63+1
Series
Abstract
The acoustics of enclosed spaces, vital for applications like concert hall design, have been studied extensively since the 1960s. Emerging technologies such as augmented reality and virtual reality (AR/VR) have brought new demands on immersive audio, necessitating fast, realistic simulation of acoustics. Room Impulse Responses (RIRs), the acoustic fingerprint of a space, are used in the auralization of virtual spaces. However, generating accurate RIRs in real-time for dynamic AR/VR environments remains challenging. Traditional simulation methods such as geometrical acoustics and wave-based acoustics have known limitations in accuracy and computational cost. Hybrid methods combining geometrical acoustics and wave-based acoustics have been proposed, but they also face challenges, thus motivating the need for new RIR generation paradigms. This thesis introduces RIR completion, a novel RIR generation task to predict the remaining portion of an RIR from its early segment. A lightweight encoder-decoder style deep neural network, Deep Exponential Completion of Room impulse responses (DECOR), is proposed to efficiently complete RIRs. DECOR's highly informed decoder predicts the amplitude values of exponential decay curves within the multiexponential decay reverberation framework. Experimental results demonstrate DECOR's effectiveness and reliability compared to a state-of-the-art model, while achieving fewer audible artifacts and more than thirty-fold model size reduction. While this thesis focuses on deep learning-based implementation, it serves as an initial proof-of-concept for RIR completion, opening avenues for more refined future RIR completion methods.Description
Supervisor
Välimäki, VesaThesis advisor
Schlecht, SebastianGötz, Georg
Keywords
room impulse response, deep learning, room acoustics, room impulse response completion, multiexponential decay, generative acoustics