Low-Rank Room Impulse Response Estimation

Thumbnail Image

Access rights

openAccess

URL

Journal Title

Journal ISSN

Volume Title

A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä

Date

2023

Major/Subject

Mcode

Degree programme

Language

en

Pages

13
957-969

Series

IEEE/ACM Transactions on Audio Speech and Language Processing, Volume 31

Abstract

In this paper we consider low-rank estimation of room impulse responses (RIRs). Inspired by a physics-driven room-acoustical model, we propose an estimator of RIRs that promotes a low-rank structure for a matricization, or reshaping, of the estimated RIR. This low-rank prior acts as a regularizer for the inverse problem of estimating an RIR from input-output observations, preventing overfitting and improving estimation accuracy. As directly enforcing a low rank of the estimate results is an NP-hard problem, we consider two different relaxations, one using the nuclear norm, and one using the recently introduced concept of quadratic envelopes. Both relaxations allow for implementing the proposed estimator using a first-order algorithm with convergence guarantees. When evaluated on both synthetic and recorded RIRs, it is shown that under noisy output conditions, or when the spectral excitation of the input signal is poor, the proposed estimator outperforms comparable existing methods. The performance of the two low-rank relaxations methods is similar, but the quadratic envelope has the benefit of superior robustness to the choice of regularization hyperparameter in the case when the signal-to-noise ratio is unknown. The performance of the proposed method is compared to that of ordinary least squares, Tikhonov least squares, as well as the Cramér-Rao lower bound (CRLB).

Description

Publisher Copyright: © 2014 IEEE. | openaire: EC/H2020/773268/EU//SONORA

Keywords

Low-rank modeling, quadratic envelopes, room impulse responses

Other note

Citation

Jälmby , M , Elvander , F & Waterschoot , T V 2023 , ' Low-Rank Room Impulse Response Estimation ' , IEEE/ACM Transactions on Audio Speech and Language Processing , vol. 31 , pp. 957-969 . https://doi.org/10.1109/TASLP.2023.3240650