Estimation and Restoration of Unknown Nonlinear Distortion Using Diffusion
Loading...
Access rights
openAccess
publishedVersion
URL
Journal Title
Journal ISSN
Volume Title
A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä
This publication is imported from Aalto University research portal.
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Unless otherwise stated, all rights belong to the author. You may download, display and print this publication for Your own personal use. Commercial use is prohibited.
Date
Major/Subject
Mcode
Degree programme
Language
en
Pages
14
Series
AES: Journal of the Audio Engineering Society, Volume 73, issue 9, pp. 519-532
Abstract
The restoration of nonlinearly distorted audio signals, alongside the identification of the applied memoryless nonlinear operation, is studied. The paper focuses on the difficult but practically important case in which both the nonlinearity and the original input signal are unknown. The proposed method uses a generative diffusion model trained unconditionally on guitar or speech signals to jointly model and invert the nonlinear system at inference time. Both the memoryless nonlinear function model and the restored audio signal are obtained as output. Examples of successful blind estimation of hard and soft-clipping, digital quantization, half-wave rectification, and wavefolding nonlinearities are presented. The results suggest that, out of the nonlinear functions tested here, the Cubic Catmull-Rom spline is best suited to approximating these nonlinearities. In the case of guitar recordings, comparisons with informed and supervised restoration methods show that the proposed blind method is at least as good as they are in terms of objective metrics. Experiments on distorted speech show that the proposed blind method outperforms general-purpose speech enhancement techniques and restores the original voice quality. The proposed method can be applied to memoryless audio effects modeling, restoration of music and speech recordings, and characterization of analog recording media.Description
Publisher Copyright: © 2025, Audio Engineering Society. All rights reserved.
Keywords
Other note
Citation
Švento, M, Moliner Juanpere, E, Juvela, L, Wright, A & Välimäki, V 2025, 'Estimation and Restoration of Unknown Nonlinear Distortion Using Diffusion', AES: Journal of the Audio Engineering Society, vol. 73, no. 9, pp. 519-532. https://doi.org/10.17743/jaes.2022.0221