Unsupervised Estimation of Nonlinear Audio Effects: Comparing Diffusion-based and Adversarial Approaches

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorMoliner Juanpere, Eloi
dc.contributor.authorŠvento, Michal
dc.contributor.authorWright, Alec
dc.contributor.authorJuvela, Lauri
dc.contributor.authorRajmic, Pavel
dc.contributor.authorVälimäki, Vesa
dc.contributor.departmentDepartment of Information and Communications Engineeringen
dc.contributor.groupauthorAudio Signal Processingen
dc.contributor.groupauthorSpeech Synthesisen
dc.contributor.organizationBrno University of Technology
dc.contributor.organizationUniversity of Edinburgh
dc.date.accessioned2025-09-23T13:46:45Z
dc.date.available2025-09-23T13:46:45Z
dc.date.issued2025
dc.description.abstractAccurately estimating nonlinear audio effects without access to paired input-output signals remains a challenging problem. This work studies unsupervised probabilistic approaches for solving this task. We introduce a method, novel for this application, based on diffusion generative models for blind system identification, en- abling the estimation of unknown nonlinear effects using black- and gray-box models. This study compares this method with a previously proposed adversarial approach, analyzing the perfor- mance of both methods under different parameterizations of the effect operator and varying lengths of available effected record- ings. Through experiments on guitar distortion effects, we show that the diffusion-based approach provides more stable results and is less sensitive to data availability, while the adversarial approach is superior at estimating more pronounced distortion effects. Our findings contribute to the robust unsupervised blind estimation of audio effects, demonstrating the potential of diffusion models for system identification in music technology.en
dc.description.versionPeer revieweden
dc.format.extent8
dc.format.mimetypeapplication/pdf
dc.identifier.citationMoliner Juanpere, E, Švento, M, Wright, A, Juvela, L, Rajmic, P & Välimäki, V 2025, Unsupervised Estimation of Nonlinear Audio Effects: Comparing Diffusion-based and Adversarial Approaches. in Proceedings of the 28th International Conference on Digital Audio Effects. Proceedings of the International Conference on Digital Audio Effects, DAFx, pp. 366-373, International Conference on Digital Audio Effects, Ancona, Italy, 02/09/2025. < https://www.dafx.de/paper-archive/2025/DAFx25_paper_75.pdf >en
dc.identifier.issn2413-6700
dc.identifier.issn2413-6689
dc.identifier.otherPURE UUID: b6fc3c6e-0920-477c-8354-b949e035f74b
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/b6fc3c6e-0920-477c-8354-b949e035f74b
dc.identifier.otherPURE LINK: https://dafx25.dii.univpm.it/wp-content/uploads/2025/09/DAFx25Proceedings.pdf
dc.identifier.otherPURE LINK: https://www.dafx.de/paper-archive/2025/DAFx25_paper_75.pdf
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/196207702/Unsupervised_estimation_of_nonlinear_audio_effects.pdf
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/139136
dc.identifier.urnURN:NBN:fi:aalto-202509237334
dc.language.isoenen
dc.relation.ispartofInternational Conference on Digital Audio Effectsen
dc.relation.ispartofseriesProceedings of the 28th International Conference on Digital Audio Effectsen
dc.relation.ispartofseriespp. 366-373en
dc.relation.ispartofseriesProceedings of the International Conference on Digital Audio Effectsen
dc.rightsopenAccessen
dc.rightsCC BY
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.titleUnsupervised Estimation of Nonlinear Audio Effects: Comparing Diffusion-based and Adversarial Approachesen
dc.typeA4 Artikkeli konferenssijulkaisussafi
dc.type.versionpublishedVersion

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Unsupervised_estimation_of_nonlinear_audio_effects.pdf
Size:
792.59 KB
Format:
Adobe Portable Document Format