Unsupervised Estimation of Nonlinear Audio Effects: Comparing Diffusion-based and Adversarial Approaches
| dc.contributor | Aalto-yliopisto | fi |
| dc.contributor | Aalto University | en |
| dc.contributor.author | Moliner Juanpere, Eloi | |
| dc.contributor.author | Švento, Michal | |
| dc.contributor.author | Wright, Alec | |
| dc.contributor.author | Juvela, Lauri | |
| dc.contributor.author | Rajmic, Pavel | |
| dc.contributor.author | Välimäki, Vesa | |
| dc.contributor.department | Department of Information and Communications Engineering | en |
| dc.contributor.groupauthor | Audio Signal Processing | en |
| dc.contributor.groupauthor | Speech Synthesis | en |
| dc.contributor.organization | Brno University of Technology | |
| dc.contributor.organization | University of Edinburgh | |
| dc.date.accessioned | 2025-09-23T13:46:45Z | |
| dc.date.available | 2025-09-23T13:46:45Z | |
| dc.date.issued | 2025 | |
| dc.description.abstract | Accurately estimating nonlinear audio effects without access to paired input-output signals remains a challenging problem. This work studies unsupervised probabilistic approaches for solving this task. We introduce a method, novel for this application, based on diffusion generative models for blind system identification, en- abling the estimation of unknown nonlinear effects using black- and gray-box models. This study compares this method with a previously proposed adversarial approach, analyzing the perfor- mance of both methods under different parameterizations of the effect operator and varying lengths of available effected record- ings. Through experiments on guitar distortion effects, we show that the diffusion-based approach provides more stable results and is less sensitive to data availability, while the adversarial approach is superior at estimating more pronounced distortion effects. Our findings contribute to the robust unsupervised blind estimation of audio effects, demonstrating the potential of diffusion models for system identification in music technology. | en |
| dc.description.version | Peer reviewed | en |
| dc.format.extent | 8 | |
| dc.format.mimetype | application/pdf | |
| dc.identifier.citation | Moliner Juanpere, E, Švento, M, Wright, A, Juvela, L, Rajmic, P & Välimäki, V 2025, Unsupervised Estimation of Nonlinear Audio Effects: Comparing Diffusion-based and Adversarial Approaches. in Proceedings of the 28th International Conference on Digital Audio Effects. Proceedings of the International Conference on Digital Audio Effects, DAFx, pp. 366-373, International Conference on Digital Audio Effects, Ancona, Italy, 02/09/2025. < https://www.dafx.de/paper-archive/2025/DAFx25_paper_75.pdf > | en |
| dc.identifier.issn | 2413-6700 | |
| dc.identifier.issn | 2413-6689 | |
| dc.identifier.other | PURE UUID: b6fc3c6e-0920-477c-8354-b949e035f74b | |
| dc.identifier.other | PURE ITEMURL: https://research.aalto.fi/en/publications/b6fc3c6e-0920-477c-8354-b949e035f74b | |
| dc.identifier.other | PURE LINK: https://dafx25.dii.univpm.it/wp-content/uploads/2025/09/DAFx25Proceedings.pdf | |
| dc.identifier.other | PURE LINK: https://www.dafx.de/paper-archive/2025/DAFx25_paper_75.pdf | |
| dc.identifier.other | PURE FILEURL: https://research.aalto.fi/files/196207702/Unsupervised_estimation_of_nonlinear_audio_effects.pdf | |
| dc.identifier.uri | https://aaltodoc.aalto.fi/handle/123456789/139136 | |
| dc.identifier.urn | URN:NBN:fi:aalto-202509237334 | |
| dc.language.iso | en | en |
| dc.relation.ispartof | International Conference on Digital Audio Effects | en |
| dc.relation.ispartofseries | Proceedings of the 28th International Conference on Digital Audio Effects | en |
| dc.relation.ispartofseries | pp. 366-373 | en |
| dc.relation.ispartofseries | Proceedings of the International Conference on Digital Audio Effects | en |
| dc.rights | openAccess | en |
| dc.rights | CC BY | |
| dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
| dc.title | Unsupervised Estimation of Nonlinear Audio Effects: Comparing Diffusion-based and Adversarial Approaches | en |
| dc.type | A4 Artikkeli konferenssijulkaisussa | fi |
| dc.type.version | publishedVersion |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Unsupervised_estimation_of_nonlinear_audio_effects.pdf
- Size:
- 792.59 KB
- Format:
- Adobe Portable Document Format