Perceptual Loss Function for Neural Modelling of Audio Systems

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorWright, Alecen_US
dc.contributor.authorVälimäki, Vesaen_US
dc.contributor.departmentDepartment of Signal Processing and Acousticsen
dc.contributor.groupauthorAudio Signal Processingen
dc.date.accessioned2020-07-03T11:06:24Z
dc.date.available2020-07-03T11:06:24Z
dc.date.issued2020-05-04en_US
dc.description.abstractThis work investigates alternate pre-emphasis filters used as part of the loss function during neural network training for nonlinear audio processing. In our previous work, the error-to-signal ratio loss function was used during network training, with a first-order highpass pre-emphasis filter applied to both the target signal and neural network output. This work considers more perceptually relevant pre-emphasis filters, which include lowpass filtering at high frequencies. We conducted listening tests to determine whether they offer an improvement to the quality of a neural network model of a guitar tube amplifier. Listening test results indicate that the use of an A-weighting pre-emphasis filter offers the best improvement among the tested filters. The proposed perceptual loss function improves the sound quality of neural network models in audio processing without affecting the computational cost.en
dc.description.versionPeer revieweden
dc.format.extent5
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationWright, A & Välimäki, V 2020, Perceptual Loss Function for Neural Modelling of Audio Systems. in 2020 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2020 - Proceedings., 9052944, Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, pp. 251-255, IEEE International Conference on Acoustics, Speech, and Signal Processing, Barcelona, Spain, 04/05/2020. https://doi.org/10.1109/ICASSP40776.2020.9052944en
dc.identifier.doi10.1109/ICASSP40776.2020.9052944en_US
dc.identifier.isbn978-1-5090-6631-5
dc.identifier.isbn978-1-5090-6632-2
dc.identifier.issn1520-6149
dc.identifier.issn2379-190X
dc.identifier.otherPURE UUID: 02aff20c-6fa4-41af-a3bd-01784f24e1f3en_US
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/02aff20c-6fa4-41af-a3bd-01784f24e1f3en_US
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/43946100/ICASSP_2020_RNN_Loss_Functions.pdf
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/45300
dc.identifier.urnURN:NBN:fi:aalto-202007034257
dc.language.isoenen
dc.relation.ispartofIEEE International Conference on Acoustics, Speech, and Signal Processingen
dc.relation.ispartofseries2020 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2020 - Proceedingsen
dc.relation.ispartofseriespp. 251-255en
dc.relation.ispartofseriesProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processingen
dc.rightsopenAccessen
dc.subject.keywordAcoustic signal processingen_US
dc.subject.keywordMusic technologyen_US
dc.subject.keywordPsychoacousticsen_US
dc.titlePerceptual Loss Function for Neural Modelling of Audio Systemsen
dc.typeA4 Artikkeli konferenssijulkaisussafi
dc.type.versionacceptedVersion

Files