Time-varying autoregressions for speaker verification in reverberant conditions
Access rights
openAccess
publishedVersion
A4 Article in conference proceedings
This publication is imported from the Aalto University research portal.
Language
en
Pages
5
Series
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Volume 2017-August, pp. 1512-1516, Interspeech: Annual Conference of the International Speech Communication Association
Abstract
In poor room acoustics, speech signals received by a microphone can be corrupted by delayed versions of themselves that are reflected from room surfaces (e.g. walls, floor). This phenomenon, reverberation, degrades the accuracy of automatic speaker verification systems by causing a mismatch between training and testing conditions. Since reverberation causes temporal smearing of the signal, one way to tackle its effects is robust feature extraction, particularly feature extraction based on long-term temporal processing. This approach has previously been adopted in the form of the 2-dimensional autoregressive (2DAR) feature extraction scheme, which uses frequency domain linear prediction (FDLP). In 2DAR, FDLP processing is followed by time domain linear prediction (TDLP). In the current study, we propose modifying the latter part of the 2DAR feature extraction scheme by replacing TDLP with time-varying linear prediction (TVLP) to add an extra layer of temporal processing. Our speaker verification experiments using the proposed features with the text-dependent RedDots corpus show small but consistent improvements in clean and reverberant conditions (up to 6.5%) over the 2DAR features and large improvements over the MFCC features in reverberant conditions (up to 46.5%).
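To make the idea of time-varying linear prediction concrete, the following is a minimal sketch of a generic least-squares TVLP fit. It is not the authors' implementation and it does not reproduce the 2DAR pipeline (no FDLP stage, no feature post-processing). It assumes the predictor x[n] ≈ Σ_k a_k[n] x[n-k], with each coefficient trajectory a_k[n] expanded in a low-order polynomial basis over normalized time. The function name `tvlp_coefficients` and the parameters `order` and `basis_degree` are hypothetical and chosen only for illustration.

```python
import numpy as np

def tvlp_coefficients(x, order=2, basis_degree=1):
    """Illustrative least-squares TVLP (assumed formulation, not the paper's code).

    Model: x[n] ~ sum_k a_k[n] * x[n-k], k = 1..order, where
    a_k[n] = sum_i b[k-1, i] * t[n]**i and t[n] = n / (N - 1).
    Returns b (order x (basis_degree+1)) and a (order x N).
    """
    x = np.asarray(x, dtype=float)
    N = len(x)
    n = np.arange(order, N)          # samples that have a full prediction history
    t = n / (N - 1)                  # normalized time in [0, 1]

    # One regressor column per (lag k, basis power i): x[n-k] * t**i
    cols = [x[n - k] * t**i
            for k in range(1, order + 1)
            for i in range(basis_degree + 1)]
    A = np.stack(cols, axis=1)

    # Minimize the squared prediction error over the basis weights
    b, *_ = np.linalg.lstsq(A, x[n], rcond=None)
    b = b.reshape(order, basis_degree + 1)

    # Evaluate the coefficient trajectories a_k[n] on the full time axis
    t_full = np.arange(N) / (N - 1)
    powers = t_full[None, :] ** np.arange(basis_degree + 1)[:, None]
    a = b @ powers
    return b, a
```

With `basis_degree=0` the fit reduces to conventional (time-invariant) linear prediction solved by least squares over the whole segment, which is one way to see TVLP as a generalization of the TDLP step mentioned in the abstract.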
Citation
Vestman, V, Gowda, D, Sahidullah, M, Alku, P & Kinnunen, T 2017, Time-varying autoregressions for speaker verification in reverberant conditions. in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. vol. 2017-August, Interspeech: Annual Conference of the International Speech Communication Association, International Speech Communication Association (ISCA), pp. 1512-1516, Interspeech, Stockholm, Sweden, 20/08/2017. https://doi.org/10.21437/Interspeech.2017-734