Comparison of spectral tilt measures for sentence prominence in speech — Effects of dimensionality and adverse noise conditions

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorKakouros, Sofoklisen_US
dc.contributor.authorRäsänen, Okkoen_US
dc.contributor.authorAlku, Paavoen_US
dc.contributor.departmentDepartment of Signal Processing and Acousticsen
dc.contributor.groupauthorSpeech Communication Technologyen
dc.date.accessioned2018-09-11T14:08:42Z
dc.date.available2018-09-11T14:08:42Z
dc.date.embargoinfo:eu-repo/date/embargoEnd/2020-08-24en_US
dc.date.issued2018-10-01en_US
dc.description.abstractLinguistic prominence in speech is known to correlate with the acoustic measures of energy, F0, and duration. In contrast, the role of spectral tilt in the realization of prominence has remained more inconsistent between previous empirical investigations. This may be partially due to the lack of a standard method for quantifying spectral tilt or due to difficulties in estimating the acoustical source of spectral tilt, the glottal flow, from continuous speech. These issues have rendered interpretations and comparisons between studies difficult. In addition, (i) little is known about the robustness of tilt estimators for prominence detection in the case when speech is not clean but corrupted, as in real life, by environmental noise or telephone transmission (i.e. degradation caused by bandpass filtering and quantization noise). Moreover, (ii) little attention has been paid to multidimensional representations of source spectrum that can potentially incorporate more information about the phonation style than purely scalar measures. In this work, we study spectral tilt in signaling prominence in spoken Dutch and French under different levels of additive noise, and for telephone-band coded speech, and compare several one-dimensional tilt measures that have been previously encountered in the literature as well as multidimensional tilt measures. We also compare spectral tilt measures with other standard acoustic correlates for prominence, namely, energy, F0, and duration. Our results provide further empirical support for the finding that tilt is a systematic correlate of prominence in Dutch, that the role is smaller in French, and that energy, F0, and duration appear still to be the most robust features for discriminating prominent and non-prominent words. In addition, our results show that there are notable differences between different tilt measures at different levels of noise, and that multidimensional representations for tilt improve class separability from the scalar measures.en
dc.description.versionPeer revieweden
dc.format.extent16
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationKakouros, S, Räsänen, O & Alku, P 2018, 'Comparison of spectral tilt measures for sentence prominence in speech — Effects of dimensionality and adverse noise conditions', Speech Communication, vol. 103, pp. 11-26. https://doi.org/10.1016/j.specom.2018.08.002en
dc.identifier.doi10.1016/j.specom.2018.08.002en_US
dc.identifier.issn0167-6393
dc.identifier.issn1872-7182
dc.identifier.otherPURE UUID: e65b2bd4-f017-4f5b-b22f-62f25724ac19en_US
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/e65b2bd4-f017-4f5b-b22f-62f25724ac19en_US
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/27714977/ELEC_Kakouros_et_al_Comparison_of_spectral.pdfen_US
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/34011
dc.identifier.urnURN:NBN:fi:aalto-201809115119
dc.language.isoenen
dc.publisherElsevier
dc.relation.ispartofseriesSpeech Communicationen
dc.relation.ispartofseriesVolume 103, pp. 11-26en
dc.rightsopenAccessen
dc.subject.keywordProsodyen_US
dc.subject.keywordSentence prominenceen_US
dc.subject.keywordAcoustic measuresen_US
dc.subject.keywordSpectral tilten_US
dc.subject.keywordNoise robustnessen_US
dc.subject.keywordDNNen_US
dc.titleComparison of spectral tilt measures for sentence prominence in speech — Effects of dimensionality and adverse noise conditionsen
dc.typeA1 Alkuperäisartikkeli tieteellisessä aikakauslehdessäfi
dc.type.versionacceptedVersion

Files