Analysis of phonation onsets in vowel production, using information from glottal area and flow estimate

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorMurtola, Tiina
dc.contributor.authorMalinen, Jarmo
dc.contributor.authorGeneid, Ahmed
dc.contributor.authorAlku, Paavo
dc.contributor.departmentDept Signal Process and Acoust
dc.contributor.departmentDepartment of Mathematics and Systems Analysis
dc.contributor.departmentUniversity of Helsinki
dc.contributor.departmentSpeech Communication Technology
dc.date.accessioned2019-05-06T09:16:02Z
dc.date.available2019-05-06T09:16:02Z
dc.date.embargoinfo:eu-repo/date/embargoEnd/2021-05-02
dc.date.issued2019-05-01
dc.description.abstractA multichannel dataset comprising high-speed videoendoscopy images, and electroglottography and free-field microphone signals, was used to investigate phonation onsets in vowel production. Use of the multichannel data enabled simultaneous analysis of the two main aspects of phonation, glottal area, extracted from the high-speed videoendoscopy images, and glottal flow, estimated from the microphone signal using glottal inverse filtering. Pulse-wise parameterization of the glottal area and glottal flow indicate that there is no single dominant way to initiate quasi-stable phonation. The trajectories of fundamental frequency and normalized amplitude quotient, extracted from glottal area and estimated flow, may differ markedly during onsets. The location and steepness of the amplitude envelopes of the two signals were observed to be closely related, and quantitative analysis supported the hypothesis that glottal area and flow do not carry essentially different amplitude information during vowel onsets. Linear models wereused to predict the phonation onset times from the characteristics of the subsequent steady phonation. The phonation onset time of glottal area was found to have good predictability from a combination of the fundamental frequency and the normalized amplitude quotient of the glottal flow, as well as the gender of the speaker. For the phonation onset time of glottal flow, the best linear model was obtained using the fundamental frequency and the normalized amplitude quotient of the glottal flow as predictors.en
dc.description.versionPeer revieweden
dc.format.extent24
dc.format.extent55-65
dc.format.mimetypeapplication/pdf
dc.identifier.citationMurtola , T , Malinen , J , Geneid , A & Alku , P 2019 , ' Analysis of phonation onsets in vowel production, using information from glottal area and flow estimate ' , Speech Communication , vol. 109 , pp. 55-65 . https://doi.org/10.1016/j.specom.2019.03.007en
dc.identifier.doi10.1016/j.specom.2019.03.007
dc.identifier.issn0167-6393
dc.identifier.issn1872-7182
dc.identifier.otherPURE UUID: 81aaea8f-cf2f-4f82-9dd4-856b6dac31b3
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/81aaea8f-cf2f-4f82-9dd4-856b6dac31b3
dc.identifier.otherPURE LINK: http://www.scopus.com/inward/record.url?scp=85063761258&partnerID=8YFLogxK
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/32811936/ELEC_murtola_et_al_speec_communication.pdf
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/37706
dc.identifier.urnURN:NBN:fi:aalto-201905062825
dc.language.isoenen
dc.publisherElsevier
dc.relation.ispartofseriesSpeech Communicationen
dc.relation.ispartofseriesVolume 109en
dc.rightsopenAccessen
dc.subject.keywordphonation onset
dc.subject.keywordvowel production
dc.subject.keywordhigh-speed videoendoscopy
dc.subject.keywordglottal inverse filtering
dc.titleAnalysis of phonation onsets in vowel production, using information from glottal area and flow estimateen
dc.typeA1 Alkuperäisartikkeli tieteellisessä aikakauslehdessäfi
Files