Comparison of glottal closure instants detection algorithms for emotional speech

dc.contributor Aalto-yliopisto fi
dc.contributor Aalto University en
dc.contributor.author Kadiri, Sudarsana
dc.contributor.author Alku, Paavo
dc.contributor.author Yegnanarayana, Bayya
dc.date.accessioned 2020-10-02T06:22:06Z
dc.date.available 2020-10-02T06:22:06Z
dc.date.issued 2020-05
dc.identifier.citation Kadiri, S., Alku, P. & Yegnanarayana, B. 2020, Comparison of glottal closure instants detection algorithms for emotional speech. In 2020 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2020 - Proceedings, 9054737, Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, pp. 7379-7383, IEEE International Conference on Acoustics, Speech, and Signal Processing, Barcelona, Spain, 04/05/2020. https://doi.org/10.1109/ICASSP40776.2020.9054737 en
dc.identifier.isbn 978-1-5090-6631-5
dc.identifier.issn 1520-6149
dc.identifier.issn 2379-190X
dc.identifier.other PURE UUID: 0423d9b5-ea42-4245-9718-268997a15219
dc.identifier.other PURE ITEMURL: https://research.aalto.fi/en/publications/0423d9b5-ea42-4245-9718-268997a15219
dc.identifier.other PURE LINK: http://www.scopus.com/inward/record.url?scp=85091281973&partnerID=8YFLogxK
dc.identifier.other PURE FILEURL: https://research.aalto.fi/files/41344491/GCI_Emotion_ICASSP_2020_Kadiri.pdf
dc.identifier.uri https://aaltodoc.aalto.fi/handle/123456789/46762
dc.description open the publication when the article is available
dc.description.abstract In the production of voiced speech, epochs or glottal closure instants (GCIs) refer to the instants of significant excitation of the vocal tract. Extraction of GCIs is used as a pre-processing stage in many areas of speech technology, such as prosody modification, speech synthesis and voice source analysis. In the past decades, several GCI detection algorithms have been developed, and most of them provide excellent results for speech signals produced using the modal (normal) type of phonation. There are, however, no studies comparing multiple state-of-the-art GCI detection methods in emotional speech. In this paper, we compare six GCI detection algorithms using emotional speech and known evaluation metrics. We use the Berlin EMO-DB acted emotional speech database, which contains seven emotions and simultaneous electroglottography (EGG) recordings as ground truth. The results show that all six GCI detection algorithms perform best on speech of neutral emotion and that performance degrades particularly in emotions of high arousal (anger and joy). To improve the performance of GCI detection in emotional speech, the study underlines the importance of local average pitch period estimates. en
dc.format.extent 5
dc.format.extent 7379-7383
dc.format.mimetype application/pdf
dc.language.iso en en
dc.relation.ispartof IEEE International Conference on Acoustics, Speech and Signal Processing en
dc.relation.ispartofseries Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing en
dc.rights openAccess en
dc.title Comparison of glottal closure instants detection algorithms for emotional speech en
dc.type A4 Article in conference proceedings (A4 Artikkeli konferenssijulkaisussa) fi
dc.description.version Peer reviewed en
dc.contributor.department Dept Signal Process and Acoust
dc.contributor.department International Institute of Information Technology Hyderabad
dc.subject.keyword Emotions
dc.subject.keyword Epochs
dc.subject.keyword Excitation source
dc.subject.keyword Glottal Closure Instants
dc.subject.keyword Speech analysis
dc.identifier.urn URN:NBN:fi:aalto-202010025727
dc.identifier.doi 10.1109/ICASSP40776.2020.9054737
dc.type.version acceptedVersion


Files in this item

Files Size Format View

There are no open access files associated with this item.
