Excitation Features of Speech for Speaker-Specific Emotion Detection

Loading...
Thumbnail Image
Journal Title
Journal ISSN
Volume Title
A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä
Date
2020-01-01
Major/Subject
Mcode
Degree programme
Language
en
Pages
10
60382-60391
Series
IEEE Access, Volume 8
Abstract
In this article, we study emotion detection from speech in a speaker-specific scenario. By parameterizing the excitation component of voiced speech, the study explores deviations between emotional speech (e.g., speech produced in anger, happiness, sadness, etc.) and neutral speech (i.e., non-emotional) to develop an automatic emotion detection system. The excitation features used in this study are the instantaneous fundamental frequency, the strength of excitation and the energy of excitation. The Kullback-Leibler (KL) distance is computed to measure the similarity between feature distributions of emotional and neutral speech. Based on the KL distance value between a test utterance and an utterance produced in a neutral state by the same speaker, a detection decision is made by the system. In the training of the proposed system, only three neutral utterances produced by the speaker were used, unlike in most existing emotion recognition and detection systems that call for large amounts of training data (both emotional and neutral) by several speakers. In addition, the proposed system is independent of language or lexical content. The system is evaluated using two databases of emotional speech. The performance of the proposed detection method is shown to be better than that of reference methods.
Description
Keywords
emotion detection, excitation source, Kullback-Leibler (KL) distance, linear prediction (LP) analysis, paralinguistics, Speech analysis, zero frequency filtering (ZFF)
Other note
Citation
Kadiri , S R & Alku , P 2020 , ' Excitation Features of Speech for Speaker-Specific Emotion Detection ' , IEEE Access , vol. 8 , 9046041 , pp. 60382-60391 . https://doi.org/10.1109/ACCESS.2020.2982954