Subjective Evaluation of Basic Emotions from Audio–Visual Data

Access rights

openAccess

A1 Original article in a scientific journal

Date

2022-07-01

Language

en

Series

Sensors, Volume 22, issue 13

Abstract

Understanding how humans perceive emotions or affective states is important for developing emotion-aware systems that work in realistic scenarios. In this paper, the perception of emotions in naturalistic human interaction (audio–visual data) is studied using perceptual evaluation. For this purpose, a naturalistic audio–visual emotion database collected from TV broadcasts such as soap operas and movies, called the IIIT-H Audio–Visual Emotion (IIIT-H AVE) database, is used. The database consists of audio-alone, video-alone, and audio–visual data in English. Using data of all three modes, perceptual tests are conducted for four basic emotions (angry, happy, neutral, and sad) based on category labeling and for two dimensions, namely arousal (active or passive) and valence (positive or negative), based on dimensional labeling. The results indicated that the participants’ perception of emotions was remarkably different between the audio-alone, video-alone, and audio–visual data. This finding emphasizes the importance of emotion-specific features compared to commonly used features in the development of emotion-aware systems.

Description

Funding Information: This research was partly funded by the Academy of Finland (grant number 313390). Publisher Copyright: © 2022 by the authors. Licensee MDPI, Basel, Switzerland.

Keywords

emotion analysis, emotion recognition, emotion synthesis, feature extraction, naturalistic audio–visual emotion database

Citation

Kadiri, S R & Alku, P 2022, 'Subjective Evaluation of Basic Emotions from Audio–Visual Data', Sensors, vol. 22, no. 13, 4931. https://doi.org/10.3390/s22134931