Fusionsense: Emotion classification using feature fusion of multimodal data and deep learning in a brain-inspired spiking neural network

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorTan, Clarenceen_US
dc.contributor.authorCeballos, Gerardoen_US
dc.contributor.authorKasabov, Nikolaen_US
dc.contributor.authorSubramaniyam, Narayan Puthanmadamen_US
dc.contributor.departmentDepartment of Neuroscience and Biomedical Engineeringen
dc.contributor.organizationAuckland University of Technologyen_US
dc.contributor.organizationUniversidad de los Andes Méridaen_US
dc.date.accessioned2020-10-16T08:07:43Z
dc.date.available2020-10-16T08:07:43Z
dc.date.issued2020-09-02en_US
dc.description.abstractUsing multimodal signals to solve the problem of emotion recognition is one of the emerging trends in affective computing. Several studies have utilized state of the art deep learning methods and combined physiological signals, such as the electrocardiogram (EEG), electroencephalogram (ECG), skin temperature, along with facial expressions, voice, posture to name a few, in order to classify emotions. Spiking neural networks (SNNs) represent the third generation of neural networks and employ biologically plausible models of neurons. SNNs have been shown to handle Spatio-temporal data, which is essentially the nature of the data encountered in emotion recognition problem, in an efficient manner. In this work, for the first time, we propose the application of SNNs in order to solve the emotion recognition problem with the multimodal dataset. Specifically, we use the NeuCube framework, which employs an evolving SNN architecture to classify emotional valence and evaluate the performance of our approach on the MAHNOB-HCI dataset. The multimodal data used in our work consists of facial expressions along with physiological signals such as ECG, skin temperature, skin conductance, respiration signal, mouth length, and pupil size. We perform classification under the Leave-One-Subject-Out (LOSO) cross-validation mode. Our results show that the proposed approach achieves an accuracy of 73.15% for classifying binary valence when applying feature-level fusion, which is comparable to other deep learning methods. We achieve this accuracy even without using EEG, which other deep learning methods have relied on to achieve this level of accuracy. In conclusion, we have demonstrated that the SNN can be successfully used for solving the emotion recognition problem with multimodal data and also provide directions for future research utilizing SNN for Affective computing. In addition to the good accuracy, the SNN recognition system is requires incrementally trainable on new data in an adaptive way. It only one pass training, which makes it suitable for practical and on-line applications. These features are not manifested in other methods for this problem.en
dc.description.versionPeer revieweden
dc.format.extent27
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationTan, C, Ceballos, G, Kasabov, N & Subramaniyam, N P 2020, 'Fusionsense : Emotion classification using feature fusion of multimodal data and deep learning in a brain-inspired spiking neural network', Sensors, vol. 20, no. 18, 5328, pp. 1-27. https://doi.org/10.3390/s20185328en
dc.identifier.doi10.3390/s20185328en_US
dc.identifier.issn1424-8220
dc.identifier.otherPURE UUID: 250d11e4-b4d0-43d0-a370-f8423694fa8aen_US
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/250d11e4-b4d0-43d0-a370-f8423694fa8aen_US
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/51909423/Tan_FusionSense.sensors_20_05328.pdf
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/46944
dc.identifier.urnURN:NBN:fi:aalto-202010165841
dc.language.isoenen
dc.publisherMDPI AG
dc.relation.ispartofseriesSensorsen
dc.relation.ispartofseriesVolume 20, issue 18, pp. 1-27en
dc.rightsopenAccessen
dc.subject.keywordEvolving Spiking Neural Networks (eSNNs)en_US
dc.subject.keywordFacial emotion recognitionen_US
dc.subject.keywordMultimodal dataen_US
dc.subject.keywordNeuCubeen_US
dc.subject.keywordSpatio-temporal dataen_US
dc.titleFusionsense: Emotion classification using feature fusion of multimodal data and deep learning in a brain-inspired spiking neural networken
dc.typeA1 Alkuperäisartikkeli tieteellisessä aikakauslehdessäfi
dc.type.versionpublishedVersion

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Tan_FusionSense.sensors_20_05328.pdf
Size:
4.91 MB
Format:
Adobe Portable Document Format