A novel spatio-temporal convolutional neural framework for multimodal emotion recognition. (September 2022)
- Record Type:
- Journal Article
- Title:
- A novel spatio-temporal convolutional neural framework for multimodal emotion recognition. (September 2022)
- Main Title:
- A novel spatio-temporal convolutional neural framework for multimodal emotion recognition
- Authors:
- Sharafi, Masoumeh
Yazdchi, Mohammadreza
Rasti, Reza
Nasimi, Fahimeh - Abstract:
- Abstract: Proposing a practical method for high-performance emotion recognition could facilitate human–computer interaction. Among existing methods, deep learning techniques have improved the performance of emotion recognition systems. In this work, a new multimodal neural design is presented wherein audio and visual data are combined as the input to a hybrid network comprised of a bidirectional long short term memory (BiLSTM) network and two convolutional neural networks (CNNs). The spatial and temporal features extracted from video frames are fused with Mel-Frequency Cepstral Coefficients (MFCCs) and energy features extracted from audio signals and BiLSTM network outputs. Finally, a Softmax classifier is used to classify inputs into the set of target categories. The proposed model is evaluated on Surrey Audio–Visual Expressed Emotion (SAVEE), Ryerson Audio–Visual Database of Emotional Speech and Song (RAVDESS), and Ryerson Multimedia research Lab (RML) databases. Experimental results on these datasets prove the effectiveness of the proposed model where it achieves the accuracy of 99.75%, 94.99%, and 99.23% for the SAVEE, RAVDESS, and RML databases, respectively. Our experimental study reveals that the suggested method is more effective than existing algorithms in adapting to emotion recognition in these datasets.
- Is Part Of:
- Biomedical signal processing and control. Volume 78(2022)
- Journal:
- Biomedical signal processing and control
- Issue:
- Volume 78(2022)
- Issue Display:
- Volume 78, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 78
- Issue:
- 2022
- Issue Sort Value:
- 2022-0078-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-09
- Subjects:
- Bidirectional long short term memory -- Convolutional neural network -- Deep learning -- Emotion recognition -- Mel-frequency cepstral coefficients
Signal processing -- Periodicals
Biomedical engineering -- Periodicals
Signal Processing, Computer-Assisted -- Periodicals
Image Processing, Computer-Assisted -- Periodicals
Biomedical Engineering -- Periodicals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/17468094 ↗
http://www.elsevier.com/journals ↗
http://www.sciencedirect.com/science?_ob=PublicationURL&_tockey=%23TOC%2329675%232006%23999989998%23626449%23FLA%23&_cdi=29675&_pubType=J&_auth=y&_acct=C000045259&_version=1&_urlVersion=0&_userid=836873&md5=664b5cf9a57fc91971a17faf20c32ec1 ↗ - DOI:
- 10.1016/j.bspc.2022.103970 ↗
- Languages:
- English
- ISSNs:
- 1746-8094
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2087.880400
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 23045.xml