Towards Efficient Multi-Modal Emotion Recognition. (18th January 2013)
- Record Type:
- Journal Article
- Title:
- Towards Efficient Multi-Modal Emotion Recognition. (18th January 2013)
- Main Title:
- Towards Efficient Multi-Modal Emotion Recognition
- Authors:
- Dobrišek, Simon
Gajšek, Rok
Mihelič, France
Pavešić, Nikola
Štruc, Vitomir - Abstract:
- The paper presents a multi-modal emotion recognition system exploiting audio and video (i.e., facial expression) information. The system first processes both sources of information individually to produce corresponding matching scores and then combines the computed matching scores to obtain a classification decision. For the video part of the system, a novel approach to emotion recognition, relying on image-set matching, is developed. The proposed approach avoids the need for detecting and tracking specific facial landmarks throughout the given video sequence, which represents a common source of error in video-based emotion recognition systems, and, therefore, adds robustness to the video processing chain. The audio part of the system, on the other hand, relies on utterance-specific Gaussian Mixture Models (GMMs) adapted from a Universal Background Model (UBM) via the maximum a posteriori probability (MAP) estimation. It improves upon the standard UBM-MAP procedure by exploiting gender information when building the utterance-specific GMMs, thus ensuring enhanced emotion recognition performance. Both the uni-modal parts as well as the combined system are assessed on the challenging multi-modal eNTERFACE'05 corpus with highly encouraging results. The developed system represents a feasible solution to emotion recognition that can easily be integrated into various systems, such as humanoid robots, smart surveillance systems and alike.
- Is Part Of:
- International journal of advanced robotic systems. Volume 10:Number 1(2013)
- Journal:
- International journal of advanced robotic systems
- Issue:
- Volume 10:Number 1(2013)
- Issue Display:
- Volume 10, Issue 1 (2013)
- Year:
- 2013
- Volume:
- 10
- Issue:
- 1
- Issue Sort Value:
- 2013-0010-0001-0000
- Page Start:
- Page End:
- Publication Date:
- 2013-01-18
- Subjects:
- Emotion Recognition -- Video Processing -- Speech Processing -- Canonical Correlations -- GMM-UBM
Robotics -- Periodicals
Robotics
Periodicals
629.892 - Journal URLs:
- http://arx.sagepub.com/ ↗
http://search.epnet.com/direct.asp?db=bch&jid=13CR&scope=site ↗
http://www.intechweb.org/journal.php?id=3 ↗
http://www.uk.sagepub.com/home.nav ↗ - DOI:
- 10.5772/54002 ↗
- Languages:
- English
- ISSNs:
- 1729-8806
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 24530.xml