Weighted spectral features based on local Hu moments for speech emotion recognition. (April 2015)
- Record Type:
- Journal Article
- Title:
- Weighted spectral features based on local Hu moments for speech emotion recognition. (April 2015)
- Main Title:
- Weighted spectral features based on local Hu moments for speech emotion recognition
- Authors:
- Sun, Yaxin
Wen, Guihua
Wang, Jiabing - Abstract:
- Highlights: HuWSF is computed from local regions of a spectrogram by Hu moments. HuWSF can evaluates how the energy aggregate to some frequencies in a spectrogram. HuWSF extracts the features among neighbor coefficients of Mel filters of a frame. HuWSF extracts the features among coefficients of Mel filters of neighbor frames. HuWSF can reduce the changes brought by sentences, speakers and speaking styles. Abstract: Features greatly influence the results of speech emotion recognition, among which Mel-frequency Cepstral Coefficients (MFCC) is the most commonly used in speech emotion. However, MFCC does not consider both the relationship among neighbor coefficients of Mel filters of a frame and the relationship among coefficients of Mel filters of neighbor frames, which possibly leads to lose many useful features from spectrogram. This paper presents novel weighted spectral features based on Local Hu moments. The idea is motivated by that the energy on spectrogram would drastically vary with some emotion types such as angry and happy, while it would slightly change with other emotion types such as sadness and fear. This phenomenon would affect the local energy distribution of spectrogram in both time axis and frequency axis of spectrogram. To describe local energy distribution of spectrogram, Hu moments computed from local regions of spectrogram are used, as Hu moments can evaluate the degree how the energy is concentrated to the center of energy gravity of local region ofHighlights: HuWSF is computed from local regions of a spectrogram by Hu moments. HuWSF can evaluates how the energy aggregate to some frequencies in a spectrogram. HuWSF extracts the features among neighbor coefficients of Mel filters of a frame. HuWSF extracts the features among coefficients of Mel filters of neighbor frames. HuWSF can reduce the changes brought by sentences, speakers and speaking styles. Abstract: Features greatly influence the results of speech emotion recognition, among which Mel-frequency Cepstral Coefficients (MFCC) is the most commonly used in speech emotion. However, MFCC does not consider both the relationship among neighbor coefficients of Mel filters of a frame and the relationship among coefficients of Mel filters of neighbor frames, which possibly leads to lose many useful features from spectrogram. This paper presents novel weighted spectral features based on Local Hu moments. The idea is motivated by that the energy on spectrogram would drastically vary with some emotion types such as angry and happy, while it would slightly change with other emotion types such as sadness and fear. This phenomenon would affect the local energy distribution of spectrogram in both time axis and frequency axis of spectrogram. To describe local energy distribution of spectrogram, Hu moments computed from local regions of spectrogram are used, as Hu moments can evaluate the degree how the energy is concentrated to the center of energy gravity of local region of spectrogram and can significantly vary with the speech emotion types. The conducted experiments validate the proposed features in terms of the effectiveness of the speech emotion recognition. … (more)
- Is Part Of:
- Biomedical signal processing and control. Volume 18(2015)
- Journal:
- Biomedical signal processing and control
- Issue:
- Volume 18(2015)
- Issue Display:
- Volume 18, Issue 2015 (2015)
- Year:
- 2015
- Volume:
- 18
- Issue:
- 2015
- Issue Sort Value:
- 2015-0018-2015-0000
- Page Start:
- 80
- Page End:
- 90
- Publication Date:
- 2015-04
- Subjects:
- Speech emotion recognition -- Speech spectral features -- Feature extraction -- Hu moments
Signal processing -- Periodicals
Biomedical engineering -- Periodicals
Signal Processing, Computer-Assisted -- Periodicals
Image Processing, Computer-Assisted -- Periodicals
Biomedical Engineering -- Periodicals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/17468094 ↗
http://www.elsevier.com/journals ↗
http://www.sciencedirect.com/science?_ob=PublicationURL&_tockey=%23TOC%2329675%232006%23999989998%23626449%23FLA%23&_cdi=29675&_pubType=J&_auth=y&_acct=C000045259&_version=1&_urlVersion=0&_userid=836873&md5=664b5cf9a57fc91971a17faf20c32ec1 ↗ - DOI:
- 10.1016/j.bspc.2014.10.008 ↗
- Languages:
- English
- ISSNs:
- 1746-8094
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2087.880400
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 7364.xml