Automatic analysis of speech F0 contour for the characterization of mood changes in bipolar patients. (March 2015)
- Record Type:
- Journal Article
- Title:
- Automatic analysis of speech F0 contour for the characterization of mood changes in bipolar patients. (March 2015)
- Main Title:
- Automatic analysis of speech F0 contour for the characterization of mood changes in bipolar patients
- Authors:
- Guidi, A.
Vanello, N.
Bertschy, G.
Gentili, C.
Landini, L.
Scilingo, E.P. - Abstract:
- Highlights: We present an automatic method for estimating features describing speech F0 contour. Analysis on acted emotional speeches distinguished high- from low-arousal emotions. Intra-subject analysis on bipolar patients found out differences between mood states. The results on bipolar patients are subject-specific and task-dependent. Results on controls confirmed a good specificity of the proposed features. Abstract: Bipolar disorders are characterized by a mood swing, ranging from mania to depression. A system that could monitor and eventually predict these changes would be useful to improve therapy and avoid dangerous events. Speech might convey relevant information about subjects' mood and there is a growing interest to study its changes in presence of mood disorders. In this work we present an automatic method to characterize fundamental frequency (F0) dynamics in voiced part of syllables. The method performs a segmentation of voiced sounds from running speech samples and estimates two categories of features. The first category is borrowed from Taylor's Tilt intonational model. However, the meaning of the proposed features is different from the meaning of Taylor's ones since the former are estimated from all voiced segments without performing any analysis of intonation. A second category of features takes into account the speed of change of F0. In this work, the proposed features are first estimated from an emotional speech database. Then, an analysis on speechHighlights: We present an automatic method for estimating features describing speech F0 contour. Analysis on acted emotional speeches distinguished high- from low-arousal emotions. Intra-subject analysis on bipolar patients found out differences between mood states. The results on bipolar patients are subject-specific and task-dependent. Results on controls confirmed a good specificity of the proposed features. Abstract: Bipolar disorders are characterized by a mood swing, ranging from mania to depression. A system that could monitor and eventually predict these changes would be useful to improve therapy and avoid dangerous events. Speech might convey relevant information about subjects' mood and there is a growing interest to study its changes in presence of mood disorders. In this work we present an automatic method to characterize fundamental frequency (F0) dynamics in voiced part of syllables. The method performs a segmentation of voiced sounds from running speech samples and estimates two categories of features. The first category is borrowed from Taylor's Tilt intonational model. However, the meaning of the proposed features is different from the meaning of Taylor's ones since the former are estimated from all voiced segments without performing any analysis of intonation. A second category of features takes into account the speed of change of F0. In this work, the proposed features are first estimated from an emotional speech database. Then, an analysis on speech samples acquired from eleven psychiatric patients experiencing different mood states, and eighteen healthy control subjects is introduced. Subjects had to perform a text reading task and a picture commenting task. The results of the analysis on the emotional speech database indicate that the proposed features can discriminate between high and low arousal emotions. This was verified both at single subject and group level. An intra-subject analysis was performed on bipolar patients and it highlighted significant changes of the features with different mood states, although this was not observed for all the subjects. The directions of the changes estimated for different patients experiencing the same mood swing, were not coherent and were task-dependent. Interestingly, a single-subject analysis performed on healthy controls and on bipolar patients recorded twice with the same mood label, resulted in a very small number of significant differences. In particular a very good specificity was highlighted for the Taylor-inspired features and for a subset of the second category of features, thus strengthening the significance of the results obtained with patients. Even if the number of enrolled patients is small, this work suggests that the proposed features might give a relevant contribution to the demanding research field of speech-based mood classifiers. Moreover, the results here presented indicate that a model of speech changes in bipolar patients might be subject-specific and that a richer characterization of subject status could be necessary to explain the observed variability. … (more)
- Is Part Of:
- Biomedical signal processing and control. Volume 17(2015)
- Journal:
- Biomedical signal processing and control
- Issue:
- Volume 17(2015)
- Issue Display:
- Volume 17, Issue 2015 (2015)
- Year:
- 2015
- Volume:
- 17
- Issue:
- 2015
- Issue Sort Value:
- 2015-0017-2015-0000
- Page Start:
- 29
- Page End:
- 37
- Publication Date:
- 2015-03
- Subjects:
- F0 contour analysis -- Bipolar disorders -- Emotional speech -- Voiced/unvoiced segmentation
Signal processing -- Periodicals
Biomedical engineering -- Periodicals
Signal Processing, Computer-Assisted -- Periodicals
Image Processing, Computer-Assisted -- Periodicals
Biomedical Engineering -- Periodicals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/17468094 ↗
http://www.elsevier.com/journals ↗
http://www.sciencedirect.com/science?_ob=PublicationURL&_tockey=%23TOC%2329675%232006%23999989998%23626449%23FLA%23&_cdi=29675&_pubType=J&_auth=y&_acct=C000045259&_version=1&_urlVersion=0&_userid=836873&md5=664b5cf9a57fc91971a17faf20c32ec1 ↗ - DOI:
- 10.1016/j.bspc.2014.10.011 ↗
- Languages:
- English
- ISSNs:
- 1746-8094
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2087.880400
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 570.xml