2-level hierarchical depression recognition method based on task-stimulated and integrated speech features. (February 2022)
- Record Type:
- Journal Article
- Title:
- 2-level hierarchical depression recognition method based on task-stimulated and integrated speech features. (February 2022)
- Main Title:
- 2-level hierarchical depression recognition method based on task-stimulated and integrated speech features
- Authors:
- Xing, Yujuan
Liu, Zhenyu
Li, Gang
Ding, ZhiJie
Hu, Bin - Abstract:
- Highlights: A hierarchical classification model was designed considering the task-stimulated features and integrated features for better recognition performance. I-vector was used to solve the variable length problem of frame level features and overcome speaker and channel variability effects. The effectiveness of hierarchical classification was verified on different features and their combinations. Gender-independent and gender-dependent experiments were carried out to test the gender influence on our method. Abstract: Depression had been paid more and more attention by researchers because of its high prevalence, recurrence, disability and mortality. Speech depression recognition had become a research hotspot due to its advantages of non-invasiveness and easy access to data. However, the problems such as the speech variation in different emotional stimulus, gender impact, the speaker and channel variation and the variable length of frame feature, would have a great impact on recognition performance. In order to solve these problems, a novel 2-level hierarchical depression recognition method was proposed in this paper. It contained two stages. In 1 st -level classification stage, i-vectors were extracted based on spectral features, prosodic features, formants and voice quality of speech segments in different task stimulus respectively. Then, support vector machine (SVM) and random forest (RF) were used to obtain primary results. In the stage of 2 nd -level classification,Highlights: A hierarchical classification model was designed considering the task-stimulated features and integrated features for better recognition performance. I-vector was used to solve the variable length problem of frame level features and overcome speaker and channel variability effects. The effectiveness of hierarchical classification was verified on different features and their combinations. Gender-independent and gender-dependent experiments were carried out to test the gender influence on our method. Abstract: Depression had been paid more and more attention by researchers because of its high prevalence, recurrence, disability and mortality. Speech depression recognition had become a research hotspot due to its advantages of non-invasiveness and easy access to data. However, the problems such as the speech variation in different emotional stimulus, gender impact, the speaker and channel variation and the variable length of frame feature, would have a great impact on recognition performance. In order to solve these problems, a novel 2-level hierarchical depression recognition method was proposed in this paper. It contained two stages. In 1 st -level classification stage, i-vectors were extracted based on spectral features, prosodic features, formants and voice quality of speech segments in different task stimulus respectively. Then, support vector machine (SVM) and random forest (RF) were used to obtain primary results. In the stage of 2 nd -level classification, the results of tasks with significant accuracy differences were aggregated into new integrated features. The final result was achieved on new features by SVM. Our experiments were based on the depression speech database of the Gansu Provincial Key Laboratory of Wearable Computing. The experimental results showed that the proposed method had achieved good results in both gender-independent and gender-dependent experiments. Compared with baseline method and bagging classification, the highest accuracy of our method was raised by 9.62% and 9.49% respectively in gender-independent experiments, and F1 score also got improvement obviously. The results also showed that our method had better robustness on gender effect. … (more)
- Is Part Of:
- Biomedical signal processing and control. Volume 72(2022)Part A
- Journal:
- Biomedical signal processing and control
- Issue:
- Volume 72(2022)Part A
- Issue Display:
- Volume 72, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 72
- Issue:
- 2022
- Issue Sort Value:
- 2022-0072-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-02
- Subjects:
- Depression recognition -- I-vector -- Speech task stimulus -- Hierarchical classification -- Bagging classification
Signal processing -- Periodicals
Biomedical engineering -- Periodicals
Signal Processing, Computer-Assisted -- Periodicals
Image Processing, Computer-Assisted -- Periodicals
Biomedical Engineering -- Periodicals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/17468094 ↗
http://www.elsevier.com/journals ↗
http://www.sciencedirect.com/science?_ob=PublicationURL&_tockey=%23TOC%2329675%232006%23999989998%23626449%23FLA%23&_cdi=29675&_pubType=J&_auth=y&_acct=C000045259&_version=1&_urlVersion=0&_userid=836873&md5=664b5cf9a57fc91971a17faf20c32ec1 ↗ - DOI:
- 10.1016/j.bspc.2021.103287 ↗
- Languages:
- English
- ISSNs:
- 1746-8094
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2087.880400
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 20164.xml