Deep dual-side learning ensemble model for Parkinson speech recognition. (August 2021)
- Record Type:
- Journal Article
- Title:
- Deep dual-side learning ensemble model for Parkinson speech recognition. (August 2021)
- Main Title:
- Deep dual-side learning ensemble model for Parkinson speech recognition
- Authors:
- Ma, Jie
Zhang, Yuanfan
Li, Yongming
Zhou, Lang
Qin, Lingyun
Zeng, Yuwei
Wang, Pin
Lei, Yan - Abstract:
- Highlights: A deep sample learning algorithm (DSL) is constructed for PD speech feature data. An embedded deep stack group sparse autoencoder (EGSAE) is designed for PD speech data. Deep dual-side learning ensemble model is constructed by combining EGSAE and DSL. Both the diagnosis and automatic assessment of treatment of PD speech data are considered. Abstract: Early diagnosis of Parkinson's disease (PD) is very important Kansara et al. (2013) and Stern (1993). In recent years, machine learning-based speech data analysis has been shown to be effective for diagnosing Parkinson's disease (PD) and automatically assessing rehabilitative speech treatment in PD Sakar et al. (2013), Tsanas et al. (2012) and Little et al. (2009). Machine learning includes feature learning and sample learning. Deep learning (deep feature learning) can generate high-level and high-quality features through deep feature transformation, improving classification accuracy. For reasons such as data collection, some samples have low quality for classification. Therefore, sample learning is necessary. Sample selection removes useless samples; therefore, deep sample learning is better, since it can generate high-level and high-quality samples through deep sample transformation. However, there are no public studies about deep sample learning. To solve the problem above, a deep dual-side learning ensemble model is designed in this paper. In this model, a deep sample learning algorithm is designed and combinedHighlights: A deep sample learning algorithm (DSL) is constructed for PD speech feature data. An embedded deep stack group sparse autoencoder (EGSAE) is designed for PD speech data. Deep dual-side learning ensemble model is constructed by combining EGSAE and DSL. Both the diagnosis and automatic assessment of treatment of PD speech data are considered. Abstract: Early diagnosis of Parkinson's disease (PD) is very important Kansara et al. (2013) and Stern (1993). In recent years, machine learning-based speech data analysis has been shown to be effective for diagnosing Parkinson's disease (PD) and automatically assessing rehabilitative speech treatment in PD Sakar et al. (2013), Tsanas et al. (2012) and Little et al. (2009). Machine learning includes feature learning and sample learning. Deep learning (deep feature learning) can generate high-level and high-quality features through deep feature transformation, improving classification accuracy. For reasons such as data collection, some samples have low quality for classification. Therefore, sample learning is necessary. Sample selection removes useless samples; therefore, deep sample learning is better, since it can generate high-level and high-quality samples through deep sample transformation. However, there are no public studies about deep sample learning. To solve the problem above, a deep dual-side learning ensemble model is designed in this paper. In this model, a deep sample learning algorithm is designed and combined with a deep network (deep feature learning), thereby realizing the deep dual-side learning of PD speech data. First, an embedded stack group sparse autoencoder is designed in this paper to conduct deep feature learning to acquire new high-level deep feature data. Second, the deep features are fused with original speech features by L1 regularization feature selection methods, thereby constructing hybrid feature data. Third, an iterative mean clustering algorithm (IMC) was designed, thereby constructing a deep sample learning algorithm and conducting deep sample transformation. After that step, hierarchical sample spaces are constructed based on a deep sample learning algorithm, and the classification models are constructed on the sample spaces. Finally, the weighted fusion mechanism is designed to merge the classification models into a classification ensemble model, thereby fusing the deep feature learning algorithm and the deep sample learning algorithm together. The ensemble model is called the deep dual-side learning ensemble model. At the end of this paper, two representative speech datasets of PD were used for validation. The experimental results show that the main innovation part of the algorithm is effective. For the two datasets, the mean accuracy of the proposed algorithm reaches 98.4% and 99.6%, which are better than the state-of-art relevant algorithms. The study shows that deep dual-side learning is better for existing deep feature learning for PD speech recognition. … (more)
- Is Part Of:
- Biomedical signal processing and control. Volume 69(2021)
- Journal:
- Biomedical signal processing and control
- Issue:
- Volume 69(2021)
- Issue Display:
- Volume 69, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 69
- Issue:
- 2021
- Issue Sort Value:
- 2021-0069-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-08
- Subjects:
- Speech recognition of Parkinson's disease -- Automatic assessment of rehabilitative speech treatment -- Deep dual-side learning -- Deep learning -- Feature fusion -- Deep sample learning
Signal processing -- Periodicals
Biomedical engineering -- Periodicals
Signal Processing, Computer-Assisted -- Periodicals
Image Processing, Computer-Assisted -- Periodicals
Biomedical Engineering -- Periodicals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/17468094 ↗
http://www.elsevier.com/journals ↗
http://www.sciencedirect.com/science?_ob=PublicationURL&_tockey=%23TOC%2329675%232006%23999989998%23626449%23FLA%23&_cdi=29675&_pubType=J&_auth=y&_acct=C000045259&_version=1&_urlVersion=0&_userid=836873&md5=664b5cf9a57fc91971a17faf20c32ec1 ↗ - DOI:
- 10.1016/j.bspc.2021.102849 ↗
- Languages:
- English
- ISSNs:
- 1746-8094
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2087.880400
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 18872.xml