Thomson Multitaper MFCC and PLP voice features for early detection of Parkinson disease. (September 2018)
- Record Type:
- Journal Article
- Title:
- Thomson Multitaper MFCC and PLP voice features for early detection of Parkinson disease. (September 2018)
- Main Title:
- Thomson Multitaper MFCC and PLP voice features for early detection of Parkinson disease
- Authors:
- Upadhya, Savitha S.
Cheeran, A.N.
Nirmal, J.H. - Abstract:
- Highlights: A variance reduced speech power spectrum using Thomson Multitaper windows is investigated and is compared with single taper smooth window which exhibits large variance for the application of early detection of Parkinson disease using speech samples. Both MFCC and PLP features using the above techniques is extracted and a comparative performance analysis is reported. The experiment was also carried out by weighting the Thomson Multitaper spectrum using Uniform, Non-Uniform and Adaptive weights and the appropriate weight for this application and dataset is suggested. Optimal value of the number of tapers used for this application is also discussed. Abstract: In this paper, MFCC and PLP voice features extracted using Single Taper Smooth (STS) window and Thomson Multitaper (TMT) windowing technique together with a neural network classifier is used in the classification of Healthy people from early stage Parkinson diseased patients and a performance comparison of the two techniques is reported. Parkinson disease in their early stages, not only affects the muscular movements of the human body but also influences the articulatory process of the speech production mechanism. This signifies change in the shape of the vocal tract which manifests itself in the short time power spectrum. The MFCC and PLP features used in this investigation, which represent the vocal tract parameters are derived from the short time spectrum. It is therefore crucial to estimate this short timeHighlights: A variance reduced speech power spectrum using Thomson Multitaper windows is investigated and is compared with single taper smooth window which exhibits large variance for the application of early detection of Parkinson disease using speech samples. Both MFCC and PLP features using the above techniques is extracted and a comparative performance analysis is reported. The experiment was also carried out by weighting the Thomson Multitaper spectrum using Uniform, Non-Uniform and Adaptive weights and the appropriate weight for this application and dataset is suggested. Optimal value of the number of tapers used for this application is also discussed. Abstract: In this paper, MFCC and PLP voice features extracted using Single Taper Smooth (STS) window and Thomson Multitaper (TMT) windowing technique together with a neural network classifier is used in the classification of Healthy people from early stage Parkinson diseased patients and a performance comparison of the two techniques is reported. Parkinson disease in their early stages, not only affects the muscular movements of the human body but also influences the articulatory process of the speech production mechanism. This signifies change in the shape of the vocal tract which manifests itself in the short time power spectrum. The MFCC and PLP features used in this investigation, which represent the vocal tract parameters are derived from the short time spectrum. It is therefore crucial to estimate this short time power spectrum accurately. Generally, the short time speech power spectrum is estimated using STS window. But this power spectrum computed manifests large variance in the spectral estimates. Hence a variance reduced power spectrum is attained by computing the weighted average of the short time speech spectra obtained using a set of TMT windows. This spectrum is then used to compute the PLP and MFCC features. In this paper, extraction of both these voice features using STS window as well as TMT technique with three different weights namely Uniform, Eigen value (EV) and Adaptive weights is implemented using the speech samples of healthy and Parkinson diseased individuals. The experiment was carried out for several Thomson tapers ranging from 1 to 12 and the optimal number of tapers needed for the application and dataset is reported. A comparative performance analysis of the techniques implemented using both MFCC and PLP as features is then carried out in terms of classification accuracy, Equal Error Rate, sensitivity, selectivity and F1 score for the optimal taper value. The results obtained show that in comparison with the STS window a maximum improvement in the classification accuracy was obtained to be 6.6% for nine tapers, adaptive weights using MFCC as features and 6.9% for five tapers, EV weights using PLP as features for experimental dataset 1 and 6.0% using MFCC and 6.4% using PLP for experimental dataset 2. A performance improvement in other measures for the optimal taper value is also observed and reported for experimental dataset 1. … (more)
- Is Part Of:
- Biomedical signal processing and control. Volume 46(2018)
- Journal:
- Biomedical signal processing and control
- Issue:
- Volume 46(2018)
- Issue Display:
- Volume 46, Issue 2018 (2018)
- Year:
- 2018
- Volume:
- 46
- Issue:
- 2018
- Issue Sort Value:
- 2018-0046-2018-0000
- Page Start:
- 293
- Page End:
- 301
- Publication Date:
- 2018-09
- Subjects:
- MFCC -- PLP -- Single taper smooth window -- Multitaper -- Multitaper weights -- Classification accuracy -- Equal error rate
Signal processing -- Periodicals
Biomedical engineering -- Periodicals
Signal Processing, Computer-Assisted -- Periodicals
Image Processing, Computer-Assisted -- Periodicals
Biomedical Engineering -- Periodicals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/17468094 ↗
http://www.elsevier.com/journals ↗
http://www.sciencedirect.com/science?_ob=PublicationURL&_tockey=%23TOC%2329675%232006%23999989998%23626449%23FLA%23&_cdi=29675&_pubType=J&_auth=y&_acct=C000045259&_version=1&_urlVersion=0&_userid=836873&md5=664b5cf9a57fc91971a17faf20c32ec1 ↗ - DOI:
- 10.1016/j.bspc.2018.07.019 ↗
- Languages:
- English
- ISSNs:
- 1746-8094
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2087.880400
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 7242.xml