Speech rate estimation in disordered speech based on spectral landmark detection. (May 2016)
- Record Type:
- Journal Article
- Title:
- Speech rate estimation in disordered speech based on spectral landmark detection. (May 2016)
- Main Title:
- Speech rate estimation in disordered speech based on spectral landmark detection
- Authors:
- Huici, Hernandez-Diaz
Kairuz, Hector A.
Martens, Heidi
Van Nuffelen, Gwen
De Bodt, Marc - Abstract:
- Highlights: An algorithm for speech rate (SR) estimation in sentences is proposed. The landmark-based algorithm was tested with 198 sentences and compared with three other reported algorithms. The proposed algorithm shows the highest correlations and the minimum errors in comparison with others. The SR estimator is suitable for disordered speech and does not need training. This algorithm solves problems for slow SR segments of speech reported in formerly reported research. Abstract: Speech rate (SR) plays an important role in the assessment of disordered speech. Clinicians rely primarily on manual or semi-automatic methods to determine SR. The reported algorithms are designed for normal speech and show many restrictions with respect to disordered speech that are predominantly characterized by slow SR. This research presents an algorithm that in addition to energy and pitch, relies on information regarding the spectral characteristics of the borders of the syllables (landmarks). Speech samples (three sentences per speaker) for 66 healthy and dysarthric speakers were analyzed with four algorithms ( Mrate, robust SR estimation method, Praat' s script and the proposed algorithm based on landmark detection). The landmark approach is demonstrated to be more accurate for speakers with slow SR. The Pearson correlation coefficient between the calculated SR and the reference remains over 0.84 for the 198 sentences analyzed, while the other algorithms' correlations are below the valuesHighlights: An algorithm for speech rate (SR) estimation in sentences is proposed. The landmark-based algorithm was tested with 198 sentences and compared with three other reported algorithms. The proposed algorithm shows the highest correlations and the minimum errors in comparison with others. The SR estimator is suitable for disordered speech and does not need training. This algorithm solves problems for slow SR segments of speech reported in formerly reported research. Abstract: Speech rate (SR) plays an important role in the assessment of disordered speech. Clinicians rely primarily on manual or semi-automatic methods to determine SR. The reported algorithms are designed for normal speech and show many restrictions with respect to disordered speech that are predominantly characterized by slow SR. This research presents an algorithm that in addition to energy and pitch, relies on information regarding the spectral characteristics of the borders of the syllables (landmarks). Speech samples (three sentences per speaker) for 66 healthy and dysarthric speakers were analyzed with four algorithms ( Mrate, robust SR estimation method, Praat' s script and the proposed algorithm based on landmark detection). The landmark approach is demonstrated to be more accurate for speakers with slow SR. The Pearson correlation coefficient between the calculated SR and the reference remains over 0.84 for the 198 sentences analyzed, while the other algorithms' correlations are below the values reported in literature for fluent speech. In samples where SR is high, the algorithm shows similar limitations versus other algorithms due to the merging of syllables. The landmark-based algorithm is an adequate method for determining SR in disordered speech. … (more)
- Is Part Of:
- Biomedical signal processing and control. Volume 27(2016)
- Journal:
- Biomedical signal processing and control
- Issue:
- Volume 27(2016)
- Issue Display:
- Volume 27, Issue 2016 (2016)
- Year:
- 2016
- Volume:
- 27
- Issue:
- 2016
- Issue Sort Value:
- 2016-0027-2016-0000
- Page Start:
- 1
- Page End:
- 6
- Publication Date:
- 2016-05
- Subjects:
- Speaking rate estimation -- Landmarks -- Disordered speech -- Dysarthria
Signal processing -- Periodicals
Biomedical engineering -- Periodicals
Signal Processing, Computer-Assisted -- Periodicals
Image Processing, Computer-Assisted -- Periodicals
Biomedical Engineering -- Periodicals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/17468094 ↗
http://www.elsevier.com/journals ↗
http://www.sciencedirect.com/science?_ob=PublicationURL&_tockey=%23TOC%2329675%232006%23999989998%23626449%23FLA%23&_cdi=29675&_pubType=J&_auth=y&_acct=C000045259&_version=1&_urlVersion=0&_userid=836873&md5=664b5cf9a57fc91971a17faf20c32ec1 ↗ - DOI:
- 10.1016/j.bspc.2016.01.005 ↗
- Languages:
- English
- ISSNs:
- 1746-8094
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2087.880400
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 2193.xml