Automatic detection of pharyngeal fricatives in cleft palate speech using acoustic features based on the vocal tract area spectrum. (July 2021)
- Record Type:
- Journal Article
- Title:
- Automatic detection of pharyngeal fricatives in cleft palate speech using acoustic features based on the vocal tract area spectrum. (July 2021)
- Main Title:
- Automatic detection of pharyngeal fricatives in cleft palate speech using acoustic features based on the vocal tract area spectrum
- Authors:
- Fu, Jia
He, Fei
Yin, Heng
He, Ling - Abstract:
- Highlights: An automatic pharyngeal fricative detection method is proposed. A vocal tract area spectrum is proposed to represent the vocal tract using time-varying cascaded pipes. Four acoustic features based on vocal tract area spectrum are extracted to identify the pharyngeal fricative. Abstract: The pharyngeal fricative is a typical compensatory articulation disorder in cleft palate speech. It is produced by retracting the root of the tongue to the posterior pharyngeal wall to substitute for the fricatives and affricates produced in the oral cavity. People who use the pharyngeal fricative have difficulties in daily communication. Research on automatic pharyngeal fricative detection can provide aids in diagnosis for speech-language pathologists and clinical doctors. This work proposes a vocal tract area spectrum (VTAS) to represent a vocal tract model using time-varying cascaded pipes. Four acoustic features based on the VTAS (the centroid and spread (CS), peak linear deviation (PLD), relative-normal entropy (RNE), mean of the ratios' statistics (MRS)) are proposed to evaluate the differences between pharyngeal fricatives and normal speech. The CS feature is proposed to evaluate the overall shape of the vocal tract to detect whether there are abnormal gestures or movements of the articulators in speech production. The PLD and RNE features focus on the variation and complexity of each vocal tube's area during the whole pronunciation process. The MRS feature is proposed toHighlights: An automatic pharyngeal fricative detection method is proposed. A vocal tract area spectrum is proposed to represent the vocal tract using time-varying cascaded pipes. Four acoustic features based on vocal tract area spectrum are extracted to identify the pharyngeal fricative. Abstract: The pharyngeal fricative is a typical compensatory articulation disorder in cleft palate speech. It is produced by retracting the root of the tongue to the posterior pharyngeal wall to substitute for the fricatives and affricates produced in the oral cavity. People who use the pharyngeal fricative have difficulties in daily communication. Research on automatic pharyngeal fricative detection can provide aids in diagnosis for speech-language pathologists and clinical doctors. This work proposes a vocal tract area spectrum (VTAS) to represent a vocal tract model using time-varying cascaded pipes. Four acoustic features based on the VTAS (the centroid and spread (CS), peak linear deviation (PLD), relative-normal entropy (RNE), mean of the ratios' statistics (MRS)) are proposed to evaluate the differences between pharyngeal fricatives and normal speech. The CS feature is proposed to evaluate the overall shape of the vocal tract to detect whether there are abnormal gestures or movements of the articulators in speech production. The PLD and RNE features focus on the variation and complexity of each vocal tube's area during the whole pronunciation process. The MRS feature is proposed to describe the continuity of the vocal tract. To evaluate the effectiveness of these four features, pharyngeal fricative detection experiments are conducted using a pharyngeal fricative dataset. This dataset contains 1246 speech samples spoken by 50 cleft palate patients and 50 normal speakers, covering all types of initial consonants in which the pharyngeal fricative usually occurs. The detection accuracy of the pharyngeal fricative using the CS, PLD, RNE and MRS feature ranges from 80.66% to 90.21%. When using the proposed CS+PLD+RNE+MRS feature, an accuracy of 95.18% can be achieved on the pharyngeal fricative dataset. … (more)
- Is Part Of:
- Computer speech & language. Volume 68(2021)
- Journal:
- Computer speech & language
- Issue:
- Volume 68(2021)
- Issue Display:
- Volume 68, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 68
- Issue:
- 2021
- Issue Sort Value:
- 2021-0068-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-07
- Subjects:
- Cleft palate speech -- Pharyngeal fricative -- Vocal tract area spectrum -- Shape and continuity of vocal tract -- Complexity of individual vocal tube area sequence
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2021.101203 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 16008.xml