Automatic classification of speech dysfluencies in continuous speech based on similarity measures and morphological image processing tools. (January 2016)
- Record Type:
- Journal Article
- Title:
- Automatic classification of speech dysfluencies in continuous speech based on similarity measures and morphological image processing tools. (January 2016)
- Main Title:
- Automatic classification of speech dysfluencies in continuous speech based on similarity measures and morphological image processing tools
- Authors:
- Esmaili, Iman
Dabanloo, Nader Jafarnia
Vali, Mansour - Abstract:
- Graphical abstract: Highlights: Dysfluency types of prolongation and syllable/word/phrase repetition are considered. Morphological image processing tools are employed to extract dysfluency patterns. Dysfluent segments are detected precisely in continuous speech. The proposed method does not need classifier or training stage. The output parameters are type, duration and number of dysfluencies. Abstract: Speech-language pathologists, traditionally, count the number of speech dysfluencies to measure the rate of stuttering severity. Subjective stuttering assessment is time consuming and highly dependent on clinician's experiences. The present study proposes an objective evaluation of speech dysfluencies (sounds prolongation, syllables\words\phrases repetition) in continuous speech signals. The proposed method is based on finding similarity in successive frames of speech features for sounds prolongation detection and in close segments of speech for repetition detection. Speech signals are initially parameterized to MFCC, PLP or filter bank energy feature sets. Then, similarity matrix is calculated based on similarities of all pairs of frames using cross-correlation or Euclidean criterion. Similarity matrix is considered as an image and highly similar components are extracted using proper threshold. By employing morphological image processing tools, irrelevant parts of similarity matrix are removed and dysfluent parts are detected. The effects of different feature sets andGraphical abstract: Highlights: Dysfluency types of prolongation and syllable/word/phrase repetition are considered. Morphological image processing tools are employed to extract dysfluency patterns. Dysfluent segments are detected precisely in continuous speech. The proposed method does not need classifier or training stage. The output parameters are type, duration and number of dysfluencies. Abstract: Speech-language pathologists, traditionally, count the number of speech dysfluencies to measure the rate of stuttering severity. Subjective stuttering assessment is time consuming and highly dependent on clinician's experiences. The present study proposes an objective evaluation of speech dysfluencies (sounds prolongation, syllables\words\phrases repetition) in continuous speech signals. The proposed method is based on finding similarity in successive frames of speech features for sounds prolongation detection and in close segments of speech for repetition detection. Speech signals are initially parameterized to MFCC, PLP or filter bank energy feature sets. Then, similarity matrix is calculated based on similarities of all pairs of frames using cross-correlation or Euclidean criterion. Similarity matrix is considered as an image and highly similar components are extracted using proper threshold. By employing morphological image processing tools, irrelevant parts of similarity matrix are removed and dysfluent parts are detected. The effects of different feature sets and similarity measures on classification results were examined. The promising classification accuracy of 99.84%, 98.07% and 99.87% were achieved for detection of prolongation, syllable/word repetition and phrase repetition, respectively. … (more)
- Is Part Of:
- Biomedical signal processing and control. Volume 23(2016)
- Journal:
- Biomedical signal processing and control
- Issue:
- Volume 23(2016)
- Issue Display:
- Volume 23, Issue 2016 (2016)
- Year:
- 2016
- Volume:
- 23
- Issue:
- 2016
- Issue Sort Value:
- 2016-0023-2016-0000
- Page Start:
- 104
- Page End:
- 114
- Publication Date:
- 2016-01
- Subjects:
- Automatic dysfluency classification -- Similarity measures -- Morphological image processing -- Stuttering severity measurement
Signal processing -- Periodicals
Biomedical engineering -- Periodicals
Signal Processing, Computer-Assisted -- Periodicals
Image Processing, Computer-Assisted -- Periodicals
Biomedical Engineering -- Periodicals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/17468094 ↗
http://www.elsevier.com/journals ↗
http://www.sciencedirect.com/science?_ob=PublicationURL&_tockey=%23TOC%2329675%232006%23999989998%23626449%23FLA%23&_cdi=29675&_pubType=J&_auth=y&_acct=C000045259&_version=1&_urlVersion=0&_userid=836873&md5=664b5cf9a57fc91971a17faf20c32ec1 ↗ - DOI:
- 10.1016/j.bspc.2015.08.006 ↗
- Languages:
- English
- ISSNs:
- 1746-8094
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2087.880400
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 9099.xml