On the feasibility of using a bispectral measure as a nonintrusive predictor of speech intelligibility. (September 2019)
- Record Type:
- Journal Article
- Title:
- On the feasibility of using a bispectral measure as a nonintrusive predictor of speech intelligibility. (September 2019)
- Main Title:
- On the feasibility of using a bispectral measure as a nonintrusive predictor of speech intelligibility
- Authors:
- Hossain, Md Ekramul
Zilany, Muhammad S.A.
Davies-Venn, Evelyn - Abstract:
- Highlights: BSIM can address a wide range of linear and nonlinear distortions that influence speech intelligibility in quiet and noise. BSIM can provide real-time predictions of the effect of acoustic signal distortion on speech intelligibility. BSIM was compared to five existing speech intelligibility metrics. BSIM could be used to analyze algorithms that process noisy speech. Abstract: The presence of background noise or nonlinear distortions encountered in real-world situations often reduces the intelligibility of speech signals. Several objective measurements and prediction procedures have been developed to assess speech intelligibility in noise. Most of the existing measures are, however, suitable for only a subset of specified forms of distortion. This study developed a reliable, reference-free speech intelligibility metric that uses the properties of an acoustic signal to predict the effects of a wide range of distortions that influence speech intelligibility in quiet and noisy conditions. The bispectral speech intelligibility metric (BSIM), was developed by extracting the features from the spectrogram of speech signals using the third-order statistics, which are collectively known as the bispectrum. Speech intelligibility scores predicted by the BSIM were compared to behavioral speech intelligibility scores in quiet and noise. The performance of the BSIM was also compared with that of several widely used speech intelligibility metrics. Results showed that the BSIMHighlights: BSIM can address a wide range of linear and nonlinear distortions that influence speech intelligibility in quiet and noise. BSIM can provide real-time predictions of the effect of acoustic signal distortion on speech intelligibility. BSIM was compared to five existing speech intelligibility metrics. BSIM could be used to analyze algorithms that process noisy speech. Abstract: The presence of background noise or nonlinear distortions encountered in real-world situations often reduces the intelligibility of speech signals. Several objective measurements and prediction procedures have been developed to assess speech intelligibility in noise. Most of the existing measures are, however, suitable for only a subset of specified forms of distortion. This study developed a reliable, reference-free speech intelligibility metric that uses the properties of an acoustic signal to predict the effects of a wide range of distortions that influence speech intelligibility in quiet and noisy conditions. The bispectral speech intelligibility metric (BSIM), was developed by extracting the features from the spectrogram of speech signals using the third-order statistics, which are collectively known as the bispectrum. Speech intelligibility scores predicted by the BSIM were compared to behavioral speech intelligibility scores in quiet and noise. The performance of the BSIM was also compared with that of several widely used speech intelligibility metrics. Results showed that the BSIM can successfully predict nonlinear distortions, such as peak-clipping and center-clipping, as well as time domain distortions, such as phase-jitter and reverberation. Unlike existing metrics, such as the articulation index and speech transmission index, the BSIM successfully captured the effect of fluctuating noise on speech intelligibility and predicted the effects of the degradation of noisy speech processed by the ideal time-frequency segregation method. The BSIM presents a reliable, reference-free, and objective measure of speech intelligibility that can provide real-time predictions of the effect of signal processing and acoustics distortion on speech intelligibility in quiet and noise. In addition, the BSIM could be used to analyze algorithms that process noisy speech. … (more)
- Is Part Of:
- Computer speech & language. Volume 57(2019)
- Journal:
- Computer speech & language
- Issue:
- Volume 57(2019)
- Issue Display:
- Volume 57, Issue 2019 (2019)
- Year:
- 2019
- Volume:
- 57
- Issue:
- 2019
- Issue Sort Value:
- 2019-0057-2019-0000
- Page Start:
- 59
- Page End:
- 80
- Publication Date:
- 2019-09
- Subjects:
- Speech intelligibility -- Spectrogram -- Higher order statistics -- Bispectrum
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2019.02.003 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 10443.xml