Spectral and temporal manipulations of SFF envelopes for enhancement of speech intelligibility in noise. (March 2019)
- Record Type:
- Journal Article
- Title:
- Spectral and temporal manipulations of SFF envelopes for enhancement of speech intelligibility in noise. (March 2019)
- Main Title:
- Spectral and temporal manipulations of SFF envelopes for enhancement of speech intelligibility in noise
- Authors:
- Chennupati, Nivedita
Kadiri, Sudarsana Reddy
B., Yegnanarayana - Abstract:
- Highlights: Speech signals with different levels of intelligibility are analyzed to find the contrasting features using Single frequency filtering (SFF) approach. It is observed that the speech signal is highly intelligible when it has high dynamic range for fine structure and low for gross structure across spectral and temporal domains. Modifications are proposed on fine and gross, spectral and temporal structures of SFF representation in order to increase the intelligibility of normal speech signal. The intelligibility improvement in noise is assessed using objective and subjective measures. Abstract: This paper presents a method for modifying speech to enhance its intelligibility in noise. The features contributing to intelligibility are analyzed using the recently proposed single frequency filtering (SFF) analysis of speech signals. In the SFF method, the spectral and temporal resolutions can be controlled using a single parameter of the filter, corresponding to the location of the pole on the negative real axis with respect to the unit circle in the z-plane. The SFF magnitude (envelope) and phase at several frequencies can be used to synthesize the original speech signal. Analysis of highly intelligible speech shows that the speech signal is more intelligible when it has higher dynamic range of amplitude locally (fine structure) and/or lower dynamic range of amplitude globally (gross structure) in both the spectral and temporal domains. Some features of normal speechHighlights: Speech signals with different levels of intelligibility are analyzed to find the contrasting features using Single frequency filtering (SFF) approach. It is observed that the speech signal is highly intelligible when it has high dynamic range for fine structure and low for gross structure across spectral and temporal domains. Modifications are proposed on fine and gross, spectral and temporal structures of SFF representation in order to increase the intelligibility of normal speech signal. The intelligibility improvement in noise is assessed using objective and subjective measures. Abstract: This paper presents a method for modifying speech to enhance its intelligibility in noise. The features contributing to intelligibility are analyzed using the recently proposed single frequency filtering (SFF) analysis of speech signals. In the SFF method, the spectral and temporal resolutions can be controlled using a single parameter of the filter, corresponding to the location of the pole on the negative real axis with respect to the unit circle in the z-plane. The SFF magnitude (envelope) and phase at several frequencies can be used to synthesize the original speech signal. Analysis of highly intelligible speech shows that the speech signal is more intelligible when it has higher dynamic range of amplitude locally (fine structure) and/or lower dynamic range of amplitude globally (gross structure) in both the spectral and temporal domains. Some features of normal speech are modified at fine and gross temporal and spectral levels, and the modified SFF envelopes are used to synthesize speech. The proposed method gives higher objective scores of intelligibility compared to original and the reference method (spectral shaping and dynamic range compression), under different conditions of noise. In subjective evaluation, though the word accuracies are not significantly different between the proposed and reference methods, listeners seem to prefer the proposed method as it gives louder and crisper sound. … (more)
- Is Part Of:
- Computer speech & language. Volume 54(2019)
- Journal:
- Computer speech & language
- Issue:
- Volume 54(2019)
- Issue Display:
- Volume 54, Issue 2019 (2019)
- Year:
- 2019
- Volume:
- 54
- Issue:
- 2019
- Issue Sort Value:
- 2019-0054-2019-0000
- Page Start:
- 86
- Page End:
- 105
- Publication Date:
- 2019-03
- Subjects:
- Speech intelligibility -- single frequency filtering (SFF) -- spectral fine structure (SFS) -- temporal fine structure (TFS) -- spectral gross structure (SGS) -- temporal gross structure (TGS)
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2018.09.002 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 8758.xml