Unsupervised speech enhancement in low SNR environments via sparseness and temporal gradient regularization. (1st December 2018)
- Record Type:
- Journal Article
- Title:
- Unsupervised speech enhancement in low SNR environments via sparseness and temporal gradient regularization. (1st December 2018)
- Main Title:
- Unsupervised speech enhancement in low SNR environments via sparseness and temporal gradient regularization
- Authors:
- Saleem, Nasir
Khattak, Muhammad Irfan
Shafi, Muhammad
- Abstract:
- A crucial stage in an unsupervised speech enhancement algorithm is the estimation of noise-related parameters, which usually requires prior models for noise. However, estimating such parameters is challenging at low signal-to-noise ratios or in nonstationary noisy environments. In this paper, without knowing the prior models, an unsupervised and iterative speech enhancement algorithm is proposed which assumes the speech spectrogram and its temporal gradient to be sparse components. The quasi-harmonic description of speech signals justifies this assumption. Speech enhancement is performed by decomposing the spectrogram of noisy speech into a sparse matrix, enforcing the sparsity and temporal gradient regularizations. The Kullback–Leibler divergence is incorporated to minimize the distance between the observation and the reconstructed components under nonnegativity constraints. The alternating direction method of multipliers is used to optimize the algorithm. The proposed algorithm differs from many speech enhancement approaches in that it reduces background noise in an uncomplicated manner, without needing a noise estimation algorithm to find a noise-only excerpt. In addition, the proposed algorithm achieves improved performance in adverse environments without knowing the exact distribution of the noise. The experimental results demonstrate that the proposed algorithm outperforms the competing algorithms in terms of speech quality and intelligibility. Moreover, the composite objective measure indicated better performance in terms of residual noise and speech distortion in strong noise.
- Is Part Of:
- Applied acoustics. Volume 141(2018)
- Journal:
- Applied acoustics
- Issue:
- Volume 141(2018)
- Issue Display:
- Volume 141, Issue 2018 (2018)
- Year:
- 2018
- Volume:
- 141
- Issue:
- 2018
- Issue Sort Value:
- 2018-0141-2018-0000
- Page Start:
- 333
- Page End:
- 347
- Publication Date:
- 2018-12-01
- Subjects:
- Unsupervised speech enhancement -- Kullback–Leibler divergence -- Sparsity -- Gradient -- Speech distortion -- Speech quality -- Residual noise -- Intelligibility
Acoustical engineering -- Periodicals
Periodicals
620.2
- Journal URLs:
- http://www.sciencedirect.com/science/journal/0003682X
http://www.elsevier.com/journals
http://www.elsevier.com/homepage/elecserv.htt
- DOI:
- 10.1016/j.apacoust.2018.07.027
- Languages:
- English
- ISSNs:
- 0003-682X
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 1571.400000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 12424.xml
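
The abstract describes decomposing a noisy spectrogram under a Kullback–Leibler data-fit term with sparsity and temporal gradient regularization. The record does not give the exact formulation, so the following is only a minimal sketch of what such a cost function might look like, not the authors' method. The function names, the use of L1 norms for both penalties, and the weights `lam` and `mu` are assumptions made for illustration. The sketch only evaluates a cost for a candidate spectrogram; it does not implement the alternating direction method of multipliers mentioned in the abstract.

```python
import numpy as np


def kl_divergence(V, S, eps=1e-12):
    """Generalized Kullback-Leibler divergence D(V || S) for nonnegative arrays.

    eps is a small constant (an assumption of this sketch) that avoids log(0).
    """
    V = np.asarray(V, dtype=float)
    S = np.asarray(S, dtype=float)
    return float(np.sum(V * np.log((V + eps) / (S + eps)) - V + S))


def objective(V, S, lam=0.1, mu=0.1):
    """Illustrative cost: KL(V || S) + lam * ||S||_1 + mu * ||D_t S||_1.

    V   : noisy magnitude spectrogram, shape (frequency, time), nonnegative.
    S   : candidate speech spectrogram, same shape, nonnegative.
    lam : hypothetical weight on the sparsity penalty.
    mu  : hypothetical weight on the temporal gradient penalty.
    D_t : first-order difference along the time axis.
    """
    S = np.asarray(S, dtype=float)
    if np.any(S < 0):
        raise ValueError("S must be nonnegative")
    sparsity = np.sum(np.abs(S))
    # Differences between consecutive time frames (columns).
    temporal_gradient = np.sum(np.abs(np.diff(S, axis=1)))
    return kl_divergence(V, S) + lam * sparsity + mu * temporal_gradient
```

In this sketch a larger `lam` favours spectrograms with fewer active time-frequency bins, and a larger `mu` favours spectrograms that change little from frame to frame. Suitable values would have to be taken from the article itself.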