Speech enhancement using Maximum A-Posteriori and Gaussian Mixture Models for speech and noise Periodogram estimation. (March 2016)
- Record Type:
- Journal Article
- Title:
- Speech enhancement using Maximum A-Posteriori and Gaussian Mixture Models for speech and noise Periodogram estimation. (March 2016)
- Main Title:
- Speech enhancement using Maximum A-Posteriori and Gaussian Mixture Models for speech and noise Periodogram estimation
- Authors:
- Chehrehsa, Sarang
Moir, Tom James - Abstract:
- Highlights: We use Gaussian Mixture Modelling (GMM) to model Periodograms of speech and noise. A method to find the proper size of GMMs is discussed. We propose a Maximum A-Posteriori (MAP) Periodogram estimation using GMMs. The GMMs are used to enhance the noisy speech using Wiener filter. Abstract: In speech enhancement, Gaussian Mixture Models (GMMs) can be used to model the Probability Density Function (PDF) of the Periodograms of speech and different noise types. These GMMs are created by applying the Estimate Maximization (EM) algorithm on large datasets of speech and different noise type Periodograms and hence classify them into a small number of clusters whose centroid Periodograms are the mean vectors of the GMMs. These GMMs are used to realize the Maximum A-Posteriori (MAP) estimation of the speech and noise Periodograms present in a noisy speech observation. To realize the MAP estimation, use of a constrained optimization algorithm is proposed in which relatively good enhancement results with high processing times are attained. Due to the use of constraints in the optimization algorithm, incorrect estimation results may arise due to possible local maxima. A simple analytic MAP algorithm is proposed to attain global maximums in lower calculation times. With the new method the complicated MAP formula is simplified as much as possible to find the maxima, through solving a set of equations and not through conventional numerical methods used in optimization. ThisHighlights: We use Gaussian Mixture Modelling (GMM) to model Periodograms of speech and noise. A method to find the proper size of GMMs is discussed. We propose a Maximum A-Posteriori (MAP) Periodogram estimation using GMMs. The GMMs are used to enhance the noisy speech using Wiener filter. Abstract: In speech enhancement, Gaussian Mixture Models (GMMs) can be used to model the Probability Density Function (PDF) of the Periodograms of speech and different noise types. These GMMs are created by applying the Estimate Maximization (EM) algorithm on large datasets of speech and different noise type Periodograms and hence classify them into a small number of clusters whose centroid Periodograms are the mean vectors of the GMMs. These GMMs are used to realize the Maximum A-Posteriori (MAP) estimation of the speech and noise Periodograms present in a noisy speech observation. To realize the MAP estimation, use of a constrained optimization algorithm is proposed in which relatively good enhancement results with high processing times are attained. Due to the use of constraints in the optimization algorithm, incorrect estimation results may arise due to possible local maxima. A simple analytic MAP algorithm is proposed to attain global maximums in lower calculation times. With the new method the complicated MAP formula is simplified as much as possible to find the maxima, through solving a set of equations and not through conventional numerical methods used in optimization. This method results in excellent speech enhancement with a relatively short processing time. … (more)
- Is Part Of:
- Computer speech & language. Volume 36(2016)
- Journal:
- Computer speech & language
- Issue:
- Volume 36(2016)
- Issue Display:
- Volume 36, Issue 2016 (2016)
- Year:
- 2016
- Volume:
- 36
- Issue:
- 2016
- Issue Sort Value:
- 2016-0036-2016-0000
- Page Start:
- 58
- Page End:
- 71
- Publication Date:
- 2016-03
- Subjects:
- Gaussian Mixture Model (GMM) -- Maximum A-Posteriori (MAP) -- Wiener filter -- Speech enhancement
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2015.09.001 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 528.xml