Detection of vowel transition regions from Hindi language. (November 2021)
- Record Type:
- Journal Article
- Title:
- Detection of vowel transition regions from Hindi language. (November 2021)
- Main Title:
- Detection of vowel transition regions from Hindi language
- Authors:
- Yadav, Jainath
- Abstract:
- Highlights: The vowel transition regions lie in the junction between a consonant and vowel (CV) regions and between a vowel and a consonant (VC) regions. We have proposed a method for determining the vowel transition regions based on the rate of change of formant frequencies using zero-time windowing and numerator of group-delay function. Zero-time windowing derives the instantaneous formant frequencies accurately at every sample location due to contribution of that sample itself. The numerator of group-delay function enhances the formant frequencies. The proposed transition region detection method is evaluated on CV, CVC and TIMIT databases and it has shown significant improvement in the performance compared to the existing method. Abstract: The vowel transition regions are the crucial landmarks in the speech signal. These vital regions are present at both ends of the vowel. They lie in the junction between a consonant and a vowel (CV) regions. This region plays an important role in numerous speech applications like speaker recognition, emotion conversion, speech rate modification, and CV unit recognition. The performance of these applications crucially depends on the accuracy of the estimation of vowel transition regions. In this paper, we have proposed a method for determining the transition regions based on the rate of change of formant frequencies using zero-time windowing and numerator of the group-delay function. Zero-time windowing derives the instantaneous formantHighlights: The vowel transition regions lie in the junction between a consonant and vowel (CV) regions and between a vowel and a consonant (VC) regions. We have proposed a method for determining the vowel transition regions based on the rate of change of formant frequencies using zero-time windowing and numerator of group-delay function. Zero-time windowing derives the instantaneous formant frequencies accurately at every sample location due to contribution of that sample itself. The numerator of group-delay function enhances the formant frequencies. The proposed transition region detection method is evaluated on CV, CVC and TIMIT databases and it has shown significant improvement in the performance compared to the existing method. Abstract: The vowel transition regions are the crucial landmarks in the speech signal. These vital regions are present at both ends of the vowel. They lie in the junction between a consonant and a vowel (CV) regions. This region plays an important role in numerous speech applications like speaker recognition, emotion conversion, speech rate modification, and CV unit recognition. The performance of these applications crucially depends on the accuracy of the estimation of vowel transition regions. In this paper, we have proposed a method for determining the transition regions based on the rate of change of formant frequencies using zero-time windowing and numerator of the group-delay function. Zero-time windowing derives the instantaneous formant frequencies accurately at every sample location due to the contribution of that sample itself. The numerator of the group-delay function enhances the formant frequencies. The proposed transition region detection method is evaluated on CV, and continuous speech databases recorded in the Hindi language. The proposed method has shown around 12% improvement in accuracy compared to the existing method. … (more)
- Is Part Of:
- Computer speech & language. Volume 70(2021)
- Journal:
- Computer speech & language
- Issue:
- Volume 70(2021)
- Issue Display:
- Volume 70, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 70
- Issue:
- 2021
- Issue Sort Value:
- 2021-0070-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-11
- Subjects:
- Vowel transition region -- ITR -- FTR -- Vowel onset and offset points -- Zero-time windowing -- Zero-frequency filtering -- Group delay function -- NGD spectrum -- HNGD spectrum
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2021.101231 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 17252.xml