Information theoretic optimal vocal tract region selection from real time magnetic resonance images for broad phonetic class recognition. (September 2016)
- Record Type:
- Journal Article
- Title:
- Information theoretic optimal vocal tract region selection from real time magnetic resonance images for broad phonetic class recognition. (September 2016)
- Main Title:
- Information theoretic optimal vocal tract region selection from real time magnetic resonance images for broad phonetic class recognition
- Authors:
- Prasad, Abhay
Ghosh, Prasanta Kumar - Abstract:
- Abstract : Highlights: Information theoretic optimal regions selection from rtMRI images. Forward region splitting algorithm for maximizing mutual information. Articulatory features from the optimal set of regions. Benefit of proposed features for broad phonetic class recognition. Abstract: We propose an information theoretic region selection algorithm from the real time magnetic resonance imaging (rtMRI) video frames for a broad phonetic class recognition task. Representations derived from these optimal regions are used as the articulatory features for recognition. A set of connected and arbitrary shaped regions are selected such that the articulatory features computed from such regions provide maximal information about the broad phonetic classes. We also propose a tree-structured greedy region splitting algorithm to further segment these regions so that articulatory features from these split regions enhance the information about the phonetic classes. We find that some of the proposed articulatory features correlate well with the articulatory gestures from the Articulatory Phonology theory of speech production. Broad phonetic class recognition experiment using four rtMRI subjects reveals that the recognition accuracy with optimal split regions is, on average, higher than that using only acoustic features. Combining acoustic and articulatory features further reduces the error-rate by ∼8.25% (relative).
- Is Part Of:
- Computer speech & language. Volume 39(2016)
- Journal:
- Computer speech & language
- Issue:
- Volume 39(2016)
- Issue Display:
- Volume 39, Issue 2016 (2016)
- Year:
- 2016
- Volume:
- 39
- Issue:
- 2016
- Issue Sort Value:
- 2016-0039-2016-0000
- Page Start:
- 108
- Page End:
- 128
- Publication Date:
- 2016-09
- Subjects:
- Mutual information -- Phonetic recognition -- Speech production -- Region splitting
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2016.03.003 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 2467.xml