Unsupervised segmentation of the vocal tract from real-time MRI sequences. (September 2015)
- Record Type:
- Journal Article
- Title:
- Unsupervised segmentation of the vocal tract from real-time MRI sequences. (September 2015)
- Main Title:
- Unsupervised segmentation of the vocal tract from real-time MRI sequences
- Authors:
- Silva, Samuel
Teixeira, António - Abstract:
- Abstract : Highlights: Vocal tract segmentation considering sequential nature of the RT-MRI data. Explicit consideration of vocal tract configurations with an open and closed velum. Single, high level segmentation initialisation per speaker, unsupervised operation thereafter. Small set of images for training: small manual annotation overhead. Evaluation of precision and accuracy over large image set and considering annotated images by four observers. Abstract: Advances on real-time magnetic resonance imaging (RT-MRI) make it suitable to study the dynamic aspects of the upper airway. One of the main challenges concerns how to deal with the large amount of data resulting from these studies, particularly to extract relevant features for analysis such as the vocal tract profiles. A method is proposed, based on a modified active appearance model (AAM) approach, for unsupervised segmentation of the vocal tract from midsagittal RT-MRI sequences. The described approach was designed considering the low inter-frame difference. As a result, when compared to a traditional AAM approach, segmentation is performed faster and model convergence is improved, attaining good results using small training sets. The main goal is to extract the vocal tract profiles automatically, over time, providing identification of different regions of interest, to allow the study of the dynamic features of the vocal tract, for example, during speech production. The proposed method has been evaluated againstAbstract : Highlights: Vocal tract segmentation considering sequential nature of the RT-MRI data. Explicit consideration of vocal tract configurations with an open and closed velum. Single, high level segmentation initialisation per speaker, unsupervised operation thereafter. Small set of images for training: small manual annotation overhead. Evaluation of precision and accuracy over large image set and considering annotated images by four observers. Abstract: Advances on real-time magnetic resonance imaging (RT-MRI) make it suitable to study the dynamic aspects of the upper airway. One of the main challenges concerns how to deal with the large amount of data resulting from these studies, particularly to extract relevant features for analysis such as the vocal tract profiles. A method is proposed, based on a modified active appearance model (AAM) approach, for unsupervised segmentation of the vocal tract from midsagittal RT-MRI sequences. The described approach was designed considering the low inter-frame difference. As a result, when compared to a traditional AAM approach, segmentation is performed faster and model convergence is improved, attaining good results using small training sets. The main goal is to extract the vocal tract profiles automatically, over time, providing identification of different regions of interest, to allow the study of the dynamic features of the vocal tract, for example, during speech production. The proposed method has been evaluated against vocal tract delineations manually performed by four observers, yielding good agreement. … (more)
- Is Part Of:
- Computer speech & language. Volume 33(2015)
- Journal:
- Computer speech & language
- Issue:
- Volume 33(2015)
- Issue Display:
- Volume 33, Issue 2015 (2015)
- Year:
- 2015
- Volume:
- 33
- Issue:
- 2015
- Issue Sort Value:
- 2015-0033-2015-0000
- Page Start:
- 25
- Page End:
- 46
- Publication Date:
- 2015-09
- Subjects:
- Vocal tract -- Segmentation -- Real-time MRI
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2014.12.003 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 6237.xml