A segmentation-informed deep learning framework to register dynamic two-dimensional magnetic resonance images of the vocal tract during speech. (February 2023)
- Record Type:
- Journal Article
- Title:
- A segmentation-informed deep learning framework to register dynamic two-dimensional magnetic resonance images of the vocal tract during speech. (February 2023)
- Main Title:
- A segmentation-informed deep learning framework to register dynamic two-dimensional magnetic resonance images of the vocal tract during speech
- Authors:
- Ruthven, Matthieu
Miquel, Marc E.
King, Andrew P. - Abstract:
- Highlights: A segmentation-informed deformable registration framework was developed. It estimates displacement fields between dynamic 2D MR images of the vocal tract. Developed to facilitate quantitative analysis of articulator motion during speech. Intended for use in clinical and non-clinical studies of speech. The framework was evaluated using a clinically relevant metric. Abstract: Objective: Dynamic magnetic resonance (MR) imaging enables visualisation of articulators during speech. There is growing interest in quantifying articulator motion in two-dimensional MR images of the vocal tract, to better understand speech production and potentially inform patient management decisions. Image registration is an established way to achieve this quantification. Recently, segmentation-informed deformable registration frameworks have been developed and have achieved state-of-the-art accuracy. This work aims to adapt such a framework and optimise it for estimating displacement fields between dynamic two-dimensional MR images of the vocal tract during speech. Methods: A deep-learning-based registration framework was developed and compared with current state-of-the-art registration methods and frameworks (two traditional methods and three deep-learning-based frameworks, two of which are segmentation informed). The accuracy of the methods and frameworks was evaluated using the Dice coefficient (DSC), average surface distance (ASD) and a metric based on velopharyngeal closure. TheHighlights: A segmentation-informed deformable registration framework was developed. It estimates displacement fields between dynamic 2D MR images of the vocal tract. Developed to facilitate quantitative analysis of articulator motion during speech. Intended for use in clinical and non-clinical studies of speech. The framework was evaluated using a clinically relevant metric. Abstract: Objective: Dynamic magnetic resonance (MR) imaging enables visualisation of articulators during speech. There is growing interest in quantifying articulator motion in two-dimensional MR images of the vocal tract, to better understand speech production and potentially inform patient management decisions. Image registration is an established way to achieve this quantification. Recently, segmentation-informed deformable registration frameworks have been developed and have achieved state-of-the-art accuracy. This work aims to adapt such a framework and optimise it for estimating displacement fields between dynamic two-dimensional MR images of the vocal tract during speech. Methods: A deep-learning-based registration framework was developed and compared with current state-of-the-art registration methods and frameworks (two traditional methods and three deep-learning-based frameworks, two of which are segmentation informed). The accuracy of the methods and frameworks was evaluated using the Dice coefficient (DSC), average surface distance (ASD) and a metric based on velopharyngeal closure. The metric evaluated if the fields captured a clinically relevant and quantifiable aspect of articulator motion. Results: The segmentation-informed frameworks achieved higher DSCs and lower ASDs and captured more velopharyngeal closures than the traditional methods and the framework that was not segmentation informed. All segmentation-informed frameworks achieved similar DSCs and ASDs. However, the proposed framework captured the most velopharyngeal closures. Conclusions: A framework was successfully developed and found to more accurately estimate articulator motion than five current state-of-the-art methods and frameworks. Significance: The first deep-learning-based framework specifically for registering dynamic two-dimensional MR images of the vocal tract during speech has been developed and evaluated. … (more)
- Is Part Of:
- Biomedical signal processing and control. Volume 80(2023)Part 1
- Journal:
- Biomedical signal processing and control
- Issue:
- Volume 80(2023)Part 1
- Issue Display:
- Volume 80, Issue 1, Part 1 (2023)
- Year:
- 2023
- Volume:
- 80
- Issue:
- 1
- Part:
- 1
- Issue Sort Value:
- 2023-0080-0001-0001
- Page Start:
- Page End:
- Publication Date:
- 2023-02
- Subjects:
- Convolutional neural networks -- Registration -- Segmentation -- Dynamic magnetic resonance imaging -- Speech -- Articulators
Signal processing -- Periodicals
Biomedical engineering -- Periodicals
Signal Processing, Computer-Assisted -- Periodicals
Image Processing, Computer-Assisted -- Periodicals
Biomedical Engineering -- Periodicals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/17468094 ↗
http://www.elsevier.com/journals ↗
http://www.sciencedirect.com/science?_ob=PublicationURL&_tockey=%23TOC%2329675%232006%23999989998%23626449%23FLA%23&_cdi=29675&_pubType=J&_auth=y&_acct=C000045259&_version=1&_urlVersion=0&_userid=836873&md5=664b5cf9a57fc91971a17faf20c32ec1 ↗ - DOI:
- 10.1016/j.bspc.2022.104290 ↗
- Languages:
- English
- ISSNs:
- 1746-8094
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2087.880400
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 24559.xml