Automated pharyngeal phase detection and bolus localization in videofluoroscopic swallowing study: Killing two birds with one stone?. (October 2022)
- Record Type:
- Journal Article
- Title:
- Automated pharyngeal phase detection and bolus localization in videofluoroscopic swallowing study: Killing two birds with one stone?. (October 2022)
- Main Title:
- Automated pharyngeal phase detection and bolus localization in videofluoroscopic swallowing study: Killing two birds with one stone?
- Authors:
- Bandini, Andrea
Smaoui, Sana
Steele, Catriona M. - Abstract:
- Highlights: Clinical rating of VFSS data is time consuming and requires specialized training. Computer vision may help conduct automated VFSS analysis. We detect the pharyngeal phase and localize bolus in a weakly-supervised manner. No supervision is required in terms of bolus segmentation. Our approach will help conduct objective clinical assessment of VFSS recordings. Abstract: Background and objective: The videofluoroscopic swallowing study (VFSS) is a gold-standard imaging technique for assessing swallowing, but analysis and rating of VFSS recordings is time consuming and requires specialized training and expertise. Researchers have recently demonstrated that it is possible to automatically detect the pharyngeal phase of swallowing and to localize the bolus in VFSS recordings via computer vision approaches, fostering the development of novel techniques for automatic VFSS analysis. However, training of algorithms to perform these tasks requires large amounts of annotated data that are seldom available. In this paper, we demonstrate that the challenges of pharyngeal phase detection and bolus localization can be solved together using a single approach. Methods: We propose a deep-learning framework that jointly tackles pharyngeal phase detection and bolus localization in a weakly-supervised manner, requiring only the initial and final frames of the pharyngeal phase as ground truth annotations for the training. Our approach stems from the observation that bolus presence inHighlights: Clinical rating of VFSS data is time consuming and requires specialized training. Computer vision may help conduct automated VFSS analysis. We detect the pharyngeal phase and localize bolus in a weakly-supervised manner. No supervision is required in terms of bolus segmentation. Our approach will help conduct objective clinical assessment of VFSS recordings. Abstract: Background and objective: The videofluoroscopic swallowing study (VFSS) is a gold-standard imaging technique for assessing swallowing, but analysis and rating of VFSS recordings is time consuming and requires specialized training and expertise. Researchers have recently demonstrated that it is possible to automatically detect the pharyngeal phase of swallowing and to localize the bolus in VFSS recordings via computer vision approaches, fostering the development of novel techniques for automatic VFSS analysis. However, training of algorithms to perform these tasks requires large amounts of annotated data that are seldom available. In this paper, we demonstrate that the challenges of pharyngeal phase detection and bolus localization can be solved together using a single approach. Methods: We propose a deep-learning framework that jointly tackles pharyngeal phase detection and bolus localization in a weakly-supervised manner, requiring only the initial and final frames of the pharyngeal phase as ground truth annotations for the training. Our approach stems from the observation that bolus presence in the pharynx is the most prominent visual feature upon which to infer whether individual VFSS frames belong to the pharyngeal phase. We conducted extensive experiments with multiple convolutional neural networks (CNNs) on a dataset of 1245 bolus-level clips from 59 healthy subjects. Results: We demonstrated that the pharyngeal phase can be detected with an F1-score higher than 0.9. Moreover, by processing the class activation maps of the CNNs, we were able to localize the bolus with promising results, obtaining correlations with ground truth trajectories higher than 0.9, without any manual annotations of bolus location used for training purposes. Conclusions: Once validated on a larger sample of participants with swallowing disorders, our framework will pave the way for the development of intelligent tools for VFSS analysis to support clinicians in swallowing assessment. … (more)
- Is Part Of:
- Computer methods and programs in biomedicine. Volume 225(2022)
- Journal:
- Computer methods and programs in biomedicine
- Issue:
- Volume 225(2022)
- Issue Display:
- Volume 225, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 225
- Issue:
- 2022
- Issue Sort Value:
- 2022-0225-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-10
- Subjects:
- Videofluoroscopic swallowing study -- Convolutional neural networks -- Pharyngeal phase -- Bolus localization -- Video classification
Medicine -- Computer programs -- Periodicals
Biology -- Computer programs -- Periodicals
Computers -- Periodicals
Medicine -- Periodicals
Médecine -- Logiciels -- Périodiques
Biologie -- Logiciels -- Périodiques
Biology -- Computer programs
Medicine -- Computer programs
Periodicals
Electronic journals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/01692607 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.cmpb.2022.107058 ↗
- Languages:
- English
- ISSNs:
- 0169-2607
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.095000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 23924.xml