Automatic classification of informative laryngoscopic images using deep learning. Issue 2 (8th February 2022)
- Record Type:
- Journal Article
- Title:
- Automatic classification of informative laryngoscopic images using deep learning. Issue 2 (8th February 2022)
- Main Title:
- Automatic classification of informative laryngoscopic images using deep learning
- Authors:
- Yao, Peter
Witte, Dan
Gimonet, Hortense
German, Alexander
Andreadis, Katerina
Cheng, Michael
Sulica, Lucian
Elemento, Olivier
Barnes, Josue
Rameau, Anaïs - Abstract:
- Abstract: Objective: This study aims to develop and validate a convolutional neural network (CNN)‐based algorithm for automatic selection of informative frames in flexible laryngoscopic videos. The classifier has the potential to aid in the development of computer‐aided diagnosis systems and reduce data processing time for clinician‐computer scientist teams. Methods: A dataset of 22, 132 laryngoscopic frames was extracted from 137 flexible laryngostroboscopic videos from 115 patients. 55 videos were from healthy patients with no laryngeal pathology and 82 videos were from patients with vocal fold polyps. The extracted frames were manually labeled as informative or uninformative by two independent reviewers based on vocal fold visibility, lighting, focus, and camera distance, resulting in 18, 114 informative frames and 4018 uninformative frames. The dataset was split into training and test sets. A pre‐trained ResNet‐18 model was trained using transfer learning to classify frames as informative or uninformative. Hyperparameters were set using cross‐validation. The primary outcome was precision for the informative class and secondary outcomes were precision, recall, and F1‐score for all classes. The processing rate for frames between the model and a human annotator were compared. Results: The automated classifier achieved an informative frame precision, recall, and F1‐score of 94.4%, 90.2%, and 92.3%, respectively, when evaluated on a hold‐out test set of 4438 frames. The modelAbstract: Objective: This study aims to develop and validate a convolutional neural network (CNN)‐based algorithm for automatic selection of informative frames in flexible laryngoscopic videos. The classifier has the potential to aid in the development of computer‐aided diagnosis systems and reduce data processing time for clinician‐computer scientist teams. Methods: A dataset of 22, 132 laryngoscopic frames was extracted from 137 flexible laryngostroboscopic videos from 115 patients. 55 videos were from healthy patients with no laryngeal pathology and 82 videos were from patients with vocal fold polyps. The extracted frames were manually labeled as informative or uninformative by two independent reviewers based on vocal fold visibility, lighting, focus, and camera distance, resulting in 18, 114 informative frames and 4018 uninformative frames. The dataset was split into training and test sets. A pre‐trained ResNet‐18 model was trained using transfer learning to classify frames as informative or uninformative. Hyperparameters were set using cross‐validation. The primary outcome was precision for the informative class and secondary outcomes were precision, recall, and F1‐score for all classes. The processing rate for frames between the model and a human annotator were compared. Results: The automated classifier achieved an informative frame precision, recall, and F1‐score of 94.4%, 90.2%, and 92.3%, respectively, when evaluated on a hold‐out test set of 4438 frames. The model processed frames 16 times faster than a human annotator. Conclusion: The CNN‐based classifier demonstrates high precision for classifying informative frames in flexible laryngostroboscopic videos. This model has the potential to aid researchers with dataset creation for computer‐aided diagnosis systems by automatically extracting relevant frames from laryngoscopic videos. Abstract : We develop and validate a deep learning classifier capable of identifying informative frames in flexible laryngostroboscopic videos with high precision. This model has the potential to aid researchers with dataset creation for computer‐aided diagnosis systems by automatically extracting relevant frames from laryngoscopic videos. … (more)
- Is Part Of:
- Laryngoscope investigative otolaryngology. Volume 7:Issue 2(2022)
- Journal:
- Laryngoscope investigative otolaryngology
- Issue:
- Volume 7:Issue 2(2022)
- Issue Display:
- Volume 7, Issue 2 (2022)
- Year:
- 2022
- Volume:
- 7
- Issue:
- 2
- Issue Sort Value:
- 2022-0007-0002-0000
- Page Start:
- 460
- Page End:
- 466
- Publication Date:
- 2022-02-08
- Subjects:
- artificial intelligence -- computer vision -- computer‐aided diagnosis -- laryngology -- machine learning -- vocal fold polyp
Otolaryngology -- Periodicals
Laryngoscopy -- Periodicals
Otolaryngology
Otolaryngology
Periodicals
Periodicals
617.51 - Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)2378-8038 ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1002/lio2.754 ↗
- Languages:
- English
- ISSNs:
- 2378-8038
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 21324.xml