RANT: A cascade reverse attention segmentation framework with hybrid transformer for laryngeal endoscope images. (September 2022)
- Record Type:
- Journal Article
- Title:
- RANT: A cascade reverse attention segmentation framework with hybrid transformer for laryngeal endoscope images. (September 2022)
- Main Title:
- RANT: A cascade reverse attention segmentation framework with hybrid transformer for laryngeal endoscope images
- Authors:
- Pan, Xiaoying
Bai, Weidong
Ma, Minjie
Zhang, Shaoqiang - Abstract:
- Highlights: Multi-objective segmentation in laryngeal endoscope images. A progressive segmentation framework combining CNN and Transformer is proposed to improve the segmentation accuracy. We achieve the best performance of multi-objective segmentation in two datasets of laryngeal endoscope images. Abstract: Automatic segmentation of organizations and organs is a prerequisite for medical image analysis and computer-assisted diagnosis and treatment. The larynx is an important part of the human body, and scholars have paid less attention to the segmentation research in the larynx. For electronic laryngoscope images, a cascaded reverse attention network with hybrid transformer (RANT) is presented. Firstly, the RANT combines transformer and CNN in a serial way to capture the global dependency feature based on the low-level spatial detail feature. Secondly, combined with the reverse attention and receptive field block module (RRM), connects features of different scale in a cascade way to gradually mined the target. Finally, the segmentation results are optimized by convolutional conditional random fields (ConvCRFs). The experimental results on two laryngoscopy datasets show that the RANT achieves the best balance of multi-organ segmentation compared with other the state-of-the-art segmentation networks. On the two datasets, the mIoU of RANT reaches 76.63% and 88.77%, the mDSC reaches 83.45% and 93.49%, which is greatly improved compared with the benchmark. The RANT network canHighlights: Multi-objective segmentation in laryngeal endoscope images. A progressive segmentation framework combining CNN and Transformer is proposed to improve the segmentation accuracy. We achieve the best performance of multi-objective segmentation in two datasets of laryngeal endoscope images. Abstract: Automatic segmentation of organizations and organs is a prerequisite for medical image analysis and computer-assisted diagnosis and treatment. The larynx is an important part of the human body, and scholars have paid less attention to the segmentation research in the larynx. For electronic laryngoscope images, a cascaded reverse attention network with hybrid transformer (RANT) is presented. Firstly, the RANT combines transformer and CNN in a serial way to capture the global dependency feature based on the low-level spatial detail feature. Secondly, combined with the reverse attention and receptive field block module (RRM), connects features of different scale in a cascade way to gradually mined the target. Finally, the segmentation results are optimized by convolutional conditional random fields (ConvCRFs). The experimental results on two laryngoscopy datasets show that the RANT achieves the best balance of multi-organ segmentation compared with other the state-of-the-art segmentation networks. On the two datasets, the mIoU of RANT reaches 76.63% and 88.77%, the mDSC reaches 83.45% and 93.49%, which is greatly improved compared with the benchmark. The RANT network can segment each class of laryngoscope image effectively. This method is of great significance in clinical application. … (more)
- Is Part Of:
- Biomedical signal processing and control. Volume 78(2022)
- Journal:
- Biomedical signal processing and control
- Issue:
- Volume 78(2022)
- Issue Display:
- Volume 78, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 78
- Issue:
- 2022
- Issue Sort Value:
- 2022-0078-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-09
- Subjects:
- Laryngeal endoscopic image -- Soft tissue segmentation -- Vocal folds segmentation -- Transformer -- Convolutional neural network
Signal processing -- Periodicals
Biomedical engineering -- Periodicals
Signal Processing, Computer-Assisted -- Periodicals
Image Processing, Computer-Assisted -- Periodicals
Biomedical Engineering -- Periodicals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/17468094 ↗
http://www.elsevier.com/journals ↗
http://www.sciencedirect.com/science?_ob=PublicationURL&_tockey=%23TOC%2329675%232006%23999989998%23626449%23FLA%23&_cdi=29675&_pubType=J&_auth=y&_acct=C000045259&_version=1&_urlVersion=0&_userid=836873&md5=664b5cf9a57fc91971a17faf20c32ec1 ↗ - DOI:
- 10.1016/j.bspc.2022.103890 ↗
- Languages:
- English
- ISSNs:
- 1746-8094
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2087.880400
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 23053.xml