A VGG attention vision transformer network for benign and malignant classification of breast ultrasound images. Issue 9 (30th July 2022)
- Record Type:
- Journal Article
- Title:
- A VGG attention vision transformer network for benign and malignant classification of breast ultrasound images. Issue 9 (30th July 2022)
- Main Title:
- A VGG attention vision transformer network for benign and malignant classification of breast ultrasound images
- Authors:
- Qu, Xiaolei
Lu, Hongyan
Tang, Wenzhong
Wang, Shuai
Zheng, Dezhi
Hou, Yaxin
Jiang, Jue - Abstract:
- Abstract: Purpose: Breast cancer is the most commonly occurring cancer worldwide. The ultrasound reflectivity imaging technique can be used to obtain breast ultrasound (BUS) images, which can be used to classify benign and malignant tumors. However, the classification is subjective and dependent on the experience and skill of operators and doctors. The automatic classification method can assist doctors and improve the objectivity, but current convolution neural network (CNN) is not good at learning global features and vision transformer (ViT) is not good at extraction local features. In this study, we proposed a visual geometry group attention ViT (VGGA‐ViT) network to overcome their disadvantages. Methods: In the proposed method, we used a CNN module to extract the local features and employed a ViT module to learn the global relationship among different regions and enhance the relevant local features. The CNN module was named the VGGA module. It was composed of a VGG backbone, a feature extraction fully connected layer, and a squeeze‐and‐excitation block. Both the VGG backbone and the ViT module were pretrained on the ImageNet dataset and retrained using BUS samples in this study. Two BUS datasets were employed for validation. Results: Cross‐validation was conducted on two BUS datasets. For the Dataset A, the proposed VGGA‐ViT network achieved high accuracy (88.71 ± $\ \pm \ $ 1.55%), recall (90.73 ± $\ \pm \ $ 1.57%), specificity (85.58 ± $\ \pm \ $ 3.35%), precisionAbstract: Purpose: Breast cancer is the most commonly occurring cancer worldwide. The ultrasound reflectivity imaging technique can be used to obtain breast ultrasound (BUS) images, which can be used to classify benign and malignant tumors. However, the classification is subjective and dependent on the experience and skill of operators and doctors. The automatic classification method can assist doctors and improve the objectivity, but current convolution neural network (CNN) is not good at learning global features and vision transformer (ViT) is not good at extraction local features. In this study, we proposed a visual geometry group attention ViT (VGGA‐ViT) network to overcome their disadvantages. Methods: In the proposed method, we used a CNN module to extract the local features and employed a ViT module to learn the global relationship among different regions and enhance the relevant local features. The CNN module was named the VGGA module. It was composed of a VGG backbone, a feature extraction fully connected layer, and a squeeze‐and‐excitation block. Both the VGG backbone and the ViT module were pretrained on the ImageNet dataset and retrained using BUS samples in this study. Two BUS datasets were employed for validation. Results: Cross‐validation was conducted on two BUS datasets. For the Dataset A, the proposed VGGA‐ViT network achieved high accuracy (88.71 ± $\ \pm \ $ 1.55%), recall (90.73 ± $\ \pm \ $ 1.57%), specificity (85.58 ± $\ \pm \ $ 3.35%), precision (90.77 ± $\ \pm \ $ 1.98%), F 1 score (90.73 ± $\ \pm \ $ 1.24%), and Matthews correlation coefficient (MCC) (76.34 ± 7 $\ \pm \ 7$ 3.29%), which were better than those of all compared previous networks in this study. The Dataset B was used as a separate test set, the test results showed that the VGGA‐ViT had highest accuracy (81.72 ± $\ \pm \ $ 2.99%), recall (64.45 ± $\ \pm \ $ 2.96%), specificity (90.28 ± $\ \pm \ $ 3.51%), precision (77.08 ± $\ \pm \ $ 7.21%), F 1 score (70.11 ± $\ \pm \ $ 4.25%), and MCC (57.64 ± $\ \pm \ $ 6.88%). Conclusions: In this study, we proposed the VGGA‐ViT for the BUS classification, which was good at learning both local and global features. The proposed network achieved higher accuracy than the compared previous methods. … (more)
- Is Part Of:
- Medical physics. Volume 49:Issue 9(2022)
- Journal:
- Medical physics
- Issue:
- Volume 49:Issue 9(2022)
- Issue Display:
- Volume 49, Issue 9 (2022)
- Year:
- 2022
- Volume:
- 49
- Issue:
- 9
- Issue Sort Value:
- 2022-0049-0009-0000
- Page Start:
- 5787
- Page End:
- 5798
- Publication Date:
- 2022-07-30
- Subjects:
- breast tumor -- breast ultrasound image -- classification -- deep learning
Medical physics -- Periodicals
Medical physics
Geneeskunde
Natuurkunde
Toepassingen
Biophysics
Periodicals
Periodicals
Electronic journals
610.153 - Journal URLs:
- http://scitation.aip.org/content/aapm/journal/medphys ↗
https://aapm.onlinelibrary.wiley.com/journal/24734209 ↗
http://www.aip.org/ ↗ - DOI:
- 10.1002/mp.15852 ↗
- Languages:
- English
- ISSNs:
- 0094-2405
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 5531.130000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 23228.xml