Fairness in AI: are deep learning-based CMR segmentation algorithms biased?. (14th October 2021)
- Record Type:
- Journal Article
- Title:
- Fairness in AI: are deep learning-based CMR segmentation algorithms biased?. (14th October 2021)
- Main Title:
- Fairness in AI: are deep learning-based CMR segmentation algorithms biased?
- Authors:
- Puyol Anton, E
Ruijsink, B
Piechnik, S K
Neubauer, S
Petersen, S E
Razavi, R
King, A P - Abstract:
- Abstract: Background/Introduction: Artificial intelligence (AI) is providing opportunities to transform cardiovascular medicine. A particular challenge in the application of AI technology is their potential for intrinsic and extrinsic biases, such as those based on gender and/or ethnicity. Unless satisfactorily addressed, these biases could lead to inequalities in early diagnosis, treatments and outcomes. Fairness in AI is a relatively new but fast-growing research field which deals with assessing and addressing potential bias in AI models. Purpose: To perform the first analysis that assesses bias in AI-based cardiac MR segmentation models in a large-scale database. Methods: A state-of-the-art deep learning (DL) based segmentation network, the "nnU-Net" framework [1], was used for automatic segmentation of both ventricles and the myocardium from cine short-axis cardiac MR over the full cardiac cycle. The dataset used consisted of end-diastole and end-systole short-axis cine cardiac MR images of 5, 903 subjects (61.5±7.1 years). The nnU-Net network was trained and evaluated using a 5-fold cross validation (splits: train 60% / validation 20% / test 20%). Data on race and gender were obtained from the UK Biobank database and their distribution is summarized in Figure 1. To assess gender and racial bias in the segmentation network, we compared the Dice scores - which measure the overlap between manual and automatic segmentations – and the absolute error in measurements ofAbstract: Background/Introduction: Artificial intelligence (AI) is providing opportunities to transform cardiovascular medicine. A particular challenge in the application of AI technology is their potential for intrinsic and extrinsic biases, such as those based on gender and/or ethnicity. Unless satisfactorily addressed, these biases could lead to inequalities in early diagnosis, treatments and outcomes. Fairness in AI is a relatively new but fast-growing research field which deals with assessing and addressing potential bias in AI models. Purpose: To perform the first analysis that assesses bias in AI-based cardiac MR segmentation models in a large-scale database. Methods: A state-of-the-art deep learning (DL) based segmentation network, the "nnU-Net" framework [1], was used for automatic segmentation of both ventricles and the myocardium from cine short-axis cardiac MR over the full cardiac cycle. The dataset used consisted of end-diastole and end-systole short-axis cine cardiac MR images of 5, 903 subjects (61.5±7.1 years). The nnU-Net network was trained and evaluated using a 5-fold cross validation (splits: train 60% / validation 20% / test 20%). Data on race and gender were obtained from the UK Biobank database and their distribution is summarized in Figure 1. To assess gender and racial bias in the segmentation network, we compared the Dice scores - which measure the overlap between manual and automatic segmentations – and the absolute error in measurements of biventricular volumes and function between patients grouped by ethnicity and gender. Results: Figure 2 shows the Dice scores and the volumetric and functional measures for the full database, stratified by gender and by ethnicity. Results on the overall population showed an excellent agreement between the manual and automatic segmentations which is consistent with previous reported results [2–3]. However, we find statistically significant differences in Dice scores as well as volumetric measures between different ethnicities, showing that the segmentation network is biased against minority racial groups. No significant differences were found in Dice scores between genders. Similarly, for the end diastolic, end systolic volumes and ejection fraction, there were statistically significant differences in absolute error between the overall population and all racial groups except white. Conclusion(s): We have shown, for the first time, that racial bias exists in DL-based cardiac MR segmentation models. Our hypothesis is that this bias is a result of the unbalanced nature of the training data, and this is supported by the results which show that there is racial bias but not gender bias when trained using the UK Biobank database, which is gender-balanced but not race-balanced. In this work we want to highlight the potential issue of bias in DL-based image segmentation models when translating into a clinical environment. FUNDunding Acknowledgement: Type of funding sources: Public grant(s) – National budget only. Main funding source(s): - EPSRC- Wellcome EPSRC Centre for Medical Engineering at the School of Biomedical Engineering and Imaging Sciences, King's College London … (more)
- Is Part Of:
- European heart journal. Volume 42(2021)Supplement 1
- Journal:
- European heart journal
- Issue:
- Volume 42(2021)Supplement 1
- Issue Display:
- Volume 42, Issue 1 (2021)
- Year:
- 2021
- Volume:
- 42
- Issue:
- 1
- Issue Sort Value:
- 2021-0042-0001-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-10-14
- Subjects:
- Artificial Intelligence (Machine Learning, Deep Learning)
Cardiology -- Periodicals
Heart -- Diseases -- Periodicals
616.12005 - Journal URLs:
- http://eurheartj.oxfordjournals.org/ ↗
http://ukcatalogue.oup.com/ ↗ - DOI:
- 10.1093/eurheartj/ehab724.3055 ↗
- Languages:
- English
- ISSNs:
- 0195-668X
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3829.717500
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 25630.xml