Framework for large-scale automatic curation of heterogeneous cardiac MRI (ACUR MRI). (13th July 2021)
- Record Type:
- Journal Article
- Title:
- Framework for large-scale automatic curation of heterogeneous cardiac MRI (ACUR MRI). (13th July 2021)
- Main Title:
- Framework for large-scale automatic curation of heterogeneous cardiac MRI (ACUR MRI)
- Authors:
- Galazis, C
Vimalesvaran, K
Zaman, S
Petri, C
Howard, J
Linton, N
Peters, N
Cole, G
Bharath, AA
Varela, M - Abstract:
- Abstract: Funding Acknowledgements: Type of funding sources: Public grant(s) – National budget only. Main funding source(s): UKRI CDT in AI for Healthcare http://ai4health.io and British Heart Foundation Background Data curation is an important process that structures and organises data, supporting research and the development of artificial intelligence models. However, manually curating a large volume of medical data is a time-consuming, repetitive and costly process that puts additional strain on clinical experts. The curation becomes more complex and demanding as more data sources are used. This leads to an introduction of disparity in the data structure and protocols. Purpose Here, we propose an automatic framework to curate large volumes of heterogenous cardiac MRI scans acquired across different sites and scanner vendors. Our framework requires minimal expert involvement throughout and works directly on DICOM images from the scanner or PACS. The resulting structured standardised data allow for straightforward image analysis, hypothesis testing and the training and application of artificial intelligence models. Methods It is broken down into three main components: anonymisation, cataloguing and outlier detection (see Figure 1). Anonymisation automatically removes any identifiable patient information from the DICOM image attributes. These data are replaced with anonymised labels, whilst maintaining relevant longitudinal information from each patient. DICOMAbstract: Funding Acknowledgements: Type of funding sources: Public grant(s) – National budget only. Main funding source(s): UKRI CDT in AI for Healthcare http://ai4health.io and British Heart Foundation Background Data curation is an important process that structures and organises data, supporting research and the development of artificial intelligence models. However, manually curating a large volume of medical data is a time-consuming, repetitive and costly process that puts additional strain on clinical experts. The curation becomes more complex and demanding as more data sources are used. This leads to an introduction of disparity in the data structure and protocols. Purpose Here, we propose an automatic framework to curate large volumes of heterogenous cardiac MRI scans acquired across different sites and scanner vendors. Our framework requires minimal expert involvement throughout and works directly on DICOM images from the scanner or PACS. The resulting structured standardised data allow for straightforward image analysis, hypothesis testing and the training and application of artificial intelligence models. Methods It is broken down into three main components: anonymisation, cataloguing and outlier detection (see Figure 1). Anonymisation automatically removes any identifiable patient information from the DICOM image attributes. These data are replaced with anonymised labels, whilst maintaining relevant longitudinal information from each patient. DICOM attributes are also used to automatically group the different images according to imaging sequence (e.g. CINE, Delayed-Enhancement, T1 maps), acquisition geometry (e.g. short-axis, 2-chamber, 4-chamber) and imaging attributes (e.g. slice thickness, TE, TR), for easier querying. The sorting characteristics are flexible and can easily be defined by the user. Finally, we detect and flag, for subsequent manual inspection, any outliers within those groups, based on the similarity levels of chosen DICOM attributes. This framework additionally offers interactive image visualisation to allow users to assess its performance in real time. Results We tested the performance of ACUR CMRI on 26, 668 CMR image series (723, 531 images) from 858 patient examinations, which took place across two sites in four different scanners. With an average execution time per patient of 100 seconds, ACUR was able to sort imaging data with 1191 different sequence names into 43 categories. The framework can be freely downloaded from https://bitbucket.org/cmr-ai-working-group/acur/ . Conclusions We present ACUR, an automatic framework to curate large volumes of heterogeneous cardiac MRI data. We show how it can quickly and automatically curate data, grouping it according to desired imaging characteristics defined in DICOM attributes. The proposed framework is flexible and ideally suited as a pre-processing tool for large biomedical imaging data studies. … (more)
- Is Part Of:
- European heart journal. Volume 22(2021)Supplement 2
- Journal:
- European heart journal
- Issue:
- Volume 22(2021)Supplement 2
- Issue Display:
- Volume 22, Issue 2 (2021)
- Year:
- 2021
- Volume:
- 22
- Issue:
- 2
- Issue Sort Value:
- 2021-0022-0002-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-07-13
- Subjects:
- Cardiovascular system -- Imaging -- Periodicals
Heart -- Imaging -- Periodicals
616.10754 - Journal URLs:
- http://ehjcimaging.oxfordjournals.org/ ↗
http://ukcatalogue.oup.com/ ↗ - DOI:
- 10.1093/ehjci/jeab090.131 ↗
- Languages:
- English
- ISSNs:
- 2047-2404
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 17478.xml