Efficient methods for implementation of multi-level nonrigid mass-preserving image registration on GPUs and multi-threaded CPUs. Issue 127 (April 2016)
- Record Type:
- Journal Article
- Title:
- Efficient methods for implementation of multi-level nonrigid mass-preserving image registration on GPUs and multi-threaded CPUs. Issue 127 (April 2016)
- Main Title:
- Efficient methods for implementation of multi-level nonrigid mass-preserving image registration on GPUs and multi-threaded CPUs
- Authors:
- Ellingwood, Nathan D.
Yin, Youbing
Smith, Matthew
Lin, Ching-Long - Abstract:
- Highlights: A Multilevel Transform Composite method for GPU accelerated registration is proposed. Method focused on mass-preserving registration of lung images with large deformation. Diffeomorphic MTC achieves computation and memory efficiencies, preserves accuracy. GPU speedup factor ranging from 6 to 112 achieved compared to CPU counterparts. Improved computational speed essential to analyze data for population-based studies. Abstract: Background and objective: Faster and more accurate methods for registration of images are important for research involved in conducting population-based studies that utilize medical imaging, as well as improvements for use in clinical applications. We present a novel computation- and memory-efficient multi-level method on graphics processing units (GPU) for performing registration of two computed tomography (CT) volumetric lung images. Methods: We developed a computation- and memory-efficient Diffeomorphic Multi-level B-Spline Transform Composite (DMTC) method to implement nonrigid mass-preserving registration of two CT lung images on GPU. The framework consists of a hierarchy of B-Spline control grids of increasing resolution. A similarity criterion known as the sum of squared tissue volume difference (SSTVD) was adopted to preserve lung tissue mass. The use of SSTVD consists of the calculation of the tissue volume, the Jacobian, and their derivatives, which makes its implementation on GPU challenging due to memory constraints. The use ofHighlights: A Multilevel Transform Composite method for GPU accelerated registration is proposed. Method focused on mass-preserving registration of lung images with large deformation. Diffeomorphic MTC achieves computation and memory efficiencies, preserves accuracy. GPU speedup factor ranging from 6 to 112 achieved compared to CPU counterparts. Improved computational speed essential to analyze data for population-based studies. Abstract: Background and objective: Faster and more accurate methods for registration of images are important for research involved in conducting population-based studies that utilize medical imaging, as well as improvements for use in clinical applications. We present a novel computation- and memory-efficient multi-level method on graphics processing units (GPU) for performing registration of two computed tomography (CT) volumetric lung images. Methods: We developed a computation- and memory-efficient Diffeomorphic Multi-level B-Spline Transform Composite (DMTC) method to implement nonrigid mass-preserving registration of two CT lung images on GPU. The framework consists of a hierarchy of B-Spline control grids of increasing resolution. A similarity criterion known as the sum of squared tissue volume difference (SSTVD) was adopted to preserve lung tissue mass. The use of SSTVD consists of the calculation of the tissue volume, the Jacobian, and their derivatives, which makes its implementation on GPU challenging due to memory constraints. The use of the DMTC method enabled reduced computation and memory storage of variables with minimal communication between GPU and Central Processing Unit (CPU) due to ability to pre-compute values. The method was assessed on six healthy human subjects. Results: Resultant GPU-generated displacement fields were compared against the previously validated CPU counterpart fields, showing good agreement with an average normalized root mean square error (nRMS) of 0.044 ± 0.015. Runtime and performance speedup are compared between single-threaded CPU, multi-threaded CPU, and GPU algorithms. Best performance speedup occurs at the highest resolution in the GPU implementation for the SSTVD cost and cost gradient computations, with a speedup of 112 times that of the single-threaded CPU version and 11 times over the twelve-threaded version when considering average time per iteration using a Nvidia Tesla K20X GPU. Conclusions: The proposed GPU-based DMTC method outperforms its multi-threaded CPU version in terms of runtime. Total registration time reduced runtime to 2.9 min on the GPU version, compared to 12.8 min on twelve-threaded CPU version and 112.5 min on a single-threaded CPU. Furthermore, the GPU implementation discussed in this work can be adapted for use of other cost functions that require calculation of the first derivatives. … (more)
- Is Part Of:
- Computer methods and programs in biomedicine. Issue 127(2016)
- Journal:
- Computer methods and programs in biomedicine
- Issue:
- Issue 127(2016)
- Issue Display:
- Volume 127, Issue 127 (2016)
- Year:
- 2016
- Volume:
- 127
- Issue:
- 127
- Issue Sort Value:
- 2016-0127-0127-0000
- Page Start:
- 290
- Page End:
- 300
- Publication Date:
- 2016-04
- Subjects:
- GPU -- Mass-preserving image registration -- Multi-level -- Lungs
Medicine -- Computer programs -- Periodicals
Biology -- Computer programs -- Periodicals
Computers -- Periodicals
Medicine -- Periodicals
Médecine -- Logiciels -- Périodiques
Biologie -- Logiciels -- Périodiques
Biology -- Computer programs
Medicine -- Computer programs
Periodicals
Electronic journals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/01692607 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.cmpb.2015.12.018 ↗
- Languages:
- English
- ISSNs:
- 0169-2607
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.095000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 1846.xml