1399 H&E-stained sentinel lymph node sections of breast cancer patients: the CAMELYON dataset. Issue 6 (31st May 2018)
- Record Type:
- Journal Article
- Title:
- 1399 H&E-stained sentinel lymph node sections of breast cancer patients: the CAMELYON dataset. Issue 6 (31st May 2018)
- Main Title:
- 1399 H&E-stained sentinel lymph node sections of breast cancer patients: the CAMELYON dataset
- Authors:
- Litjens, Geert
Bandi, Peter
Ehteshami Bejnordi, Babak
Geessink, Oscar
Balkenhol, Maschenka
Bult, Peter
Halilovic, Altuna
Hermsen, Meyke
van de Loo, Rob
Vogels, Rob
Manson, Quirine F
Stathonikos, Nikolas
Baidoshvili, Alexi
van Diest, Paul
Wauters, Carla
van Dijk, Marcory
van der Laak, Jeroen - Abstract:
- Abstract: Background: The presence of lymph node metastases is one of the most important factors in breast cancer prognosis. The most common way to assess regional lymph node status is the sentinel lymph node procedure. The sentinel lymph node is the most likely lymph node to contain metastasized cancer cells and is excised, histopathologically processed, and examined by a pathologist. This tedious examination process is time-consuming and can lead to small metastases being missed. However, recent advances in whole-slide imaging and machine learning have opened an avenue for analysis of digitized lymph node sections with computer algorithms. For example, convolutional neural networks, a type of machine-learning algorithm, can be used to automatically detect cancer metastases in lymph nodes with high accuracy. To train machine-learning models, large, well-curated datasets are needed. Results: We released a dataset of 1, 399 annotated whole-slide images (WSIs) of lymph nodes, both with and without metastases, in 3 terabytes of data in the context of the CAMELYON16 and CAMELYON17 Grand Challenges. Slides were collected from five medical centers to cover a broad range of image appearance and staining variations. Each WSI has a slide-level label indicating whether it contains no metastases, macro-metastases, micro-metastases, or isolated tumor cells. Furthermore, for 209 WSIs, detailed hand-drawn contours for all metastases are provided. Last, open-source software tools toAbstract: Background: The presence of lymph node metastases is one of the most important factors in breast cancer prognosis. The most common way to assess regional lymph node status is the sentinel lymph node procedure. The sentinel lymph node is the most likely lymph node to contain metastasized cancer cells and is excised, histopathologically processed, and examined by a pathologist. This tedious examination process is time-consuming and can lead to small metastases being missed. However, recent advances in whole-slide imaging and machine learning have opened an avenue for analysis of digitized lymph node sections with computer algorithms. For example, convolutional neural networks, a type of machine-learning algorithm, can be used to automatically detect cancer metastases in lymph nodes with high accuracy. To train machine-learning models, large, well-curated datasets are needed. Results: We released a dataset of 1, 399 annotated whole-slide images (WSIs) of lymph nodes, both with and without metastases, in 3 terabytes of data in the context of the CAMELYON16 and CAMELYON17 Grand Challenges. Slides were collected from five medical centers to cover a broad range of image appearance and staining variations. Each WSI has a slide-level label indicating whether it contains no metastases, macro-metastases, micro-metastases, or isolated tumor cells. Furthermore, for 209 WSIs, detailed hand-drawn contours for all metastases are provided. Last, open-source software tools to visualize and interact with the data have been made available. Conclusions: A unique dataset of annotated, whole-slide digital histopathology images has been provided with high potential for re-use. … (more)
- Is Part Of:
- GigaScience. Volume 7:Issue 6(2018)
- Journal:
- GigaScience
- Issue:
- Volume 7:Issue 6(2018)
- Issue Display:
- Volume 7, Issue 6 (2018)
- Year:
- 2018
- Volume:
- 7
- Issue:
- 6
- Issue Sort Value:
- 2018-0007-0006-0000
- Page Start:
- Page End:
- Publication Date:
- 2018-05-31
- Subjects:
- breast cancer -- lymph node metastases -- whole-slide images -- grand challenge -- sentinel node
Information storage and retrieval systems -- Research -- Periodicals
Biology -- Research -- Periodicals
Medical sciences -- Research -- Periodicals
Database management -- Periodicals
570.285 - Journal URLs:
- http://www.gigasciencejournal.com/ ↗
http://www.oxfordjournals.org/ ↗ - DOI:
- 10.1093/gigascience/giy065 ↗
- Languages:
- English
- ISSNs:
- 2047-217X
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 12413.xml