Comparing artificial intelligence algorithms to 157 German dermatologists: the melanoma classification benchmark. (April 2019)

Record Type:: Journal Article
Title:: Comparing artificial intelligence algorithms to 157 German dermatologists: the melanoma classification benchmark. (April 2019)
Main Title:: Comparing artificial intelligence algorithms to 157 German dermatologists: the melanoma classification benchmark
Authors:: Brinker, Titus J.
Hekler, Achim
Hauschild, Axel
Berking, Carola
Schilling, Bastian
Enk, Alexander H.
Haferkamp, Sebastian
Karoglan, Ante
von Kalle, Christof
Weichenthal, Michael
Sattler, Elke
Schadendorf, Dirk
Gaiser, Maria R.
Klode, Joachim
Utikal, Jochen S.
Abstract:: Abstract: Background: Several recent publications have demonstrated the use of convolutional neural networks to classify images of melanoma at par with board-certified dermatologists. However, the non-availability of a public human benchmark restricts the comparability of the performance of these algorithms and thereby the technical progress in this field. Methods: An electronic questionnaire was sent to dermatologists at 12 German university hospitals. Each questionnaire comprised 100 dermoscopic and 100 clinical images (80 nevi images and 20 biopsy-verified melanoma images, each), all open-source. The questionnaire recorded factors such as the years of experience in dermatology, performed skin checks, age, sex and the rank within the university hospital or the status as resident physician. For each image, the dermatologists were asked to provide a management decision (treat/biopsy lesion or reassure the patient). Main outcome measures were sensitivity, specificity and the receiver operating characteristics (ROC). Results: Total 157 dermatologists assessed all 100 dermoscopic images with an overall sensitivity of 74.1%, specificity of 60.0% and an ROC of 0.67 (range = 0.538–0.769); 145 dermatologists assessed all 100 clinical images with an overall sensitivity of 89.4%, specificity of 64.4% and an ROC of 0.769 (range = 0.613–0.9). Results between test-sets were significantly different ( P < 0.05) confirming the need for a standardised benchmark. Conclusions: We present the … (more)
Is Part Of:: European journal of cancer. Volume 111(2019)
Journal:: European journal of cancer
Issue:: Volume 111(2019)
Issue Display:: Volume 111, Issue 2019 (2019)
Year:: 2019
Volume:: 111
Issue:: 2019
Issue Sort Value:: 2019-0111-2019-0000
Page Start:: 30
Page End:: 37
Publication Date:: 2019-04
Subjects:: Benchmark -- Artificial intelligence -- Deep learning -- Melanoma
Cancer -- Periodicals
Neoplasms -- Periodicals
Cancer -- Périodiques
Cancer
Tumors
Electronic journals
Periodicals
Electronic journals
616.994
Journal URLs:: http://www.sciencedirect.com/science/journal/09598049 ↗
http://rzblx1.uni-regensburg.de/ezeit/warpto.phtml?colors=7&jour_id=2879 ↗
http://www.clinicalkey.com/dura/browse/journalIssue/09598049 ↗
http://www.clinicalkey.com.au/dura/browse/journalIssue/09598049 ↗
http://www.elsevier.com/journals ↗
DOI:: 10.1016/j.ejca.2018.12.016 ↗
Languages:: English
ISSNs:: 0959-8049
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 3829.725100
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store
Ingest File:: 9677.xml