Explaining nonlinear classification decisions with deep Taylor decomposition. (May 2017)
- Record Type:
- Journal Article
- Title:
- Explaining nonlinear classification decisions with deep Taylor decomposition. (May 2017)
- Main Title:
- Explaining nonlinear classification decisions with deep Taylor decomposition
- Authors:
- Montavon, Grégoire
Lapuschkin, Sebastian
Binder, Alexander
Samek, Wojciech
Müller, Klaus-Robert
- Abstract:
- Nonlinear methods such as Deep Neural Networks (DNNs) are the gold standard for various challenging machine learning problems such as image recognition. Although these methods perform impressively well, they have a significant disadvantage, the lack of transparency, limiting the interpretability of the solution and thus the scope of application in practice. Especially DNNs act as black boxes due to their multilayer nonlinear structure. In this paper we introduce a novel methodology for interpreting generic multilayer neural networks by decomposing the network classification decision into contributions of its input elements. Although our focus is on image classification, the method is applicable to a broad set of input data, learning tasks and network architectures. Our method called deep Taylor decomposition efficiently utilizes the structure of the network by backpropagating the explanations from the output to the input layer. We evaluate the proposed method empirically on the MNIST and ILSVRC data sets.
Highlights: A novel method to explain nonlinear classification decisions in terms of input variables is introduced. The method is based on Taylor expansions and decomposes the output of a deep neural network in terms of input variables. The resulting deep Taylor decomposition can be applied directly to existing neural networks without retraining. The method is tested on two large-scale neural networks for image classification: BVLC CaffeNet and GoogleNet. …
- Is Part Of:
- Pattern recognition. Volume 65(2017:May)
- Journal:
- Pattern recognition
- Issue:
- Volume 65(2017:May)
- Issue Display:
- Volume 65 (2017)
- Year:
- 2017
- Volume:
- 65
- Issue Sort Value:
- 2017-0065-0000-0000
- Page Start:
- 211
- Page End:
- 222
- Publication Date:
- 2017-05
- Subjects:
- Deep neural networks -- Heatmapping -- Taylor decomposition -- Relevance propagation -- Image recognition
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4
- Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203
http://www.sciencedirect.com/
- DOI:
- 10.1016/j.patcog.2016.11.008
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 2626.xml