Designing architectures of convolutional neural networks to solve practical problems. (15th March 2018)
- Record Type:
- Journal Article
- Title:
- Designing architectures of convolutional neural networks to solve practical problems. (15th March 2018)
- Main Title:
- Designing architectures of convolutional neural networks to solve practical problems
- Authors:
- Ferreira, Martha Dais
Corrêa, Débora Cristina
Nonato, Luis Gustavo
de Mello, Rodrigo Fernandes - Abstract:
- Highlights: Our approach aims to support the estimation of Convolutional Neural Network (CNN) parameters. It intends to produce simpler CNN, reducing the complexity. This estimation was based on False Nearest Neighbors method. Caffe deep learning framework was used to conduct the training of CNN. Our results are comparable even to very complex and empirical CNN architectures. Abstract: The Convolutional Neural Network (CNN) figures among the state-of-the-art Deep Learning (DL) algorithms due to its robustness to support data shift, scale variations, and its capability of extracting relevant information from large-scale input data. However, setting appropriate parameters to define CNN architectures is still a challenging issue, mainly to tackle real-world problems. A typical approach consists in empirically assessing different CNN settings in order to select the most appropriate one. This procedure has clear limitations, including the choice of suitable predefined configurations as well as the high computational cost involved in evaluating each of them. This work presents a novel methodology to tackle the previously mentioned issues, providing mechanisms to estimate effective CNN configurations, including the size of convolutional masks (convolutional kernels) and the number of convolutional units (CNN neurons) per layer. Based on the False Nearest Neighbors (FNN), a well-known tool from the area of Dynamical Systems, the proposed method helps estimating CNN architecturesHighlights: Our approach aims to support the estimation of Convolutional Neural Network (CNN) parameters. It intends to produce simpler CNN, reducing the complexity. This estimation was based on False Nearest Neighbors method. Caffe deep learning framework was used to conduct the training of CNN. Our results are comparable even to very complex and empirical CNN architectures. Abstract: The Convolutional Neural Network (CNN) figures among the state-of-the-art Deep Learning (DL) algorithms due to its robustness to support data shift, scale variations, and its capability of extracting relevant information from large-scale input data. However, setting appropriate parameters to define CNN architectures is still a challenging issue, mainly to tackle real-world problems. A typical approach consists in empirically assessing different CNN settings in order to select the most appropriate one. This procedure has clear limitations, including the choice of suitable predefined configurations as well as the high computational cost involved in evaluating each of them. This work presents a novel methodology to tackle the previously mentioned issues, providing mechanisms to estimate effective CNN configurations, including the size of convolutional masks (convolutional kernels) and the number of convolutional units (CNN neurons) per layer. Based on the False Nearest Neighbors (FNN), a well-known tool from the area of Dynamical Systems, the proposed method helps estimating CNN architectures that are less complex and produce good results. Our experiments confirm that architectures estimated through the proposed approach are as effective as the complex ones defined by empirical and computationally intensive strategies. … (more)
- Is Part Of:
- Expert systems with applications. Volume 94(2018)
- Journal:
- Expert systems with applications
- Issue:
- Volume 94(2018)
- Issue Display:
- Volume 94, Issue 2018 (2018)
- Year:
- 2018
- Volume:
- 94
- Issue:
- 2018
- Issue Sort Value:
- 2018-0094-2018-0000
- Page Start:
- 205
- Page End:
- 217
- Publication Date:
- 2018-03-15
- Subjects:
- Convolutional neural network -- Architecture assessment -- Dynamical systems -- Handwritten digit recognition -- Face recognition -- Object recognition
Expert systems (Computer science) -- Periodicals
Systèmes experts (Informatique) -- Périodiques
Electronic journals
006.33 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09574174 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.eswa.2017.10.052 ↗
- Languages:
- English
- ISSNs:
- 0957-4174
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3842.004220
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 5323.xml