Comparing ensemble strategies for deep learning: An application to facial expression recognition. (1st December 2019)
- Record Type:
- Journal Article
- Title:
- Comparing ensemble strategies for deep learning: An application to facial expression recognition. (1st December 2019)
- Main Title:
- Comparing ensemble strategies for deep learning: An application to facial expression recognition
- Authors:
- Renda, Alessandro
Barsacchi, Marco
Bechini, Alessio
Marcelloni, Francesco - Abstract:
- Highlights: CNNs can be used as solutions to the Facial Expression Recognition (FER) problem. Different CNN ensembling strategies are experimentally compared in the FER context. Exploiting different sources of variability is crucial to improve the accuracy of a CNN ensemble. Pre-training is found to be the most effective strategy to build CNNs ensembles for FER. Abstract: Recent works have shown that Convolutional Neural Networks (CNNs), because of their effectiveness in feature extraction and classification tasks, are suitable tools to address the Facial Expression Recognition (FER) problem. Further, it has been pointed out how ensembles of CNNs allow improving classification accuracy. Nevertheless, a detailed experimental analysis on how ensembles of CNNs could be effectively generated in the FER context has not been performed yet, although it would have considerable value for improving the results obtained in the FER task. This paper aims to present an extensive investigation on different aspects of the ensemble generation, focusing on the factors that influence the classification accuracy on the FER context. In particular, we evaluate several strategies for the ensemble generation, different aggregation schemes, and the dependence upon the number of base classifiers in the ensemble. The final objective is to provide some indications for building up effective ensembles of CNNs. Specifically, we observed that exploiting different sources of variability is crucial for theHighlights: CNNs can be used as solutions to the Facial Expression Recognition (FER) problem. Different CNN ensembling strategies are experimentally compared in the FER context. Exploiting different sources of variability is crucial to improve the accuracy of a CNN ensemble. Pre-training is found to be the most effective strategy to build CNNs ensembles for FER. Abstract: Recent works have shown that Convolutional Neural Networks (CNNs), because of their effectiveness in feature extraction and classification tasks, are suitable tools to address the Facial Expression Recognition (FER) problem. Further, it has been pointed out how ensembles of CNNs allow improving classification accuracy. Nevertheless, a detailed experimental analysis on how ensembles of CNNs could be effectively generated in the FER context has not been performed yet, although it would have considerable value for improving the results obtained in the FER task. This paper aims to present an extensive investigation on different aspects of the ensemble generation, focusing on the factors that influence the classification accuracy on the FER context. In particular, we evaluate several strategies for the ensemble generation, different aggregation schemes, and the dependence upon the number of base classifiers in the ensemble. The final objective is to provide some indications for building up effective ensembles of CNNs. Specifically, we observed that exploiting different sources of variability is crucial for the improvement of the overall accuracy. To this aim, pre-processing and pre-training procedures are able to provide a satisfactory variability across the base classifiers, while the use of different seeds does not appear as an effective solution. Bagging ensures a high ensemble gain, but the overall accuracy is limited by poor-performing base classifiers. The impact of increasing the ensemble size specifically depends on the adopted strategy, but also in the best case the performance gain obtained by involving additional base classifiers becomes not significant beyond a certain limit size, thus suggesting to avoid very large ensembles. Finally, the classic averaging voting proves to be an appropriate aggregation scheme, achieving accuracy values comparable to or slightly better than the other experimented operators. … (more)
- Is Part Of:
- Expert systems with applications. Volume 136(2019)
- Journal:
- Expert systems with applications
- Issue:
- Volume 136(2019)
- Issue Display:
- Volume 136, Issue 2019 (2019)
- Year:
- 2019
- Volume:
- 136
- Issue:
- 2019
- Issue Sort Value:
- 2019-0136-2019-0000
- Page Start:
- 1
- Page End:
- 11
- Publication Date:
- 2019-12-01
- Subjects:
- Facial Expression Recognition -- Convolutional Neural Networks -- Ensemble learning -- Ensemble construction
Expert systems (Computer science) -- Periodicals
Systèmes experts (Informatique) -- Périodiques
Electronic journals
006.33 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09574174 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.eswa.2019.06.025 ↗
- Languages:
- English
- ISSNs:
- 0957-4174
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3842.004220
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 11397.xml