Continual learning for recurrent neural networks: An empirical evaluation. (November 2021)
- Record Type:
- Journal Article
- Title:
- Continual learning for recurrent neural networks: An empirical evaluation. (November 2021)
- Main Title:
- Continual learning for recurrent neural networks: An empirical evaluation
- Authors:
- Cossu, Andrea
Carta, Antonio
Lomonaco, Vincenzo
Bacciu, Davide - Abstract:
- Abstract: Learning continuously during all model lifetime is fundamental to deploy machine learning solutions robust to drifts in the data distribution. Advances in Continual Learning (CL) with recurrent neural networks could pave the way to a large number of applications where incoming data is non stationary, like natural language processing and robotics. However, the existing body of work on the topic is still fragmented, with approaches which are application-specific and whose assessment is based on heterogeneous learning protocols and datasets. In this paper, we organize the literature on CL for sequential data processing by providing a categorization of the contributions and a review of the benchmarks. We propose two new benchmarks for CL with sequential data based on existing datasets, whose characteristics resemble real-world applications. We also provide a broad empirical evaluation of CL and Recurrent Neural Networks in class-incremental scenario, by testing their ability to mitigate forgetting with a number of different strategies which are not specific to sequential data processing. Our results highlight the key role played by the sequence length and the importance of a clear specification of the CL scenario. Highlights: Experimental evaluation of recurrent neural networks in continual learning. Literature review on recurrent neural networks in continual learning. Benchmarks review for sequential data processing in continual learning. Experimental analysis of theAbstract: Learning continuously during all model lifetime is fundamental to deploy machine learning solutions robust to drifts in the data distribution. Advances in Continual Learning (CL) with recurrent neural networks could pave the way to a large number of applications where incoming data is non stationary, like natural language processing and robotics. However, the existing body of work on the topic is still fragmented, with approaches which are application-specific and whose assessment is based on heterogeneous learning protocols and datasets. In this paper, we organize the literature on CL for sequential data processing by providing a categorization of the contributions and a review of the benchmarks. We propose two new benchmarks for CL with sequential data based on existing datasets, whose characteristics resemble real-world applications. We also provide a broad empirical evaluation of CL and Recurrent Neural Networks in class-incremental scenario, by testing their ability to mitigate forgetting with a number of different strategies which are not specific to sequential data processing. Our results highlight the key role played by the sequence length and the importance of a clear specification of the CL scenario. Highlights: Experimental evaluation of recurrent neural networks in continual learning. Literature review on recurrent neural networks in continual learning. Benchmarks review for sequential data processing in continual learning. Experimental analysis of the effect of sequence length on catastrophic forgetting. Multi head and single head comparison for recurrent models in continual learning. … (more)
- Is Part Of:
- Neural networks. Volume 143(2021)
- Journal:
- Neural networks
- Issue:
- Volume 143(2021)
- Issue Display:
- Volume 143, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 143
- Issue:
- 2021
- Issue Sort Value:
- 2021-0143-2021-0000
- Page Start:
- 607
- Page End:
- 627
- Publication Date:
- 2021-11
- Subjects:
- Continual learning -- Recurrent neural networks -- Benchmarks -- Evaluation
Neural computers -- Periodicals
Neural networks (Computer science) -- Periodicals
Neural networks (Neurobiology) -- Periodicals
Nervous System -- Periodicals
Ordinateurs neuronaux -- Périodiques
Réseaux neuronaux (Informatique) -- Périodiques
Réseaux neuronaux (Neurobiologie) -- Périodiques
Neural computers
Neural networks (Computer science)
Neural networks (Neurobiology)
Periodicals
006.32 - Journal URLs:
- http://www.sciencedirect.com/science/journal/08936080 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.neunet.2021.07.021 ↗
- Languages:
- English
- ISSNs:
- 0893-6080
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 6081.280800
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 19902.xml