SuperFormer: Continual learning superposition method for text classification. (April 2023)
- Record Type:
- Journal Article
- Title:
- SuperFormer: Continual learning superposition method for text classification. (April 2023)
- Main Title:
- SuperFormer: Continual learning superposition method for text classification
- Authors:
- Zeman, Marko
Pucer, Jana Faganeli
Kononenko, Igor
Bosnić, Zoran - Abstract:
- Abstract: One of the biggest challenges in continual learning domains is the tendency of machine learning models to forget previously learned information over time. While overcoming this issue, the existing approaches often exploit large amounts of additional memory and apply model forgetting mitigation mechanisms which substantially prolong the training process. Therefore, we propose a novel SuperFormer method that alleviates model forgetting, while spending negligible additional memory and time. We tackle the continual learning challenges in a learning scenario, where we learn different tasks in a sequential order. We compare our method against several prominent continual learning methods, i.e., EWC, SI, MAS, GEM, PSP, etc. on a set of text classification tasks. We achieve the best average performance in terms of AUROC and AUPRC (0.7% and 0.9% gain on average, respectively) and the lowest training time among all the methods of comparison. On average, our method reduces the total training time by a factor of 5.4-8.5 in comparison to similarly performing methods. In terms of the additional memory, our method is on par with the most memory-efficient approaches.
- Is Part Of:
- Neural networks. Volume 161(2023)
- Journal:
- Neural networks
- Issue:
- Volume 161(2023)
- Issue Display:
- Volume 161, Issue 2023 (2023)
- Year:
- 2023
- Volume:
- 161
- Issue:
- 2023
- Issue Sort Value:
- 2023-0161-2023-0000
- Page Start:
- 418
- Page End:
- 436
- Publication Date:
- 2023-04
- Subjects:
- Deep learning -- Continual learning -- Superposition -- Transformers
Neural computers -- Periodicals
Neural networks (Computer science) -- Periodicals
Neural networks (Neurobiology) -- Periodicals
Nervous System -- Periodicals
Ordinateurs neuronaux -- Périodiques
Réseaux neuronaux (Informatique) -- Périodiques
Réseaux neuronaux (Neurobiologie) -- Périodiques
Neural computers
Neural networks (Computer science)
Neural networks (Neurobiology)
Periodicals
006.32 - Journal URLs:
- http://www.sciencedirect.com/science/journal/08936080 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.neunet.2023.01.040 ↗
- Languages:
- English
- ISSNs:
- 0893-6080
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 6081.280800
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 26310.xml