Working Memory Connections for LSTM. (December 2021)
- Record Type:
- Journal Article
- Title:
- Working Memory Connections for LSTM. (December 2021)
- Main Title:
- Working Memory Connections for LSTM
- Authors:
- Landi, Federico
Baraldi, Lorenzo
Cornia, Marcella
Cucchiara, Rita
- Abstract:
- Recurrent Neural Networks with Long Short-Term Memory (LSTM) make use of gating mechanisms to mitigate exploding and vanishing gradients when learning long-term dependencies. For this reason, LSTMs and other gated RNNs are widely adopted, being the de facto standard for many sequence modeling tasks. Although the memory cell inside the LSTM contains essential information, it is not allowed to influence the gating mechanism directly. In this work, we improve the gate potential by including information coming from the internal cell state. The proposed modification, named Working Memory Connection, consists of adding a learnable nonlinear projection of the cell content into the network gates. This modification can fit into the classical LSTM gates without any assumption on the underlying task, and is particularly effective when dealing with longer sequences. Previous research efforts in this direction, which go back to the early 2000s, could not bring a consistent improvement over vanilla LSTM. As part of this paper, we identify a key issue tied to previous connections that heavily limits their effectiveness, hence preventing a successful integration of the knowledge coming from the internal cell state. We show through extensive experimental evaluation that Working Memory Connections consistently improve the performance of LSTMs on a variety of tasks. Numerical results suggest that the cell state contains useful information that is worth including in the gate structure.
- Is Part Of:
- Neural networks. Volume 144(2021)
- Journal:
- Neural networks
- Issue:
- Volume 144(2021)
- Issue Display:
- Volume 144, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 144
- Issue:
- 2021
- Issue Sort Value:
- 2021-0144-2021-0000
- Page Start:
- 334
- Page End:
- 341
- Publication Date:
- 2021-12
- Subjects:
- Long Short-Term Memory networks -- Cell-to-gate connections -- Gated RNNs -- Language modeling -- Image captioning
Neural computers -- Periodicals
Neural networks (Computer science) -- Periodicals
Neural networks (Neurobiology) -- Periodicals
Nervous System -- Periodicals
Ordinateurs neuronaux -- Périodiques
Réseaux neuronaux (Informatique) -- Périodiques
Réseaux neuronaux (Neurobiologie) -- Périodiques
Neural computers
Neural networks (Computer science)
Neural networks (Neurobiology)
Periodicals
006.32
- Journal URLs:
- http://www.sciencedirect.com/science/journal/08936080
http://www.elsevier.com/journals
- DOI:
- 10.1016/j.neunet.2021.08.030
- Languages:
- English
- ISSNs:
- 0893-6080
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 6081.280800
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 21069.xml
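
The abstract in this record describes adding "a learnable nonlinear projection of the cell content into the network gates". The sketch below illustrates one way such a cell-to-gate term could look in a single LSTM step. It is not taken from the article itself: the use of tanh as the nonlinearity, the choice of which cell state feeds which gate, and all names (`init_params`, `lstm_wmc_step`, the `W_c*` matrices) are assumptions made for illustration only.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def matvec(W, v):
    # Plain matrix-vector product on nested lists.
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def add(*vecs):
    # Element-wise sum of any number of equal-length vectors.
    return [sum(vals) for vals in zip(*vecs)]


def init_params(input_size, hidden_size, seed=0):
    # Hypothetical helper: small random weights, zero biases.
    rng = random.Random(seed)

    def mat(rows, cols):
        return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]

    p = {}
    for gate in ("i", "f", "o", "g"):
        p["W_x" + gate] = mat(hidden_size, input_size)
        p["W_h" + gate] = mat(hidden_size, hidden_size)
        p["b_" + gate] = [0.0] * hidden_size
    # Assumed cell-to-gate projections, one per gate (i, f, o).
    for gate in ("i", "f", "o"):
        p["W_c" + gate] = mat(hidden_size, hidden_size)
    return p


def lstm_wmc_step(x, h_prev, c_prev, p):
    def pre(gate):
        # Standard LSTM pre-activation from input and previous hidden state.
        return add(matvec(p["W_x" + gate], x),
                   matvec(p["W_h" + gate], h_prev),
                   p["b_" + gate])

    def cell_proj(gate, c):
        # Assumption: the "nonlinear projection" is tanh of a linear map of c.
        return [math.tanh(v) for v in matvec(p["W_c" + gate], c)]

    # Input and forget gates see the previous cell state (assumed).
    i = [sigmoid(v) for v in add(pre("i"), cell_proj("i", c_prev))]
    f = [sigmoid(v) for v in add(pre("f"), cell_proj("f", c_prev))]
    g = [math.tanh(v) for v in pre("g")]
    c = [fv * cp + iv * gv for fv, cp, iv, gv in zip(f, c_prev, i, g)]
    # Output gate sees the freshly updated cell state (assumed).
    o = [sigmoid(v) for v in add(pre("o"), cell_proj("o", c))]
    h = [ov * math.tanh(cv) for ov, cv in zip(o, c)]
    return h, c


if __name__ == "__main__":
    params = init_params(input_size=3, hidden_size=4)
    h, c = [0.0] * 4, [0.0] * 4
    for _ in range(5):
        h, c = lstm_wmc_step([1.0, -0.5, 0.25], h, c, params)
    print(h)
```

Setting every `W_c*` matrix to zero makes the extra term vanish (tanh of zero is zero), so under these assumptions the step reduces to an ordinary LSTM step.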