Variational joint self‐attention for image captioning. Issue 8 (8th March 2022)
- Record Type:
- Journal Article
- Title:
- Variational joint self‐attention for image captioning. Issue 8 (8th March 2022)
- Main Title:
- Variational joint self‐attention for image captioning
- Authors:
- Shao, Xiangjun
Xiang, Zhenglong
Li, Yuanxiang
Zhang, Mingjie
- Abstract:
- The image captioning task has attracted great attention from many researchers, and significant progress has been made in the past few years. Existing image captioning models, which mainly apply an attention‐based encoder‐decoder architecture, have achieved great progress in image captioning. These attention‐based models, however, are limited in caption generation by potential errors resulting from inaccurate object detection and incorrect attention to objects. To alleviate this limitation, a Variational Joint Self‐Attention model (VJSA) is proposed to learn a latent semantic alignment between the given image and its label description to guide better image captioning. Unlike existing image captioning models, VJSA first uses a self‐attention module to encode intra‐sequence and inter‐sequence relationship information. A variational neural inference module then learns a distribution over the latent semantic alignment between the image and its corresponding description. During decoding, the learned semantic alignment guides the decoder to generate higher‐quality image captions. Experimental results show that VJSA outperforms the compared models, and its performance across various metrics shows that the proposed model is effective and feasible for image caption generation.
- Is Part Of:
- IET image processing. Volume 16:Issue 8(2022)
- Journal:
- IET image processing
- Issue:
- Volume 16:Issue 8(2022)
- Issue Display:
- Volume 16, Issue 8 (2022)
- Year:
- 2022
- Volume:
- 16
- Issue:
- 8
- Issue Sort Value:
- 2022-0016-0008-0000
- Page Start:
- 2075
- Page End:
- 2086
- Publication Date:
- 2022-03-08
- Subjects:
- Image processing -- Periodicals
621.36705
- Journal URLs:
- http://digital-library.theiet.org/content/journals/iet-ipr
http://ieeexplore.ieee.org/servlet/opac?punumber=4149689
http://www.ietdl.org/IET-IPR
https://ietresearch.onlinelibrary.wiley.com/journal/17519667
http://www.theiet.org/
- DOI:
- 10.1049/ipr2.12470
- Languages:
- English
- ISSNs:
- 1751-9659
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 4363.252600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 21492.xml