A question-guided multi-hop reasoning graph network for visual question answering. Issue 2 (March 2023)
- Record Type:
- Journal Article
- Title:
- A question-guided multi-hop reasoning graph network for visual question answering. Issue 2 (March 2023)
- Main Title:
- A question-guided multi-hop reasoning graph network for visual question answering
- Authors:
- Xu, Zhaoyang
Gu, Jinguang
Liu, Maofu
Zhou, Guangyou
Fu, Haidong
Qiu, Chen - Abstract:
- Abstract: Visual Question Answering (VQA) requires reasoning about the visually-grounded relations in the image and question context. A crucial aspect of solving complex questions is reliable multi-hop reasoning, i.e., dynamically learning the interplay between visual entities in each step. In this paper, we investigate the potential of the reasoning graph network on multi-hop reasoning questions, especially over 3 "hops." We call this model QMRGT: A Question-Guided Multi-hop Reasoning Graph Network. It constructs a cross-modal interaction module (CIM) and a multi-hop reasoning graph network (MRGT) and infers an answer by dynamically updating the inter-associated instruction between two modalities. Our graph reasoning module can apply to any multi-modal model. The experiments on VQA 2.0 and GQA (in fully supervised and O.O.D settings) datasets show that both QMRGT and pre-training V&L models+MRGT lead to improvement on visual question answering tasks. Graph-based multi-hop reasoning provides an effective signal for the visual question answering challenge, both for the O.O.D and high-level reasoning questions. Highlights: We explore the "hop" distribution of GQA datasets spanning five types of questions. We design a multi-hop reasoning graph network to generate question instructions. We study the strength of our model in multi-hop reasoning and visual-explainable.
- Is Part Of:
- Information processing & management. Volume 60:Issue 2(2023)
- Journal:
- Information processing & management
- Issue:
- Volume 60:Issue 2(2023)
- Issue Display:
- Volume 60, Issue 2 (2023)
- Year:
- 2023
- Volume:
- 60
- Issue:
- 2
- Issue Sort Value:
- 2023-0060-0002-0000
- Page Start:
- Page End:
- Publication Date:
- 2023-03
- Subjects:
- Visual question answering -- Multi-hop reasoning -- Reasoning graph network
Information storage and retrieval systems -- Periodicals
Information science -- Periodicals
Systèmes d'information -- Périodiques
Sciences de l'information -- Périodiques
Information science
Information storage and retrieval systems
Periodicals
658.4038 - Journal URLs:
- http://www.sciencedirect.com/science/journal/03064573 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.ipm.2022.103207 ↗
- Languages:
- English
- ISSNs:
- 0306-4573
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4493.893000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 25674.xml