Visual context learning based on textual knowledge for image–text retrieval. (August 2022)

Record Type:: Journal Article
Title:: Visual context learning based on textual knowledge for image–text retrieval. (August 2022)
Main Title:: Visual context learning based on textual knowledge for image–text retrieval
Authors:: Qin, Yuzhuo
Gu, Xiaodong
Tan, Zhenshan
Abstract:: Abstract: Image–text bidirectional retrieval is a significant task within cross-modal learning field. The main issue lies on the jointly embedding learning and accurately measuring image–text matching score. Most prior works make use of either intra-modality methods performing within two separate modalities or inter-modality ones combining two modalities tightly. However, intra-modality methods remain ambiguous when learning visual context due to the existence of redundant messages. And inter-modality methods increase the complexity of retrieval because of unifying two modalities closely when learning modal features. In this research, we propose an eclectic V isual C ontext L earning based on T extual knowledge N etwork (VCLTN), which transfers textual knowledge to visual modality for context learning and decreases the discrepancy of information capacity between two modalities. Specifically, VCLTN merges label semantics into corresponding regional features and employs those labels as intermediaries between images and texts for better modal alignment. Contextual knowledge of those labels learned within textual modality is utilized to guide the visual context learning. Besides, considering the homogeneity within each modality, global features are merged into regional features for assisting in the context learning. In order to alleviate the imbalance of information capacity between images and texts, entities together with relations inside the given caption are extracted and an … (more)
Is Part Of:: Neural networks. Volume 152(2022)
Journal:: Neural networks
Issue:: Volume 152(2022)
Issue Display:: Volume 152, Issue 2022 (2022)
Year:: 2022
Volume:: 152
Issue:: 2022
Issue Sort Value:: 2022-0152-2022-0000
Page Start:: 434
Page End:: 449
Publication Date:: 2022-08
Subjects:: Image–text retrieval -- Knowledge transfer -- Visual context learning -- Modal alignment
Neural computers -- Periodicals
Neural networks (Computer science) -- Periodicals
Neural networks (Neurobiology) -- Periodicals
Nervous System -- Periodicals
Ordinateurs neuronaux -- Périodiques
Réseaux neuronaux (Informatique) -- Périodiques
Réseaux neuronaux (Neurobiologie) -- Périodiques
Neural computers
Neural networks (Computer science)
Neural networks (Neurobiology)
Periodicals
006.32
Journal URLs:: http://www.sciencedirect.com/science/journal/08936080 ↗
http://www.elsevier.com/journals ↗
DOI:: 10.1016/j.neunet.2022.05.008 ↗
Languages:: English
ISSNs:: 0893-6080
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 6081.280800
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 21788.xml