Sparse Relational Topical Coding on multi-modal data. (December 2017)
- Record Type:
- Journal Article
- Title:
- Sparse Relational Topical Coding on multi-modal data. (December 2017)
- Main Title:
- Sparse Relational Topical Coding on multi-modal data
- Authors:
- Song, Lingyun
Liu, Jun
Luo, Minnan
Qian, Buyue
Yang, Kuan - Abstract:
- Highlights: A novel non-probabilistic relational topic model is proposed for modeling both multi-modal documents and the links between them. Sparse latent representations can be effectively learned through directly imposing appropriate regularizers. The proposed learning problem can be efficiently solved by a simple coordinate descent algorithm. The proposed model achieves significantly better performance than all the competing baseline models. Abstract: Multi-modal data modeling lately has been an active research area in pattern recognition community. Existing studies mainly focus on modeling the content of multi-modal documents, whilst the links amongst documents are commonly ignored. However, link information has shown being of key importance in many applications, such as document navigation, classification, and clustering. In this paper, we present a non-probabilistic formulation of Relational Topic Model (RTM), i.e., Sparse Relational Multi-Modal Topical Coding (SRMMTC), to model both multi-modal documents and the corresponding link information. SRMMTC has the following three appealing properties: i) It can effectively produce sparse latent representations via directly imposing sparsity-inducing regularizers. ii) It handles the imbalance issues on multi-modal data collections by introducing regularization parameters for positive and negative links, respectively; iii) It can be solved by an efficient coordinate descent algorithm. We also explore a generalized version ofHighlights: A novel non-probabilistic relational topic model is proposed for modeling both multi-modal documents and the links between them. Sparse latent representations can be effectively learned through directly imposing appropriate regularizers. The proposed learning problem can be efficiently solved by a simple coordinate descent algorithm. The proposed model achieves significantly better performance than all the competing baseline models. Abstract: Multi-modal data modeling lately has been an active research area in pattern recognition community. Existing studies mainly focus on modeling the content of multi-modal documents, whilst the links amongst documents are commonly ignored. However, link information has shown being of key importance in many applications, such as document navigation, classification, and clustering. In this paper, we present a non-probabilistic formulation of Relational Topic Model (RTM), i.e., Sparse Relational Multi-Modal Topical Coding (SRMMTC), to model both multi-modal documents and the corresponding link information. SRMMTC has the following three appealing properties: i) It can effectively produce sparse latent representations via directly imposing sparsity-inducing regularizers. ii) It handles the imbalance issues on multi-modal data collections by introducing regularization parameters for positive and negative links, respectively; iii) It can be solved by an efficient coordinate descent algorithm. We also explore a generalized version of SRMMTC to find pairwise interactions amongst topics. Our methods are also capable of performing link prediction for documents, as well as the prediction of annotation words for attendant images in documents. Empirical studies on a set of benchmark datasets show that our proposed models significantly outperform many state-of-the-art methods. … (more)
- Is Part Of:
- Pattern recognition. Volume 72(2017:Dec.)
- Journal:
- Pattern recognition
- Issue:
- Volume 72(2017:Dec.)
- Issue Display:
- Volume 72 (2017)
- Year:
- 2017
- Volume:
- 72
- Issue Sort Value:
- 2017-0072-0000-0000
- Page Start:
- 368
- Page End:
- 380
- Publication Date:
- 2017-12
- Subjects:
- Multi-modal data -- Sparse latent representation -- Image annotation -- Link prediction
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2017.08.005 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 9247.xml