Representation learning using step-based deep multi-modal autoencoders. (November 2019)

Record Type:: Journal Article
Title:: Representation learning using step-based deep multi-modal autoencoders. (November 2019)
Main Title:: Representation learning using step-based deep multi-modal autoencoders
Authors:: Bhatt, Gaurav
Jha, Piyush
Raman, Balasubramanian
Abstract:: Deep learning techniques have been successfully used in learning a common representation for multi-view data, wherein different modalities are projected onto a common subspace. From a broader perspective, the techniques used to investigate common representation learning fall under the categories of 'canonical correlation-based' approaches and 'autoencoder-based' approaches. In this paper, we investigate the performance of deep autoencoder-based methods on multi-view data. We propose a novel step-based correlation multi-modal deep convolution neural network (CorrMCNN) which reconstructs one view of the data given the other while increasing the interaction between the representations at each hidden layer, that is, at every intermediate step. Step reconstruction relaxes the constraint of reconstructing the original data; instead, the objective function is optimized for reconstruction of representative features. This helps the proposed model generalize efficiently to representation and transfer learning tasks on high-dimensional data. Finally, we evaluate the performance of the proposed model on three multi-view and cross-modal problems, viz., audio articulation, cross-modal image retrieval and multilingual (cross-language) document classification. Through extensive experiments, we find that the proposed model performs much better than the current state-of-the-art deep learning techniques on all three multi-view and cross-modal tasks.
Is Part Of:: Pattern recognition. Volume 95(2019:Nov.)
Journal:: Pattern recognition
Issue:: Volume 95(2019:Nov.)
Issue Display:: Volume 95 (2019)
Year:: 2019
Volume:: 95
Issue Sort Value:: 2019-0095-0000-0000
Page Start:: 12
Page End:: 23
Publication Date:: 2019-11
Subjects:: Representation learning -- Transfer learning -- Convolution autoencoders -- Multilingual document classification
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4
Journal URLs:: http://www.sciencedirect.com/science/journal/00313203
http://www.sciencedirect.com/
DOI:: 10.1016/j.patcog.2019.05.032
Languages:: English
ISSNs:: 0031-3203
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms)
Physical Locations:: British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 11157.xml