Multiple object cues for high performance vector quantization. (July 2017)
- Record Type:
- Journal Article
- Title:
- Multiple object cues for high performance vector quantization. (July 2017)
- Main Title:
- Multiple object cues for high performance vector quantization
- Authors:
- Ramesh, B.
Xiang, C.
Lee, T.H. - Abstract:
- Highlights: A multi-cue representation is proposed for high performance vector quantization. Keypoint detection using differential entropy is introduced for efficient sampling. Shape representation is improved using the proposed co-occurrence statistics. We report high accuracy on Caltech-101 dataset using the multi-cue representation. We report best results on Flickr-101 dataset using the multi-cue representation. Abstract: In this paper, we propose a multi-cue object representation for image classification using the standard bag-of-words model. Ever since the success of the bag-of-words model for image classification, several modifications of it have been proposed in the literature. These variants target to improve key aspects, such as efficient and compact dictionary learning, advanced image encoding techniques, pooling methods, and efficient kernels for the final classification step. In particular, "soft-encoding" methods such as sparse coding, locality constrained linear coding, Fisher vector encoding, have received great attention in the literature, to improve upon the "hard-assignment" obtained by vector quantization. Nevertheless, these methods come at a higher computational cost while little attention has been paid to the extracted local features. In contrast, we propose a novel multi-cue object representation for image classification using the simple vector quantization, and show highly competitive classification performance compared to state-of-the-art methods onHighlights: A multi-cue representation is proposed for high performance vector quantization. Keypoint detection using differential entropy is introduced for efficient sampling. Shape representation is improved using the proposed co-occurrence statistics. We report high accuracy on Caltech-101 dataset using the multi-cue representation. We report best results on Flickr-101 dataset using the multi-cue representation. Abstract: In this paper, we propose a multi-cue object representation for image classification using the standard bag-of-words model. Ever since the success of the bag-of-words model for image classification, several modifications of it have been proposed in the literature. These variants target to improve key aspects, such as efficient and compact dictionary learning, advanced image encoding techniques, pooling methods, and efficient kernels for the final classification step. In particular, "soft-encoding" methods such as sparse coding, locality constrained linear coding, Fisher vector encoding, have received great attention in the literature, to improve upon the "hard-assignment" obtained by vector quantization. Nevertheless, these methods come at a higher computational cost while little attention has been paid to the extracted local features. In contrast, we propose a novel multi-cue object representation for image classification using the simple vector quantization, and show highly competitive classification performance compared to state-of-the-art methods on popular datasets like Caltech-101 and MICC Flickr-101. Apart from the object representation, we also propose a novel keypoint detection scheme that helps to achieve a classification rate comparable to the popular dense keypoint sampling strategy, at a much lower computational cost. … (more)
- Is Part Of:
- Pattern recognition. Volume 67(2017:Jul.)
- Journal:
- Pattern recognition
- Issue:
- Volume 67(2017:Jul.)
- Issue Display:
- Volume 67 (2017)
- Year:
- 2017
- Volume:
- 67
- Issue Sort Value:
- 2017-0067-0000-0000
- Page Start:
- 380
- Page End:
- 395
- Publication Date:
- 2017-07
- Subjects:
- Log-polar transform -- Object classification -- Visual cues -- Bag-of-words model -- Flickr-101 dataset -- Caltech-101 dataset
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2017.02.024 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 1166.xml