A comparative study of irregular pyramid matching in bag-of-bags of words model for image retrieval. Issue 3 (March 2016)
- Record Type:
- Journal Article
- Title:
- A comparative study of irregular pyramid matching in bag-of-bags of words model for image retrieval. Issue 3 (March 2016)
- Main Title:
- A comparative study of irregular pyramid matching in bag-of-bags of words model for image retrieval
- Authors:
- Ren, Yi
- Abstract:
- Abstract In this paper, we assess three standard approaches to build irregular pyramid partitions for image retrieval in the bag-of-bags of words model that we recently proposed. These three approaches are: kernel $$k$$ k -means to optimize multilevel weighted graph cuts, normalized cuts and graph cuts, respectively. The bag-of-bags of words (BBoW) model is an approach based on irregular pyramid partitions over the image. An image is first represented as a connected graph of local features on a regular grid of pixels. Irregular partitions (subgraphs) of the image are further built by using graph partitioning methods. Each subgraph in the partition is then represented by its own signature. The BBoW model with the aid of graph extends the classical bag-of-words model, by embedding color homogeneity and limited spatial information through irregular partitions of an image. Compared with existing methods for image retrieval, such as spatial pyramid matching, the BBoW model does not assume that similar parts of a scene always appear at the same location in images of the same category. The extension of the proposed model to pyramid gives rise to a method we name irregular pyramid matching. The experiments onCaltech-101 benchmark demonstrate that applying kernel $$k$$ k -means to graph clustering process produces better retrieval results, as compared with other graph partitioning methods such as graph cuts and normalized cuts for BBoW. Moreover, this proposed method achievesAbstract In this paper, we assess three standard approaches to build irregular pyramid partitions for image retrieval in the bag-of-bags of words model that we recently proposed. These three approaches are: kernel $$k$$ k -means to optimize multilevel weighted graph cuts, normalized cuts and graph cuts, respectively. The bag-of-bags of words (BBoW) model is an approach based on irregular pyramid partitions over the image. An image is first represented as a connected graph of local features on a regular grid of pixels. Irregular partitions (subgraphs) of the image are further built by using graph partitioning methods. Each subgraph in the partition is then represented by its own signature. The BBoW model with the aid of graph extends the classical bag-of-words model, by embedding color homogeneity and limited spatial information through irregular partitions of an image. Compared with existing methods for image retrieval, such as spatial pyramid matching, the BBoW model does not assume that similar parts of a scene always appear at the same location in images of the same category. The extension of the proposed model to pyramid gives rise to a method we name irregular pyramid matching. The experiments onCaltech-101 benchmark demonstrate that applying kernel $$k$$ k -means to graph clustering process produces better retrieval results, as compared with other graph partitioning methods such as graph cuts and normalized cuts for BBoW. Moreover, this proposed method achieves comparable results and outperforms SPM in 19 object categories on the wholeCaltech-101 dataset. … (more)
- Is Part Of:
- Signal, image and video processing. Volume 10:Issue 3(2016)
- Journal:
- Signal, image and video processing
- Issue:
- Volume 10:Issue 3(2016)
- Issue Display:
- Volume 10, Issue 3 (2016)
- Year:
- 2016
- Volume:
- 10
- Issue:
- 3
- Issue Sort Value:
- 2016-0010-0003-0000
- Page Start:
- 471
- Page End:
- 478
- Publication Date:
- 2016-03
- Subjects:
- Content-based image retrieval -- Graph cuts -- Kernel $$k$$k-means -- Normalized cuts -- Clustering -- Bag-of-words
Signal processing -- Digital techniques -- Periodicals
Image processing -- Digital techniques -- Periodicals
Digital video -- Periodicals
621.3822 - Journal URLs:
- http://www.springerlink.com/content/120512/ ↗
http://www.springerlink.com/openurl.asp?genre=journal&issn=1863-1703 ↗
http://www.springer.com/gb/ ↗ - DOI:
- 10.1007/s11760-015-0763-7 ↗
- Languages:
- English
- ISSNs:
- 1863-1703
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 8275.985203
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 9994.xml