A new distance measure for non-identical data with application to image classification. (March 2017)
- Record Type:
- Journal Article
- Title:
- A new distance measure for non-identical data with application to image classification. (March 2017)
- Main Title:
- A new distance measure for non-identical data with application to image classification
- Authors:
- Swaminathan, Muthukaruppan
Yadav, Pankaj Kumar
Piloto, Obdulio
Sjöblom, Tobias
Cheong, Ian - Abstract:
- Abstract: Distance measures are part and parcel of many computer vision algorithms. The underlying assumption in all existing distance measures is that feature elements are independent and identically distributed. However, in real-world settings, data generally originate from heterogeneous sources even if they do possess a common data-generating mechanism. Since these sources are not identically distributed by necessity, the assumption of identical distribution is inappropriate. Here, we use statistical analysis to show that feature elements of local image descriptors are indeed non-identically distributed. To test the effect of omitting the unified distribution assumption, we created a new distance measure called the Poisson-Binomial radius (PBR). PBR is a bin-to-bin distance which accounts for the dispersion of bin-to-bin information. PBR's performance was evaluated on twelve benchmark data sets covering six different classification and recognition applications: texture, material, leaf, scene, ear biometrics and category-level image classification. Results from these experiments demonstrate that PBR outperforms state-of-the-art distance measures for most of the data sets and achieves comparable performance on the rest, suggesting that accounting for different distributions in distance measures can improve performance in classification and recognition tasks. Abstract : Highlights: Empirical evidence is provided that real-world data is non-identically distributed. PBR, theAbstract: Distance measures are part and parcel of many computer vision algorithms. The underlying assumption in all existing distance measures is that feature elements are independent and identically distributed. However, in real-world settings, data generally originate from heterogeneous sources even if they do possess a common data-generating mechanism. Since these sources are not identically distributed by necessity, the assumption of identical distribution is inappropriate. Here, we use statistical analysis to show that feature elements of local image descriptors are indeed non-identically distributed. To test the effect of omitting the unified distribution assumption, we created a new distance measure called the Poisson-Binomial radius (PBR). PBR is a bin-to-bin distance which accounts for the dispersion of bin-to-bin information. PBR's performance was evaluated on twelve benchmark data sets covering six different classification and recognition applications: texture, material, leaf, scene, ear biometrics and category-level image classification. Results from these experiments demonstrate that PBR outperforms state-of-the-art distance measures for most of the data sets and achieves comparable performance on the rest, suggesting that accounting for different distributions in distance measures can improve performance in classification and recognition tasks. Abstract : Highlights: Empirical evidence is provided that real-world data is non-identically distributed. PBR, the first distance measure to account for non-identical data is proposed. PBR was tested in 6 test applications using 12 benchmark data sets. PBR outperforms state-of-the-art measures for most data sets. Avoiding the identical distribution assumption can improve classification. … (more)
- Is Part Of:
- Pattern recognition. Volume 63(2017:Mar.)
- Journal:
- Pattern recognition
- Issue:
- Volume 63(2017:Mar.)
- Issue Display:
- Volume 63 (2017)
- Year:
- 2017
- Volume:
- 63
- Issue Sort Value:
- 2017-0063-0000-0000
- Page Start:
- 384
- Page End:
- 396
- Publication Date:
- 2017-03
- Subjects:
- Poisson-Binomial distribution -- Semi-metric distance -- Non-identical data -- Distance measure -- Image classification -- Image recognition
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2016.10.018 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 12847.xml