Gaze latent support vector machine for image classification improved by weakly supervised region selection. (December 2017)
- Record Type:
- Journal Article
- Title:
- Gaze latent support vector machine for image classification improved by weakly supervised region selection. (December 2017)
- Main Title:
- Gaze latent support vector machine for image classification improved by weakly supervised region selection
- Authors:
- Wang, Xin
Thome, Nicolas
Cord, Matthieu - Abstract:
- Highlights: A novel WSL model integrating human gaze annotation for image classification is proposed. Our model generalizes positive instance Gaze latent SVM. Our model generalizes top K instance latent SVM. Our model is validated on extensive experiments in various challenging contexts. A new food-related dataset with gaze annotation is published. Abstract: This paper deals with Weakly Supervised Learning (WSL), i.e. performing image classification by leveraging local information with models trained from global image labels. We propose a new WSL method which incorporates gaze features collected by an eye-tracker to guide the region selection policy. Our approach presents two appealing advantages: gaze features are cheap to collect, e.g. one order of magnitude faster than bounding boxes, and our method only requires gaze features during training, while being gaze free at test time. For this purpose, the training objective is enriched with a gaze loss, from which we derive a concave-convex upper bound, leading to an off-the-shelf CCCP optimization scheme. Extensive experiments are conducted to validate the effectiveness of the approach for WSL image classification on two public datasets with gaze annotation, i.e. PASCAL VOC 2012 action and POET. In addition, we publicly release a new food-related dataset, the Gaze-based UPMC Food dataset (UPMC-G20), which covers 20 food categories and 2, 000 images. This dataset intends to promote the research in the food-related computerHighlights: A novel WSL model integrating human gaze annotation for image classification is proposed. Our model generalizes positive instance Gaze latent SVM. Our model generalizes top K instance latent SVM. Our model is validated on extensive experiments in various challenging contexts. A new food-related dataset with gaze annotation is published. Abstract: This paper deals with Weakly Supervised Learning (WSL), i.e. performing image classification by leveraging local information with models trained from global image labels. We propose a new WSL method which incorporates gaze features collected by an eye-tracker to guide the region selection policy. Our approach presents two appealing advantages: gaze features are cheap to collect, e.g. one order of magnitude faster than bounding boxes, and our method only requires gaze features during training, while being gaze free at test time. For this purpose, the training objective is enriched with a gaze loss, from which we derive a concave-convex upper bound, leading to an off-the-shelf CCCP optimization scheme. Extensive experiments are conducted to validate the effectiveness of the approach for WSL image classification on two public datasets with gaze annotation, i.e. PASCAL VOC 2012 action and POET. In addition, we publicly release a new food-related dataset, the Gaze-based UPMC Food dataset (UPMC-G20), which covers 20 food categories and 2, 000 images. This dataset intends to promote the research in the food-related computer vision community. … (more)
- Is Part Of:
- Pattern recognition. Volume 72(2017:Dec.)
- Journal:
- Pattern recognition
- Issue:
- Volume 72(2017:Dec.)
- Issue Display:
- Volume 72 (2017)
- Year:
- 2017
- Volume:
- 72
- Issue Sort Value:
- 2017-0072-0000-0000
- Page Start:
- 59
- Page End:
- 71
- Publication Date:
- 2017-12
- Subjects:
- Weakly supervised learning -- Human gaze -- image classification
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2017.07.001 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 4666.xml