Visual tracking using spatio-temporally nonlocally regularized correlation filter. (November 2018)
- Record Type:
- Journal Article
- Title:
- Visual tracking using spatio-temporally nonlocally regularized correlation filter. (November 2018)
- Main Title:
- Visual tracking using spatio-temporally nonlocally regularized correlation filter
- Authors:
- Zhang, Kaihua
Li, Xuejun
Song, Huihui
Liu, Qingshan
Lian, Wei - Abstract:
- Highlights: A novel regularized CF based tracking approach has been proposed with promising results on three benchmark datasets. Our method effectively captures the long-term spatio-temporally nonlocal superpixel appearance information to regularize the CF learning. Our method deals well with the challenging factors such as large viewpoint changes and non-rigid deformation. Abstract: Due to the factors like rapidly fast motion, cluttered backgrounds, arbitrary object appearance variation and shape deformation, an effective target representation plays a key role in robust visual tracking. Existing methods often employ bounding boxes for target representations, which are easily polluted by noisy clutter backgrounds that may cause drifting problem when the target undergoes large-scale non-rigid or articulated motions. To address this issue, in this paper, motivated by the spatio-temporal nonlocality of target appearance reoccurrence in a video, we explore the nonlocal information to accurately represent and segment the target, yielding an object likelihood map to regularize a correlation filter (CF) for visual tracking. Specifically, given a set of tracked target bounding boxes, we first generate a set of superpixels to represent the foreground and background, and then update the appearance of each superpixel with its long-term spatio-temporally nonlocal counterparts. Then, with the updated appearances, we formulate a spatio-temporally graphical model comprised of theHighlights: A novel regularized CF based tracking approach has been proposed with promising results on three benchmark datasets. Our method effectively captures the long-term spatio-temporally nonlocal superpixel appearance information to regularize the CF learning. Our method deals well with the challenging factors such as large viewpoint changes and non-rigid deformation. Abstract: Due to the factors like rapidly fast motion, cluttered backgrounds, arbitrary object appearance variation and shape deformation, an effective target representation plays a key role in robust visual tracking. Existing methods often employ bounding boxes for target representations, which are easily polluted by noisy clutter backgrounds that may cause drifting problem when the target undergoes large-scale non-rigid or articulated motions. To address this issue, in this paper, motivated by the spatio-temporal nonlocality of target appearance reoccurrence in a video, we explore the nonlocal information to accurately represent and segment the target, yielding an object likelihood map to regularize a correlation filter (CF) for visual tracking. Specifically, given a set of tracked target bounding boxes, we first generate a set of superpixels to represent the foreground and background, and then update the appearance of each superpixel with its long-term spatio-temporally nonlocal counterparts. Then, with the updated appearances, we formulate a spatio-temporally graphical model comprised of the superpixel label consistency potentials. Afterwards, we generate segmentation by optimizing the graphical model via iteratively updating the appearance model and estimating the labels. Finally, with the segmentation mask, we obtain an object likelihood map that is employed to adaptively regularize the CF learning by suppressing the clutter background noises while making full use of the long-term stable target appearance information. Extensive evaluations on the OTB50, SegTrack, Youtube-Objects datasets demonstrate the effectiveness of the proposed method, which performs favorably against some state-of-art methods. … (more)
- Is Part Of:
- Pattern recognition. Volume 83(2018:Nov.)
- Journal:
- Pattern recognition
- Issue:
- Volume 83(2018:Nov.)
- Issue Display:
- Volume 83 (2018)
- Year:
- 2018
- Volume:
- 83
- Issue Sort Value:
- 2018-0083-0000-0000
- Page Start:
- 185
- Page End:
- 195
- Publication Date:
- 2018-11
- Subjects:
- Visual tracking -- Video segmentation -- Nonlocal appearance learning -- Graphical model -- Optical flow
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2018.05.017 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 16621.xml