Adaptive feature fusion for visual object tracking. (March 2021)
- Record Type:
- Journal Article
- Title:
- Adaptive feature fusion for visual object tracking. (March 2021)
- Main Title:
- Adaptive feature fusion for visual object tracking
- Authors:
- Zhao, Shaochuan
Xu, Tianyang
Wu, Xiao-Jun
Zhu, Xue-Feng - Abstract:
- Highlights: We propose an adaptive feature fusion mechanism to provide both semantic and discriminative feature representations by automatically fusing multi-level convolutional layers. We reformulate the update strategy. Through joint training the projection matrix layer and correlation layer, a more convincing target localization formulation can be achieved. We validate our method on several benchmarking datasets with state-of-the-art methods. The experimental results and corresponding analysis demonstrate the merit of the proposed tracker. Abstract: Recent advanced trackers, consisting of discriminative classification component and dedicated bounding box estimation, have achieved improved performance in the visual tracking community. The most essential factor for the development is the utilization of different Convolutional Neural Networks (CNNs), which significantly improves the model capacity via offline trained deep feature representations. Though powerful deep structures emphasize more semantic appearance through high dimensional latent variables, how to achieve effective feature adaptation in the online tracking stage has not been sufficiently considered yet. To this end, we argue the necessity of exploring hierarchical and complementary appearance descriptors from different convolutional layers to achieve online tracking adaptation. Therefore, in this paper, we propose an adaptive feature fusion mechanism, which can balance the detection granularities from shallowHighlights: We propose an adaptive feature fusion mechanism to provide both semantic and discriminative feature representations by automatically fusing multi-level convolutional layers. We reformulate the update strategy. Through joint training the projection matrix layer and correlation layer, a more convincing target localization formulation can be achieved. We validate our method on several benchmarking datasets with state-of-the-art methods. The experimental results and corresponding analysis demonstrate the merit of the proposed tracker. Abstract: Recent advanced trackers, consisting of discriminative classification component and dedicated bounding box estimation, have achieved improved performance in the visual tracking community. The most essential factor for the development is the utilization of different Convolutional Neural Networks (CNNs), which significantly improves the model capacity via offline trained deep feature representations. Though powerful deep structures emphasize more semantic appearance through high dimensional latent variables, how to achieve effective feature adaptation in the online tracking stage has not been sufficiently considered yet. To this end, we argue the necessity of exploring hierarchical and complementary appearance descriptors from different convolutional layers to achieve online tracking adaptation. Therefore, in this paper, we propose an adaptive feature fusion mechanism, which can balance the detection granularities from shallow to deep convolutional layers. To be specific, the correlation between template and instance is employed to generate adaptive weights to achieve advanced saliency and discrimination. In addition, considering temporal appearance variation, the projection matrix for the multi-channel inputs is jointly updated with the correlation classifier to further enhance the robustness. The experimental results on four recent benchmarks, i.e., OTB-2015, VOT2018, LaSOT and TrackingNet, demonstrate the effectiveness and robustness of the proposed method, with superior performance compared to the state-of-the-art approaches. … (more)
- Is Part Of:
- Pattern recognition. Volume 111(2021)
- Journal:
- Pattern recognition
- Issue:
- Volume 111(2021)
- Issue Display:
- Volume 111, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 111
- Issue:
- 2021
- Issue Sort Value:
- 2021-0111-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-03
- Subjects:
- Visual tracking -- Deep neural network -- Feature fusion -- Online adaptation
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2020.107679 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 14921.xml