Pseudo loss active learning for deep visual tracking. (October 2022)
- Record Type:
- Journal Article
- Title:
- Pseudo loss active learning for deep visual tracking. (October 2022)
- Main Title:
- Pseudo loss active learning for deep visual tracking
- Authors:
- Cui, Zhiyan
Lu, Na
Wang, Weifeng - Abstract:
- Highlights: A novel pseudo loss describing the spatial context uncertainty of the tracked target and its surroundings is proposed, based on which the most informative samples from the unlabeled pool could be selected. An interval threshold and a temporal penalty are adopted to avoid drastic target appearance variation and reduce the redundancy in the selected samples. Theoretical analysis has validated the effectiveness of the proposed pseudo loss for sample uncertainty evaluation. PLAL has obtained comparable performance (98%–100%) to the model trained on the entire training dataset with only 3% data and the best performance comparing with multiple active learning methods on different tracking benchmarks. Abstract: In visual tracking tasks, the training data are commonly composed of a large number of video sequences and each frame in the sequences needs to be labeled manually, which is labor-intensive and time-consuming. In addition, considering the similarity among the consecutive frames in the same sequence, there is significant redundancy in the training data. To address these problems, a novel pseudo loss active learning (PLAL) method is developed in this paper. PLAL aims to select the most informative and least redundant data for training to reduce the cost of labeling and maintain competitive tracking results simultaneously. Firstly, the Gaussian distribution based pseudo label is generated for the unlabeled candidates based on the tracking model which is initiallyHighlights: A novel pseudo loss describing the spatial context uncertainty of the tracked target and its surroundings is proposed, based on which the most informative samples from the unlabeled pool could be selected. An interval threshold and a temporal penalty are adopted to avoid drastic target appearance variation and reduce the redundancy in the selected samples. Theoretical analysis has validated the effectiveness of the proposed pseudo loss for sample uncertainty evaluation. PLAL has obtained comparable performance (98%–100%) to the model trained on the entire training dataset with only 3% data and the best performance comparing with multiple active learning methods on different tracking benchmarks. Abstract: In visual tracking tasks, the training data are commonly composed of a large number of video sequences and each frame in the sequences needs to be labeled manually, which is labor-intensive and time-consuming. In addition, considering the similarity among the consecutive frames in the same sequence, there is significant redundancy in the training data. To address these problems, a novel pseudo loss active learning (PLAL) method is developed in this paper. PLAL aims to select the most informative and least redundant data for training to reduce the cost of labeling and maintain competitive tracking results simultaneously. Firstly, the Gaussian distribution based pseudo label is generated for the unlabeled candidates based on the tracking model which is initially trained on a small amount of training data. Then, the pseudo loss based on cross entropy is designed to compute the difference between the pseudo label and the target response map. The pseudo loss measures the uncertainty of the target spatial context which is used as the informativeness criterion of the image frame for selection. Meanwhile, a sampling interval threshold and a temporal penalty are employed for frame selection to avoid drastic variation in target appearance and reduce the redundancy within the consecutive candidate frames. Only the selected frames are labeled by the oracle (human expert) and then added to the training data. Extensive experiments on public benchmarks (OTB2013, OTB2015, VOT2018, UAV123, GOT-10K, TrackingNet, LaSOT, OxUvA and TLP) demonstrate that PLAL method outperforms the baseline and other recent active learning approaches. With only 3% of labeled data from the training dataset, PLAL reaches competitive performance (98-100%) compared to the model trained on the entire training dataset. … (more)
- Is Part Of:
- Pattern recognition. Volume 130(2022)
- Journal:
- Pattern recognition
- Issue:
- Volume 130(2022)
- Issue Display:
- Volume 130, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 130
- Issue:
- 2022
- Issue Sort Value:
- 2022-0130-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-10
- Subjects:
- Active learning -- Visual tracking -- Pseudo loss -- Pseudo label
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2022.108773 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 22236.xml