#5779. The multi-level classification and regression network for visual tracking via residual channel attention
July 2026 | publication date |
Proposal available till | 12-05-2025 |
4 total number of authors per manuscript | 0 $ |
The title of the journal is available only for the authors who have already paid for |
|
|
Journal’s subject area: |
Statistics, Probability and Uncertainty;
Applied Mathematics;
Computational Theory and Mathematics;
Electrical and Electronic Engineering;
Computer Vision and Pattern Recognition;
Signal Processing;
Artificial Intelligence; |
Places in the authors’ list:
1 place - free (for sale)
2 place - free (for sale)
3 place - free (for sale)
4 place - free (for sale)
Abstract:
Siamese networks used for target tracking has attracted widespread attention due to its balanced tracking accuracy and efficient execution speed. However, when there exist similar semantic information in the search area, it is difficult for most Siamese trackers to adapt to the interference of similar semantic information to target localization, which greatly affects the robustness of Siamese trackers. In order to effectively mine feature information and improve localization accuracy, this work proposes one Siamese multi-level classification and regression (SiamMCAR) framework. SiamMCAR first introduces the residual channel attention module into the template branch of Siamese subnetwork. By utilizing the relationship between feature channels to determine the channel weight of the target template feature, making the attention of extracting the template feature is focused on the channel feature of target foreground. Then, one multi-level classification and regression subnetwork containing multiple classification and regression modules is constructed. Feature maps of the output of different classification and regression modules are weighted and fused by using multiple trained weights, which enables the multi-level classification and regression subnetwork to obtain more results of classification and regression of the shallow cross-correlation response map, thereby making the localization more accurate.
Keywords:
Multi-level classification and regression; Residual channel attention; Siamese network; Visual tracking
Contacts :