#5779. The multi-level classification and regression network for visual tracking via residual channel attention

July 2026publication date
Proposal available till 12-05-2025
4 total number of authors per manuscript0 $

The title of the journal is available only for the authors who have already paid for
Journal’s subject area:
Statistics, Probability and Uncertainty;
Applied Mathematics;
Computational Theory and Mathematics;
Electrical and Electronic Engineering;
Computer Vision and Pattern Recognition;
Signal Processing;
Artificial Intelligence;
Places in the authors’ list:
place 1place 2place 3place 4
FreeFreeFreeFree
2350 $1200 $1050 $900 $
Contract5779.1 Contract5779.2 Contract5779.3 Contract5779.4
1 place - free (for sale)
2 place - free (for sale)
3 place - free (for sale)
4 place - free (for sale)

Abstract:
Siamese networks used for target tracking has attracted widespread attention due to its balanced tracking accuracy and efficient execution speed. However, when there exist similar semantic information in the search area, it is difficult for most Siamese trackers to adapt to the interference of similar semantic information to target localization, which greatly affects the robustness of Siamese trackers. In order to effectively mine feature information and improve localization accuracy, this work proposes one Siamese multi-level classification and regression (SiamMCAR) framework. SiamMCAR first introduces the residual channel attention module into the template branch of Siamese subnetwork. By utilizing the relationship between feature channels to determine the channel weight of the target template feature, making the attention of extracting the template feature is focused on the channel feature of target foreground. Then, one multi-level classification and regression subnetwork containing multiple classification and regression modules is constructed. Feature maps of the output of different classification and regression modules are weighted and fused by using multiple trained weights, which enables the multi-level classification and regression subnetwork to obtain more results of classification and regression of the shallow cross-correlation response map, thereby making the localization more accurate.
Keywords:
Multi-level classification and regression; Residual channel attention; Siamese network; Visual tracking

Contacts :
0