Abstract
Discriminative correlation filters (DCF) have achieved enormous popularity in the tracking community. Recently, the performance advancement in DCF-based trackers is predominantly driven by the use of convolutional features. In pursuit of extreme tracking performance, state-of-the-art trackers (e.g., cascade correlation tracking [1] and HCF [2]) equip DCF with hierarchical convolutional features to capture both semantics and spatial details of the target appearance. While such methods have been shown to work well, multiple feature integration results in high model complexity which significantly increases the over-fitting risk and computational burden. In this paper, we present a coarse-to-fine framework for cascade correlation tracking (CCT). Instead of integrating hierarchical features, this framework decomposes a complicated tracker into two low-complexity modules, a coarse tracker \({\mathcal {C}}\) and a refined tracker \({\mathcal {R}}\) , working in a coarse-to-fine manner. The coarse tracker \({\mathcal {C}}\) employs low-resolution semantic convolutional features extracted from a large search area to cope with large target displacement and appearance change between adjacent frames. By contrast, the refined tracker \({\mathcal {R}}\) employs high-resolution handcraft features extracted from a small search area to further refine the coarse location of \({\mathcal {C}}\) . Our CCT tracker enjoys the strong discriminative power of \({\mathcal {C}}\) and the high efficiency of \({\mathcal {R}}\) . Experiments on the OTB2013 and TC128 benchmarks show that CCT performs favorably against state-of-the-art trackers.
http://bit.ly/2E8bDjA
Δεν υπάρχουν σχόλια:
Δημοσίευση σχολίου