SiamCDA: Complementarity- and distractor-aware RGB-T tracking based on Siamese network

Tianlu Zhang, Xueru Liu, Qiang Zhang, Jungong Han

Research output: Contribution to journal (Article, peer-reviewed)

59 Citations (Scopus)
407 Downloads (Pure)

Abstract

Recent years have witnessed the prevalence of using the Siamese network for RGB-T tracking because of its remarkable success in RGB object tracking. Despite their faster than real-time speeds, existing RGB-T Siamese trackers suffer from low accuracy and poor robustness, compared to other state-of-the-art RGB-T trackers. To address such issues, a new complementarity- and distractor-aware RGB-T tracker based on Siamese network (referred to as SiamCDA) is developed in this paper. To this end, several modules are presented, where the feature pyramid network (FPN) is incorporated into the Siamese network to capture the cross-level information within unimodal features extracted from the RGB or the thermal images. Next, a complementarity-aware multi-modal feature fusion module (CA-MF) is specially designed to capture the cross-modal information between RGB features and thermal features. In the final bounding box selection phase, a distractor-aware region proposal selection module (DAS) further enhances the robustness of our tracker. On top of the technical modules, we also build a large-scale, diverse synthetic RGB-T tracking dataset, containing more than 4831 pairs of synthetic RGB-T videos and 12K synthetic RGB-T images. Extensive experiments on three RGB-T tracking benchmark datasets demonstrate the outstanding performance of our proposed tracker with a tracking speed over 37 frames per second (FPS).
Original language: English
Pages (from-to): 1403-1417
Number of pages: 15
Journal: IEEE Transactions on Circuits and Systems for Video Technology
Volume: 32
Issue number: 3
Early online date: 09 Apr 2021
Status: Published - 01 Mar 2022
