Joint Cross-Modal and Unimodal Features for RGB-D Salient Object Detection

Nianchang Huang, Yi Liu, Qiang Zhang, Jungong Han

Allbwn ymchwil: Cyfraniad at gyfnodolynErthygladolygiad gan gymheiriaid

29 Dyfyniadau (Scopus)
287 Wedi eu Llwytho i Lawr (Pure)

Crynodeb

RGB-D salient object detection is one of the basic tasks in computer vision. Most existing models focus on investigating efficient ways of fusing the complementary information from RGB and depth images for better saliency detection. However, for many real-life cases, where one of the input images has poor visual quality or contains affluent saliency cues, fusing cross-modal features does not help to improve the detection accuracy, when compared to using unimodal features only. In view of this, a novel RGB-D salient object detection model is proposed by simultaneously exploiting the cross-modal features from the RGB-D images and the unimodal features from the input RGB and depth images for saliency detection. To this end, a Multi-branch Feature Fusion Module is presented to effectively capture the cross-level and cross-modal complementary information between RGB-D images, as well as the cross-level unimodal features from the RGB images and the depth images separately. On top of that, a Feature Selection Module is designed to adaptively select those highly discriminative features for the final saliency prediction from the fused cross-modal features and the unimodal features. Extensive evaluations on four benchmark datasets demonstrate that the proposed model outperforms the state-of-the-art approaches by a large margin.
Iaith wreiddiolSaesneg
Rhif yr erthygl9147051
Tudalennau (o-i)2428-2441
Nifer y tudalennau14
CyfnodolynIEEE Transactions on Multimedia
Cyfrol23
Dynodwyr Gwrthrych Digidol (DOIs)
StatwsCyhoeddwyd - 24 Gorff 2020

Ôl bys

Gweld gwybodaeth am bynciau ymchwil 'Joint Cross-Modal and Unimodal Features for RGB-D Salient Object Detection'. Gyda’i gilydd, maen nhw’n ffurfio ôl bys unigryw.

Dyfynnu hyn