TY - JOUR
T1 - Revisiting Feature Fusion for RGB-T Salient Object Detection
AU - Zhang, Qiang
AU - Xiao, Tonglin
AU - Huang, Nianchang
AU - Zhang, Dingwen
AU - Han, Jungong
N1 - Funding Information:
Manuscript received December 12, 2019; revised June 17, 2020; accepted July 22, 2020. Date of publication August 6, 2020; date of current version May 5, 2021. This work was supported in part by the National Natural Science Foundation of China under Grant 61773301 and Grant 61876140 and in part by the China Post-Doctoral Support Scheme for Innovative Talents under Grant BX20180236. This article was recommended by Associate Editor Q. Tian. (Corresponding authors: Dingwen Zhang; Jungong Han.) Qiang Zhang is with the Key Laboratory of Electronic Equipment Structure Design, Ministry of Education, Xidian University, Xi’an 710071, China, and also with the Center for Complex Systems, School of Mechano-Electronic Engineering, Xidian University, Xi’an 710071, China (e-mail: [email protected]).
Publisher Copyright:
© 2021 Institute of Electrical and Electronics Engineers Inc.. All rights reserved.
PY - 2021/5/1
Y1 - 2021/5/1
N2 - While many RGB-based saliency detection algorithms have recently shown the capability of segmenting salient objects from an image, they still suffer from unsatisfactory performance when dealing with complex scenarios, insufficient illumination or occluded appearances. To overcome this problem, this article studies RGB-T saliency detection, where we take advantage of thermal modality's robustness against illumination and occlusion. To achieve this goal, we revisit feature fusion for mining intrinsic RGB-T saliency patterns and propose a novel deep feature fusion network, which consists of the multi-scale, multi-modality, and multi-level feature fusion modules. Specifically, the multi-scale feature fusion module captures rich contexture features from each modality feature, while the multi-modality and multi-level feature fusion modules integrate complementary features from different modality features and different level of features, respectively. To demonstrate the effectiveness of the proposed approach, we conduct comprehensive experiments on the RGB-T saliency detection benchmark. The experimental results demonstrate that our approach outperforms other state-of-the-art methods and the conventional feature fusion modules by a large margin.
AB - While many RGB-based saliency detection algorithms have recently shown the capability of segmenting salient objects from an image, they still suffer from unsatisfactory performance when dealing with complex scenarios, insufficient illumination or occluded appearances. To overcome this problem, this article studies RGB-T saliency detection, where we take advantage of thermal modality's robustness against illumination and occlusion. To achieve this goal, we revisit feature fusion for mining intrinsic RGB-T saliency patterns and propose a novel deep feature fusion network, which consists of the multi-scale, multi-modality, and multi-level feature fusion modules. Specifically, the multi-scale feature fusion module captures rich contexture features from each modality feature, while the multi-modality and multi-level feature fusion modules integrate complementary features from different modality features and different level of features, respectively. To demonstrate the effectiveness of the proposed approach, we conduct comprehensive experiments on the RGB-T saliency detection benchmark. The experimental results demonstrate that our approach outperforms other state-of-the-art methods and the conventional feature fusion modules by a large margin.
KW - RGB-T
KW - Salient object detection
KW - feature fusion
KW - multi-level
KW - multi-modality
KW - multi-scale
UR - http://www.scopus.com/inward/record.url?scp=85099552122&partnerID=8YFLogxK
U2 - 10.1109/TCSVT.2020.3014663
DO - 10.1109/TCSVT.2020.3014663
M3 - Article
SN - 1051-8215
VL - 31
SP - 1804
EP - 1818
JO - IEEE Transactions on Circuits and Systems for Video Technology
JF - IEEE Transactions on Circuits and Systems for Video Technology
IS - 5
M1 - 9161021
ER -