TY - JOUR
T1 - Hyperspectral Classification Based on Lightweight 3-D-CNN With Transfer Learning
AU - Zhang, Haokui
AU - Li, Ying
AU - Jiang, Yenan
AU - Wang, Peng
AU - Shen, Qiang
AU - Shen, Chunhua
N1 - Funding Information:
Dr. Shen was a recipient of the Australian Research Council Future Fellowship in 2012.
Funding Information:
Manuscript received August 29, 2018; revised January 7, 2019; accepted February 25, 2019. Date of publication April 11, 2019; date of current version July 22, 2019. This work was supported in part by the Foundation Project for Advanced Research Field of China under Grant 614023804016HK03002, in part by the National Natural Science Foundation of China under Grant 61871460 and Grant 61876152, in part by the National Key Research and Development Program of China under Grant 2016YFB0502502, and in part by the Innovation Foundation for Doctor Dissertation of Northwestern Polytechnical University under Grant CX201816. (Corresponding author: Ying Li.) H. Zhang is with the Shaanxi Provincial Key Laboratory of Speech and Image Information Processing, School of Computer Science, Northwestern Polytechnical University, Xi’an 710129, China, and also with the School of Computer Science, The University of Adelaide, Adelaide, SA 5005, Australia (e-mail: [email protected]).
Publisher Copyright:
© 1980-2012 IEEE.
PY - 2019/8/31
Y1 - 2019/8/31
N2 - Recently, hyperspectral image (HSI) classification approaches based on deep learning (DL) models have been proposed and shown promising performance. However, because of very limited available training samples and massive model parameters, DL methods may suffer from overfitting. In this paper, we propose an end-to-end 3-D lightweight convolutional neural network (CNN) (abbreviated as 3-D-LWNet) for limited samples-based HSI classification. Compared with conventional 3-D-CNN models, the proposed 3-D-LWNet has a deeper network structure, less parameters, and lower computation cost, resulting in better classification performance. To further alleviate the small sample problem, we also propose two transfer learning strategies: 1) cross-sensor strategy, in which we pretrain a 3-D model in the source HSI data sets containing a greater number of labeled samples and then transfer it to the target HSI data sets and 2) cross-modal strategy, in which we pretrain a 3-D model in the 2-D RGB image data sets containing a large number of samples and then transfer it to the target HSI data sets. In contrast to previous approaches, we do not impose restrictions over the source data sets, in which they do not have to be collected by the same sensors as the target data sets. Experiments on three public HSI data sets captured by different sensors demonstrate that our model achieves competitive performance for HSI classification compared to several state-of-the-art methods.
AB - Recently, hyperspectral image (HSI) classification approaches based on deep learning (DL) models have been proposed and shown promising performance. However, because of very limited available training samples and massive model parameters, DL methods may suffer from overfitting. In this paper, we propose an end-to-end 3-D lightweight convolutional neural network (CNN) (abbreviated as 3-D-LWNet) for limited samples-based HSI classification. Compared with conventional 3-D-CNN models, the proposed 3-D-LWNet has a deeper network structure, less parameters, and lower computation cost, resulting in better classification performance. To further alleviate the small sample problem, we also propose two transfer learning strategies: 1) cross-sensor strategy, in which we pretrain a 3-D model in the source HSI data sets containing a greater number of labeled samples and then transfer it to the target HSI data sets and 2) cross-modal strategy, in which we pretrain a 3-D model in the 2-D RGB image data sets containing a large number of samples and then transfer it to the target HSI data sets. In contrast to previous approaches, we do not impose restrictions over the source data sets, in which they do not have to be collected by the same sensors as the target data sets. Experiments on three public HSI data sets captured by different sensors demonstrate that our model achieves competitive performance for HSI classification compared to several state-of-the-art methods.
KW - hyperspectral classification
KW - deep learning
KW - 3D lightweight convolutional network
KW - transfer learning
KW - 3-D lightweight convolutional network (3-D-LWNet)
KW - deep learning (DL)
UR - http://www.scopus.com/inward/record.url?scp=85069753764&partnerID=8YFLogxK
U2 - 10.1109/TGRS.2019.2902568
DO - 10.1109/TGRS.2019.2902568
M3 - Article
SN - 0196-2892
VL - 57
SP - 5813
EP - 5828
JO - IEEE Transactions on Geoscience and Remote Sensing
JF - IEEE Transactions on Geoscience and Remote Sensing
IS - 8
M1 - 8685710
ER -