HyperDiff: Masked Diffusion Model with High-efficient Transformer for Hyperspectral Image Cross-Scene Classification

Pei Zhang, Dong Wang, Chanyue Wu, Jing Yang, Lei Kang, Zongwen Bai, Ying Li, Qiang Shen

Research output: Chapter in Book/Report/Conference proceeding › Conference Proceeding (Non-Journal item)


Abstract

Hyperspectral Image (HSI) cross-scene classification is a challenging task in remote sensing, particularly when real-time processing of Target Domain (TD) HSI is required and data cannot be reused for training. While deep learning methods have shown promising results, the generalization ability of HSI representations remains limited, mainly due to class label imbalance. This paper introduces a dual-stage learning framework based on transfer learning to enhance classification accuracy in the TD. The framework includes a self-supervised learning stage and a supervised fine-tuning stage. The self-supervised stage focuses on learning robust representations by leveraging inherent structures within HSI data, while the fine-tuning stage uses training labels to extract semantic information. A masked diffusion model predicts masked tokens from unmasked ones, capturing both high-level structures and fine details in HSI data. An efficient spatiospectral Transformer, which removes self-attention from the decoder, is proposed to enhance the self-supervised process. This design allows mask tokens to obtain information from visible tokens without interacting with each other, reducing sequence length and computational costs. Because each mask token is decoded conditionally independently, only a subset of masked tokens needs to be processed. Extensive experiments on two public HSI datasets demonstrate that the proposed method outperforms state-of-the-art techniques.
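The decoder idea in the abstract (mask tokens read from visible tokens but never from each other) can be illustrated with a small NumPy sketch. This is not the authors' implementation: the function name `cross_attention_decode`, the token counts, the mask ratio, the random weights, and the positional embeddings are all assumptions made for illustration, and the diffusion noise process is omitted entirely. The sketch only shows why decoding a subset of mask tokens gives the same result for those tokens as decoding all of them.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention_decode(mask_queries, visible_tokens, w_q, w_k, w_v):
    """Single-head cross-attention: queries come from mask tokens,
    keys and values come from visible tokens only (no self-attention)."""
    q = mask_queries @ w_q                      # (M, D)
    k = visible_tokens @ w_k                    # (V, D)
    v = visible_tokens @ w_v                    # (V, D)
    scores = q @ k.T / np.sqrt(q.shape[-1])     # (M, V)
    return softmax(scores, axis=-1) @ v         # (M, D)

rng = np.random.default_rng(0)
num_tokens, dim, mask_ratio = 16, 8, 0.75       # illustrative values (assumed)

tokens = rng.normal(size=(num_tokens, dim))     # stand-in for HSI patch tokens
pos = rng.normal(size=(num_tokens, dim))        # hypothetical positional embeddings
mask_token = rng.normal(size=(dim,))            # shared mask embedding (assumed)

# Randomly split token positions into masked and visible sets.
perm = rng.permutation(num_tokens)
num_masked = int(num_tokens * mask_ratio)
masked_idx, visible_idx = perm[:num_masked], perm[num_masked:]

queries = mask_token + pos[masked_idx]          # one query per masked position
visible = tokens[visible_idx] + pos[visible_idx]

w_q, w_k, w_v = (rng.normal(size=(dim, dim)) / np.sqrt(dim) for _ in range(3))

# Decode every mask token, then decode only the first four.
full = cross_attention_decode(queries, visible, w_q, w_k, w_v)
subset = slice(0, 4)
partial = cross_attention_decode(queries[subset], visible, w_q, w_k, w_v)
```

Each output row depends only on its own query and the visible tokens, so `partial` matches the corresponding rows of `full`. The score matrix has shape (masked, visible) rather than (all, all), which is where the reduced sequence length and cost would come from in a design of this kind.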
Original language: English
Title: 2025 IEEE International Conference on Acoustics, Speech and Signal Processing
Publisher: IEEE Press
Status: Accepted/In press - 20 Dec 2024
Event: 2025 IEEE International Conference on Acoustics, Speech and Signal Processing - Hyderabad, India
Duration: 06 Apr 2025 to 11 Apr 2025

Conference

Conference: 2025 IEEE International Conference on Acoustics, Speech and Signal Processing
Country/Territory: India
City: Hyderabad
Period: 06 Apr 2025 to 11 Apr 2025
