中文版 | English
题名

Joint Classification of Hyperspectral Image and LiDAR Data Based on Spectral Prompt Tuning

作者
发表日期
2024
DOI
发表期刊
ISSN
1558-0644
卷号62
摘要
The pretrained vision-language models (VLMs) have achieved outstanding performance in various visual tasks, primarily due to the knowledge they have acquired from massive image-text pairs. This enables VLMs to generalize to a wide range of downstream tasks. This article presents the first attempt to adapt VLMs for the joint classification task of hyperspectral image (HSI) and LiDAR data, aiming to leverage the well-learned VLMs to extract more generalizable features from diverse remote sensing image sources. Initially, using a patch encoder (PE), low-dimensional patches of HSI and LiDAR data are transformed into high-dimensional latent feature representations, meeting the dimensional requirements of VLMs for visual input data. Unlike traditional classifiers that rely on discrete class labels, VLM-based classification methods depend on continuous vectors, which can be derived from textual templates with class names, i.e., prompts. The classification performance of VLM-based methods heavily relies on these prompts, but prompt engineering not only demands extensive expert knowledge but also is extremely time-consuming. To address this, prompt tuning (PT) methods are introduced to enhance the generalizability of VLMs by adding spectral-based prompts to the vision encoder and incorporating randomly initialized, learnable text prompts (TPs) into the text encoder. Finally, through a novel class-discriminative loss function, the distance between text features of different classes is increased, thereby enhancing the model’s discriminative ability. Experimental results on the Houston 2013, Trento, and MUUFL datasets demonstrate that the proposed method can achieve competitive classification accuracy with a limited number of labeled pixels.
相关链接[IEEE记录]
收录类别
SCI ; EI
学校署名
其他
ESI学科分类
GEOSCIENCES
引用统计
成果类型期刊论文
条目标识符http://sustech.caswiz.com/handle/2SGJ60CL/783817
专题工学院_计算机科学与工程系
作者单位
1.Engineering Research Center of Intelligent Control for Underground Space, Ministry of Education, and the School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
2.Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen, China
推荐引用方式
GB/T 7714
Yi Kong,Yuhu Cheng,Yang Chen,et al. Joint Classification of Hyperspectral Image and LiDAR Data Based on Spectral Prompt Tuning[J]. IEEE Transactions on Geoscience and Remote Sensing,2024,62.
APA
Yi Kong,Yuhu Cheng,Yang Chen,&Xuesong Wang.(2024).Joint Classification of Hyperspectral Image and LiDAR Data Based on Spectral Prompt Tuning.IEEE Transactions on Geoscience and Remote Sensing,62.
MLA
Yi Kong,et al."Joint Classification of Hyperspectral Image and LiDAR Data Based on Spectral Prompt Tuning".IEEE Transactions on Geoscience and Remote Sensing 62(2024).
条目包含的文件
条目无相关文件。
个性服务
原文链接
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
导出为Excel格式
导出为Csv格式
Altmetrics Score
谷歌学术
谷歌学术中相似的文章
[Yi Kong]的文章
[Yuhu Cheng]的文章
[Yang Chen]的文章
百度学术
百度学术中相似的文章
[Yi Kong]的文章
[Yuhu Cheng]的文章
[Yang Chen]的文章
必应学术
必应学术中相似的文章
[Yi Kong]的文章
[Yuhu Cheng]的文章
[Yang Chen]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
[发表评论/异议/意见]
暂无评论

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。