An Enhanced Geometric-Spectral Feature Learning Framework for Airborne Multispectral Point Cloud Classification

2026-06-08Computer Vision and Pattern Recognition

Computer Vision and Pattern RecognitionArtificial Intelligence
AI summary

The authors worked on improving how to classify land types using 3D data that includes multiple light wavelengths (multispectral point clouds). They created two new datasets and designed a model with two main parts: one focuses on the overall spectral information, and the other focuses on the geometric shape features, both enhanced with attention mechanisms. They combined these parts using a special fusion block and a loss function that helps the model learn better when data is uneven or similar between classes. Their method outperformed existing techniques on airborne data, and they are sharing their code and datasets for others to use.

multispectral point cloudland-cover classificationattention mechanismsfeature fusionself-attentionpoint convolutionresidual blockspectral featuresgeometric featuresloss function
Authors
Xian Li, Yanfeng Gu, Aleksandra Pižurica
Abstract
Multispectral point cloud (MPC) is composed of 3D spatial-spectral information, which holds tremendous potential for accurate land-cover classification. However, the representation power of classification models is limited by inherent high-dimensional and heterogeneous spatial-spectral information, unbalanced sample distribution, and inter-class spectral similarity of airborne MPCs. We build two MPC datasets and propose an enhanced geometric-spectral feature learning framework based on attentions for airborne MPC classification. A key component in our model is a two-stream feature fusion method with attention mechanisms, which enhances the representation capability of spatial-spectral features from high-dimensional heterogeneous MPCs. The first stream aims to extract position-encoded global spectral features with fusion self-attention, and the second stream comprises a multikernel point convolution and feature aggregation attention to extract spectral-guided geometric features. We then develop a residual attention fusion block to integrate the most informative geometric-spectral features from the two parallel streams. Another important contribution of this work is a joint loss function to improve the learning ability on unbalanced and interclass similar samples. Experimental results on two airborne MPC datasets demonstrate the effectiveness of the proposed method compared with the state-of-the-art methods. Furthermore, the codes and datasets used in this paper will be made available freely at https://github.com/HITlixian/TGRS_GSFF.