Cross-modal semantic segmentation of individual tree LiDAR point clouds is critical for accurately characterizing tree attributes, quantifying ecological interactions, and estimating carbon storage. In forest environments, however, this task faces key challenges such as high annotation costs and poor cross-domain generalization. To address these issues, this study proposes a cross-modal semantic transfer framework tailored to individual tree point cloud segmentation in forested scenes. Leveraging co-registered UAV-acquired RGB imagery and LiDAR data, we construct a technical pipeline of “2D semantic inference, 3D spatial mapping, cross-modal fusion” to enable annotation-free semantic parsing of 3D individual trees. Specifically, we first introduce a novel Multi-Source Feature Fusion Network (MSFFNet) to achieve accurate instance-level segmentation of individual trees in the 2D image domain. We then develop a hierarchical two-stage registration strategy to align the dense matched point clouds (MPC) generated from UAV imagery with the LiDAR point clouds. On this basis, we propose a probabilistic cross-modal semantic transfer model that builds a semantic probability field through multi-view projection and the expectation–maximization (EM) algorithm. By integrating geometric features and semantic confidence, the model establishes correspondences between 2D pixels and 3D points, enabling spatially consistent transfer of semantic annotations from the 2D image domain to the 3D point cloud domain. The proposed method is evaluated on two forest datasets. The results show that the proposed individual tree instance segmentation approach achieves the best performance, with an IoU of 87.60%, outperforming state-of-the-art methods such as Mask R-CNN, SOLOv2, and Mask2Former. Furthermore, the cross-modal semantic label transfer framework significantly outperforms existing mainstream methods for individual tree point cloud semantic segmentation in complex forest scenarios.
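
The core 2D-to-3D transfer step can be pictured with a minimal sketch, shown below. It is not the authors' implementation: it replaces the paper's EM-based semantic probability field with a simple confidence-weighted soft vote over multi-view projections, and the function names (project_points, accumulate_semantic_field), the 3x4 camera-matrix inputs, and the toy data are all illustrative assumptions.

# Minimal sketch (assumed workflow, not the authors' code): project LiDAR points
# into each registered RGB view, read the per-pixel class probabilities predicted
# by the 2D segmentation network, and accumulate them into a per-point semantic
# probability field; the argmax of that field gives the transferred 3D label.
import numpy as np

def project_points(points_xyz, P):
    """Project Nx3 world points with a 3x4 camera matrix P; return pixel coords and depth."""
    homo = np.hstack([points_xyz, np.ones((points_xyz.shape[0], 1))])  # N x 4
    cam = homo @ P.T                                                   # N x 3
    depth = cam[:, 2]
    uv = cam[:, :2] / np.clip(depth[:, None], 1e-6, None)              # N x 2 (u, v)
    return uv, depth

def accumulate_semantic_field(points_xyz, views, num_classes):
    """Soft-vote per-pixel class probabilities from all views into each 3D point.

    views: list of dicts with 'P' (3x4 projection) and 'prob' (H x W x C
    class-probability map from the 2D segmentation network).
    """
    field = np.zeros((points_xyz.shape[0], num_classes))
    for view in views:
        uv, depth = project_points(points_xyz, view['P'])
        h, w, _ = view['prob'].shape
        u = np.round(uv[:, 0]).astype(int)
        v = np.round(uv[:, 1]).astype(int)
        visible = (depth > 0) & (u >= 0) & (u < w) & (v >= 0) & (v < h)
        probs = view['prob'][v[visible], u[visible]]          # M x C
        # Weight each observation by the 2D network's confidence (max class prob).
        weights = probs.max(axis=1, keepdims=True)
        field[visible] += weights * probs
    # Normalize to a per-point distribution; points seen in no view stay uniform.
    totals = field.sum(axis=1, keepdims=True)
    return np.where(totals > 0, field / np.clip(totals, 1e-9, None), 1.0 / num_classes)

if __name__ == "__main__":
    # Toy example: 100 random points, one synthetic view, 3 classes.
    rng = np.random.default_rng(0)
    pts = rng.uniform(0, 10, size=(100, 3))
    P = np.hstack([np.eye(3), np.zeros((3, 1))])          # trivial pinhole at the origin
    prob_map = rng.dirichlet(np.ones(3), size=(64, 64))   # H x W x C probability map
    field = accumulate_semantic_field(pts, [{'P': P, 'prob': prob_map}], num_classes=3)
    print(field.argmax(axis=1)[:10])                      # transferred 3D labels

In the full framework described above, the soft vote would be replaced by EM updates of the semantic probability field and combined with geometric features; the sketch only conveys how multi-view projection lets 2D labels reach 3D points without manual point cloud annotation.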
Details
Datasets;
Deep learning;
Forestry;
Lidar;
Artificial neural networks;
Image annotation;
Mapping;
Data processing;
Unmanned aerial vehicles;
Semantic segmentation;
Trees;
Performance evaluation;
Statistical analysis;
Forests;
Image segmentation;
Carbon sequestration;
Three dimensional models;
Instance segmentation;
Image acquisition;
Methods;
Algorithms;
Semantics
; Chen, Ting 2; Zhang, Tao 1; Yang, Minglu 1; Ye, Yuan 3; Liu, Jiahao 4
1 School of Surveying and Geoinformation Engineering, East China University of Technology, Nanchang 330013, China; [email protected] (F.Z.); [email protected] (T.Z.); [email protected] (M.Y.); Jiangxi Key Laboratory of Watershed Ecological Process and Information, East China University of Technology, Nanchang 330013, China
2 School of Water Resources and Environmental Engineering, East China University of Technology, Nanchang 330013, China; [email protected]
3 Shenzhen DJI Innovations Technology Co., Ltd., Shenzhen 518057, China; [email protected]
4 Jiangxi Helicopter Co., Ltd., Jingdezhen 333036, China; [email protected]