Content area
Introduction
The development of automated high-throughput plant phenotyping systems with non-destructive characteristics fundamentally relies on achieving accurate segmentation of botanical structures at both semantic and instance levels. However, most existing approaches rely heavily on empirically determined threshold parameters and rarely integrate semantic and instance segmentation within a unified framework.
Methods
To address these limitations, this study introduces a methodology leveraging 2D image data of real plants, i.e., Caladium bicolor, captured using a custom-designed plant cultivation platform. A high-quality 3D point cloud dataset was generated through reconstruction. Building on this foundation, we propose a streamlined Dual-Task Segmentation Network (DSN) incorporating a multi-head hierarchical attention mechanism to achieve superior segmentation performance. Also, the dual-task framework employs Multi-Value Conditional Random Field (MV-CRF) to enable semantic segmentation of stem-leaf and individual leaf identification through the DSN architecture when processing manually-annotated 3D point cloud data. The network features a dual-branch architecture: one branch predicts the semantic class of each point, while the other embeds points into a high-dimensional vector space for instance clustering. Multi-task joint optimization is facilitated through the MV-CRF model.
Results and discussion
Benchmark evaluations validate the novel framework’s segmentation efficacy, yielding 99.16% macro-averaged precision, 95.73% class-wise recognition rate, and an average Intersection over Union of 93.64%, while comparative analyses confirm its superiority over nine benchmark architectures in 3D point cloud analytics. For instance segmentation, the model achieved leading metrics of 87.94%, 72.36%, and 71.61%, respectively. Furthermore, ablation studies validated the effectiveness of the network’s design and substantiated the rationale behind each architectural choice.
Details
Comparative analysis;
Datasets;
Deep learning;
Conditional random fields;
Ablation;
Leaves;
Semantic segmentation;
Registration;
Automation;
Plants (botany);
Genotype & phenotype;
Benchmarks;
Efficiency;
Vector spaces;
Image reconstruction;
Image segmentation;
Computer vision;
Phenotyping;
Clustering;
Three dimensional models;
Effectiveness;
Instance segmentation;
Plant growth;
Annotations;
Morphology;
Semantics
1 School of Electronics and Information, Guangdong Polytechnic Normal University, Guangzhou, China
2 School of Information Engineering, Guangdong University of Technology, Guangzhou, China
3 School of Computer Science and Technology, Guangdong University of Technology, Guangzhou, China
4 Guangzhou Huitong Agricultural Technology Co., Ltd., Guangzhou, China
5 Guangzhou Huitong Agricultural Technology Co., Ltd., Guangzhou, China, Guangzhou iGrowLite Agricultural Technology Co., Ltd., Guangzhou, China