Improving the accuracy of DBH estimation in Chinese fir using multi-source data fusion and interpretable machine learning algorithms

Rurao Fu; Huaiqing Zhang; Guangxing Wang; Xueyan Zhu; Hua Sun; Linlong Wang; Zeyu Cui; Jing Zhang; Longhua Yu; Rurao Fu; Huaiqing Zhang; Guangxing Wang; Xueyan Zhu; Hua Sun; Linlong Wang; Zeyu Cui; Jing Zhang; Longhua Yu

doi:10.48130/smartfor-0026-0004

Reliable, non-destructive estimation of individual-tree diameter at breast height (DBH) is essential for assessing forest biomass and growth. This study proposes an interpretable DBH estimation framework grounded in UAV-RGB imagery, which quantifies the incremental gains from stepwise multi-source feature fusion, and utilizes SHAP to characterize model reliance on different information sources. Using 986 Chinese fir trees from 20 plots in the Huangfengqiao State-owned Forest Farm (Hunan, China), we constructed four feature sets by progressively adding forest spatial structure (FSS), vegetation indices plus texture features (VI + TF), and climatic factors (CF) to a baseline set of individual-tree features (ITF), and benchmarked six machine-learning models. Under within-distribution evaluation, multi-source fusion consistently improved performance: the full feature set (Set 4) reduced mean MAE (averaged over the six models) from 1.61 to 1.50 cm, and increased mean R² from 0.79 to 0.82, with the multilayer perceptron (MLP) achieving the best accuracy (MAE = 1.44 cm, RMSE = 1.84 cm, R² = 0.84). SHAP analysis revealed that tree height and crown width are the primary information sources, while VI and texture features, as well as climatic variables, provide complementary predictive contributions. We conducted cross-region validation using a larger historical inventory dataset and found that incorporating CF improved accuracy under both within-region evaluation and the stricter leave-one-site-out (LOSO) extrapolation setting: MAE decreased by 15.1% under within-region evaluation, and by 7.17% under LOSO, while overall performance was more conservative under LOSO, reflecting distribution shift. Collectively, the proposed framework provides a low-cost, interpretable, and transferable strategy for UAV-RGB based DBH estimation.

HTML

Introduction

Accurate measurement of DBH is fundamental to a wide range of forest science applications, including forest structure analysis, species competition assessment, ecological niche evaluation, and biomass estimation^[1]. Against the backdrop of accelerating climate change and increasing anthropogenic disturbance, improving the efficiency and accuracy of DBH monitoring has become an urgent priority for both forest research and management practice. However, conventional DBH assessment relies on ground-based field measurements and plot inventories, which although accurate, are labor- and cost-intensive, limiting spatiotemporal scalability for forest monitoring^[2]. Consequently, there is a growing need for cost-effective approaches that enable rapid and consistent DBH monitoring at landscape scales over extended periods.

Proximal remote sensing technologies, when integrated with artificial intelligence algorithms, offer a practical pathway for improving DBH estimation^[3]. In recent years, unmanned aerial vehicles (UAVs) have been widely adopted in forest health monitoring, structural attribute assessment, tree height measurement, and biomass estimation, owing to their high efficiency, operational flexibility, and relatively low cost^[4]. High-resolution RGB imagery, in particular, has emerged as a cost-effective data source due to its affordability, accessibility, broad spatial coverage, and potential for repeated measurements. These characteristics make UAV-RGB derived data a promising basis for supporting continuous updates and iterative refinement of forest digital twin workflows^[5]. Recent studies have shown that texture and spectral features derived from RGB imagery can be strongly associated with DBH^[6]. When coupled with machine learning techniques, these features enable high-accuracy DBH estimation, even when relying solely on two-dimensional visible-spectrum data.

In recent years, computer vision has advanced substantially in DBH estimation using high-resolution RGB imagery. With the development of image-segmentation techniques such as convolutional neural networks (CNNs) and YOLO architectures^[7], deep learning has been widely adopted in forestry applications to extract individual-tree crown attributes and tree height across large forested areas^[8]. However, DBH is not directly observable from RGB imagery; researchers typically estimate it indirectly using individual-tree features such as crown width and tree height, using regression-type prediction models^[9]. Although strong correlations between DBH and these features have been well documented, ecosystem heterogeneity and nonlinear effects of environmental conditions and stand competition can complicate these relationships^[10]. Traditional linear models often fail to capture these complex interactions effectively^[11].

To address these limitations, machine learning (ML) techniques have been adopted to DBH prediction tasks^[12]. Compared with traditional regression models, ML methods impose fewer parametric assumptions and can flexibly capture complex nonlinear relationships among multivariate features, thereby enhancing prediction accuracy and generalization ability^[13]. Previous studies have demonstrated that Bayesian neural networks (BNNs) can accurately estimate DBH across different tree species^[14], and that Support vector regression (SVR), Random forest (RF), and Artificial neural networks (ANNs) outperform traditional linear regression in modeling individual-tree DBH for larch^[15]. Similarly, Iizuka^[16] demonstrated that incorporating multiple variables could effectively improve the prediction accuracy of SVR for estimating the DBH of Japanese cypress (Chamaecyparis obtusa). In addition, deep learning approaches, such as Deep learning algorithms (DLAs), have also been applied to DBH estimation and have achieved notable improvements in modeling accuracy^[17]. However, many existing studies emphasize maximizing accuracy with a limited set of structural predictors, whereas the integration of environmental drivers and systematic interpretability analyses of model predictions receive comparatively less attention.

Most previous studies have primarily emphasized improving predictive performance in ML-based DBH estimation, while giving comparatively less attention to model interpretability. Complex ML models are often regarded as 'black boxes', with opaque internal decision processes and unclear feature contributions, which limit their practical application in forest resource management. Shapley Additive Explanations (SHAP), a widely used post hoc interpretability framework^[18], quantifies the contribution of each feature to an individual prediction and supports both global and local assessments of model behavior, thereby offering actionable insights for diagnosis and optimization of complex models^[19]. As the dimensionality of input predictors increases, the focus in DBH modeling has gradually shifted from accuracy alone to joint consideration of predictive performance and interpretability^[20]. Robust interpretability analysis can improve user confidence, support evidence-based model selection, and reveal key drivers and interaction patterns underlying DBH variation, thereby informing the development of regionally adaptive forest resource estimation models.

Chinese fir (Cunninghamia lanceolata (Lamb.) Hook.) is a major plantation timber species across subtropical regions and contributes substantially to regional carbon storage and timber supply. With the increasing demand for precision forest management, there is a clear need for an efficient, low-cost, and interpretable approach to individual-tree DBH estimation. Accordingly, this study investigates how stepwise multi-source feature fusion affects DBH prediction and leverages an interpretable modeling framework to better understand the determinants of DBH variation. Specifically, we aim to: (1) benchmark multiple ML models under progressively enriched feature sets; (2) quantify the marginal contributions of structural, spectral–texture, and climatic predictors to DBH estimation, and further validate the added value of climatic factors under both within-region evaluation and the stricter leave-one-site-out extrapolation setting using cross-region data; 3) employ SHAP to interpret model behavior at both global and local scales, thereby identifying key contributing factors and improving model transparency. Collectively, this work enhances UAV-RGB-based DBH estimation and provides an interpretable multi-source fusion framework that can support more intelligent forest inventory and resource monitoring.

Discussion

This study proposes a UAV-RGB based framework for estimating individual-tree DBH in Chinese fir. By combining stepwise multi-source feature fusion with SHAP-based interpretation, the framework improves predictive accuracy while enhancing model interpretability. Under within-distribution evaluation, expanding predictors from individual-tree features (ITF) to forest spatial structure (FSS), vegetation indices plus texture features (VI + TF), and climatic factors (CF) improved performance for all six models, with the best results achieved under the full feature set (Set 4). Model responses to feature enrichment differed across algorithms: MLP and SVR showed larger relative gains under higher-dimensional inputs, indicating that complementary information sources provide additional constraints that improve model fit and reduce prediction error. Moreover, cross-region validation further demonstrated that incorporating climatic factors increases accuracy under within-region evaluation; however, under the stricter leave-one-site-out (LOSO) extrapolation setting, overall performance declined noticeably relative to within-distribution evaluation.

Contributions of multi-source feature fusion and model architecture to DBH estimation Accuracy
A broadly consistent conclusion in prior data-fusion studies is that integrating multi-source predictors can improve the accuracy of estimating ecological and forestry target variables. Different information sources are complementary in spatial scale, sensitivity, and error structure, and joint modeling can enhance the identifiability of the target variable while reducing systematic bias associated with relying on a single source^[33,34]. This is consistent with the gains observed in our within-distribution evaluation. When DBH is estimated using only a limited set of tree attributes, the available information is often insufficient to represent the underlying variability. In the context of UAV-RGB based modeling, no single predictor source can fully characterize DBH, whereas structural attributes, spectral-textural proxies, and climatic background information provide complementary constraints. Integrating these multi-source predictors allows the model to represent tree structure, canopy spectral-textural patterns, and external site conditions within a richer feature space, thereby yielding more stable reductions in error and improved fit under within-distribution conditions.

From a modeling perspective, the observed differences in how algorithms benefit from increasing feature dimensionality are expected. The larger relative gains of MLP and SVR under Set 4 suggest that, when sample size is adequate, and predictors are more diverse, nonlinear relationships and potential interactions between DBH and multiple predictor groups become more influential for performance. These models are often better suited to learning complex nonlinear mappings in high-dimensional regression settings, and therefore can more readily translate additional information into error reduction^[35]. In contrast, the comparatively strong stability of tree-based models under lower-dimensional inputs is often related to their splitting rules, ensemble averaging, and relative insensitivity to certain noisy predictors. When information is limited, they can achieve a robust baseline without requiring complex function approximation. As the feature space expands and correlations and redundancy increase, differences among models in their ability to represent and exploit informative signals become more pronounced^[36]. Accordingly, understanding model mechanisms is important for selecting algorithms and designing predictor sets that match the requirements of a given estimation task.

The DBH class-based analysis further highlights heterogeneity in the error structure. For MLP, MAE increased with DBH class, and uncertainty was most pronounced for large trees (> 20 cm); the benefits of multi-source feature fusion were also concentrated in the medium and large DBH classes. Consistent with the regression scatter patterns, predictions in the high-DBH range tended to be more dispersed and showed indications of systematic bias, with larger trees more likely to be underestimated. This pattern may be attributable to the relatively small and imbalanced sample size of large trees, as well as greater crown-shape complexity that can amplify feature-extraction uncertainty. Similar findings have been reported in UAV-based estimation studies, where errors and bias become larger when analyses are stratified by diameter class^[37,38]. Future work could address this issue by increasing the representation of large trees and by adopting DBH class-stratified sampling or weighted training, enabling a more rigorous examination of error propagation and uncertainty across diameter classes.

SHAP-based interpretability and feature contribution mechanisms
SHAP provides a powerful quantitative tool for understanding model behavior. Under Set 4, we performed both global and local interpretation for the six models. Tree height (H) and crown width (CW) consistently ranked as the top two predictors across all models, which accords with biological intuition regarding the structural basis of DBH and is also consistent with findings from prior studies that rely primarily on individual-tree structural predictors^[15]. SHAP analysis further indicated that several vegetation indices and texture features (e.g., ASM, WI, and GBRI), as well as climatic factors (e.g., DD18 and MAP), contributed substantially in some models. Their SHAP summary and waterfall patterns exhibited clear contribution directions and sample-level dispersion, which helps differentiate model-specific reliance patterns and provides practical evidence for feature selection and iterative model refinement.

Importantly, SHAP describes predictive contributions rather than ecological causality^[39]. A high SHAP value indicates that, given the observed data distribution and modeling assumptions, the model relies more heavily on a variable to form predictions; it does not imply that the variable causally drives DBH growth in a physiological or ecological sense. Although the contribution directions of some predictors broadly align with ecological expectations, higher DD18 and greater precipitation are generally associated with larger predicted DBH; however, a more appropriate reading is that these variables provide predictive information that co-varies with growth differences in the observed data. Their ecological influence is more likely expressed through indirect pathways, such as modifying growing-season length and regulating hydrothermal stress and carbon assimilation and allocation, rather than through a direct effect on DBH. To more clearly distinguish predictive contribution from ecological mechanism, future work could integrate structural equation modeling, multi-scale attribution, or causal-inference frameworks to test plausible pathways for key climatic variables, thereby improving the verifiability of mechanism-oriented interpretation while maintaining predictive performance.

Regional extension and climatic-factor validation
In the cross-region validation experiments, we found that incorporating climatic factors (CF) improved overall DBH estimation accuracy under both the relatively lenient within-region evaluation and the stricter leave-one-site-out (LOSO) extrapolation setting. However, the two validation settings correspond to different application scenarios. Under within-region evaluation, training and test samples come from the same region and therefore share similar site conditions, stand structure, and environmental gradients. As a result, spatial dependence can lead to more optimistic performance estimates, which are more representative of estimation tasks in known areas^[40]. In contrast, LOSO holds out an entire region, making the test data more likely to deviate from the training data in covariate distributions and their joint structure. This setting is closer to practical requirements for cross-region transfer and prediction at unseen sites, and therefore typically yields more stringent and conservative estimates^[41]. For spatial extrapolation tasks, grouped or block-based validation is widely considered more appropriate for assessing true transferability to unknown regions and for reducing performance inflation caused by spatial autocorrelation.

From a machine-learning perspective, LOSO represents a stronger form of distribution shift. This implies that feature combinations and model configurations that perform well under within-region evaluation do not necessarily retain their advantage under extrapolation. This pattern is clearly reflected in our feature-increment experiment. For the MLP model used in this analysis, performance was optimal when only the top three key features were included, whereas adding further features led to pronounced fluctuations and degradation. This indicates that, for MLP, a larger feature set does not necessarily translate into stronger generalization, and the model can be more sensitive to regional differences and covariate shift.

The non-monotonic performance and instability of MLP at certain feature-set sizes may be explained by its learning mechanism. Owing to its strong nonlinear fitting capacity, MLP can more readily absorb weakly relevant signals or region-specific patterns as input dimensionality increases, which can reduce transferability under distribution shift^[42]. When the feature distributions or feature-response relationships shift in the held-out region, these patterns may not transfer, leading to higher extrapolation error and greater fold-to-fold variability^[43]. Such behavior is more likely when inputs are high-dimensional, predictors are strongly correlated, or marginal distributions differ substantially across regions. These considerations also have direct implications for hyperparameter tuning. Under within-region evaluation, hyperparameter search often favors weaker regularization and higher capacity to minimize validation error; under a strong extrapolation setting, such as LOSO, carrying over those configurations can lead to overfitting to the statistical structure of the training regions and reduce cross-region stability. Therefore, for LOSO extrapolation, we recommend tuning toward stronger regularization and more conservative capacity control, trading some in-region fit for improved robustness under extrapolation.

Limitations and prospects
UAV-RGB imagery is cost-efficient and enables rapid coverage, making the proposed framework a practical pathway for low-cost, intelligent forest-resource monitoring. However, under real-world deployment, DBH estimation errors arise not only from the regression model itself but also from upstream processing steps, including the quality of image preprocessing, single-tree segmentation errors, edge-tree identification bias, and the resulting propagation of feature-extraction errors. Therefore, a key application-oriented priority is to quantify uncertainty sources and their propagation pathways. Future work can conduct end-to-end uncertainty assessment and use perturbation experiments or related sensitivity analyses to quantify how uncertainties propagate to the final DBH predictions, thereby providing more reliable error bounds and a clearer scope of applicability for operational deployment.

In addition, while the current framework performs well in our experiments, its evaluation has so far been limited to the data conditions considered in this study. Its broader applicability should be further validated in more heterogeneous settings, such as mixed-species stands, structurally more complex forests, and regions spanning stronger climatic gradients. Under these conditions, domain shift is likely to be more pronounced, and the feature set and validation strategy may require corresponding adjustments, for example, adopting stricter spatial blocking or grouped validation, and further examining feature selection and regularization strategies targeted at cross-domain robustness. Moreover, the extrapolation results in this study highlight the inherent limitations of conventional supervised learning models in cross-region generalization. Looking forward, a promising direction is to adopt a pretraining–fine-tuning paradigm: leveraging large-scale, multi-source long-term inventory datasets for offline pretraining to enhance feature representation, followed by fine tuning on target-region data to improve robustness to distribution shift and deployment performance. This direction falls within the broader framework of transfer learning and domain adaptation and should be systematically evaluated under cross-region and multi-temporal data settings.

[1]	Umemi K, Inoue A. 2024. A model for predicting mean diameter at breast height from mean tree height and stand density. Journal of Forest Research 29(3):186−195 doi: 10.1080/13416979.2024.2311946 CrossRef Google Scholar
[2]	Song C, Yang B, Zhang L, Wu D. 2021. A handheld device for measuring the diameter at breast height of individual trees using laser ranging and deep-learning based image recognition. Plant Methods 17(1):67 doi: 10.1186/s13007-021-00748-z CrossRef Google Scholar
[3]	Bian L, Zhang H, Ge Y, Čepl J, Stejskal J et al. 2022. Closing the gap between phenotyping and genotyping: review of advanced, image-based phenotyping technologies in forestry. Annals of Forest Science 79(1):22 doi: 10.1186/s13595-022-01143-x CrossRef Google Scholar
[4]	Liu K, Shen X, Cao L, Wang G, Cao F. 2018. Estimating forest structural attributes using UAV-LiDAR data in Ginkgo plantations. ISPRS Journal of Photogrammetry and Remote Sensing 146:465−482 doi: 10.1016/j.isprsjprs.2018.11.001 CrossRef Google Scholar
[5]	Qiu H, Zhang H, Lei K, Zhang H, Hu X. 2023. Forest digital twin: a new tool for forest management practices based on spatio-temporal data, 3D simulation Engine, and intelligent interactive environment. Computers and Electronics in Agriculture 215:108416 doi: 10.1016/j.compag.2023.108416 CrossRef Google Scholar
[6]	Mao Z, Lu Z, Wu Y, Deng L. 2023. DBH estimation for individual tree: two-dimensional images or three-dimensional point clouds? Remote Sensing 15(16):4116 doi: 10.3390/rs15164116 CrossRef Google Scholar
[7]	Zhu X, Chen F, Zhang X, Zheng Y, Peng X, et al. 2024. Detection the maturity of multi-cultivar olive fruit in orchard environments based on Olive-EfficientDet. Scientia Horticulturae 324:112607 doi: 10.1016/j.scienta.2023.112607 CrossRef Google Scholar
[8]	Hu G, Wang T, Wan M, Bao W, Zeng W. 2022. UAV remote sensing monitoring of pine forest diseases based on improved Mask R-CNN. International Journal of Remote Sensing 43:1274−1305 doi: 10.1080/01431161.2022.2032455 CrossRef Google Scholar
[9]	Moe KT, Owari T, Furuya N, Hiroshima T, Morimoto J. 2020. Application of UAV photogrammetry with LiDAR data to facilitate the estimation of tree locations and DBH values for high-value timber species in northern Japanese mixed-wood forests. Remote Sensing 12(17):2865 doi: 10.3390/rs12172865 CrossRef Google Scholar
[10]	Fu L, Duan G, Ye Q, Meng X, Luo P, et al. 2020. Prediction of Individual Tree Diameter Using a Nonlinear Mixed-Effects Modeling Approach and Airborne LiDAR Data. Remote Sensing 12(7):1066 doi: 10.3390/rs12071066 CrossRef Google Scholar
[11]	Guo H, Jia W, Li D, Sun Y, Wang F, et al. 2023. Modelling branch growth of Korean pine plantations based on stand conditions and climatic factors. Forest Ecology and Management 546:121318 doi: 10.1016/j.foreco.2023.121318 CrossRef Google Scholar
[12]	He X, Lei X, Liu D, Lei Y. 2023. Developing machine learning models with multiple environmental data to predict stand biomass in natural coniferous-broad leaved mixed forests in Jilin Province of China. Computers and Electronics in Agriculture 212:108162 doi: 10.1016/j.compag.2023.108162 CrossRef Google Scholar
[13]	Jordan MI, Mitchell TM. 2015. Machine learning: Trends, perspectives, and prospects. Science 349:255−260 doi: 10.1126/science.aaa8415 CrossRef Google Scholar
[14]	Xu J, Su M, Sun Y, Pan W, Cui H, et al. 2024. Tree crown segmentation and diameter at breast height prediction based on BlendMask in unmanned aerial vehicle imagery. Remote Sensing 16(2):368 doi: 10.3390/rs16020368 CrossRef Google Scholar
[15]	Sun Y, Jin X, Pukkala T, Li F. 2022. Predicting individual tree diameter of Larch (Larix olgensis) from UAV-LiDAR data using six different algorithms. Remote Sensing 14(5):1125 doi: 10.3390/rs14051125 CrossRef Google Scholar
[16]	Iizuka K, Kosugi Y, Noguchi S, Iwagami S. 2022. Toward a comprehensive model for estimating diameter at breast height of Japanese cypress (Chamaecyparis obtusa) using crown size derived from unmanned aerial systems. Computers and Electronics in Agriculture 192:106579 doi: 10.1016/j.compag.2021.106579 CrossRef Google Scholar
[17]	Ercanlı İ. 2020. Innovative deep learning artificial intelligence applications for predicting relationships between individual tree height and diameter at breast height. Forest Ecosystems 7(1):12 doi: 10.1186/s40663-020-00226-3 CrossRef Google Scholar
[18]	Lundberg SM, Lee SI. 2017. A unified approach to interpreting model predictions. Advances in Neural Information Processing Systems 30 (NIPS 2017), Long Beach, CA, USA. https://proceedings.neurips.cc/paper_files/paper/2017/file/8a20a8621978632d76c43dfd28b67767-Paper.pdf
[19]	Xiang H, Shen Z, Tan L, Gao C, Wu G, et al. 2024. Community identification and carbon storage monitoring of Heritiera littoralis with UAV hyperspectral imaging. Ecological Indicators 167:112653 doi: 10.1016/j.ecolind.2024.112653 CrossRef Google Scholar
[20]	Li X, Du H, Mao F, Xu Y, Huang Z, et al. 2024. Estimation aboveground biomass in subtropical bamboo forests based on an interpretable machine learning framework. Environmental Modelling & Software 178:106071 doi: 10.1016/j.envsoft.2024.106071 CrossRef Google Scholar
[21]	da Silva AKV, Borges MVV, Batista TS, da Silva Junior CA, Furuya DEG, et al. 2021. Predicting eucalyptus diameter at breast height and total height with UAV-based spectral indices and machine learning. Forests 12(5):582 doi: 10.3390/f12050582 CrossRef Google Scholar
[22]	Qiu H, Zhang H, Lei K, Wang J, Zhang H, et al. 2025. A novel method for forest spatial structure heterogeneity evaluation of plantation utilizing point-wise vector network and neighborhood index. Computers and Electronics in Agriculture 229:109774 doi: 10.1016/j.compag.2024.109774 CrossRef Google Scholar
[23]	Wang L, Zhang H, Lei K, Yang T, Zhang J, et al. 2024. A novel forest dynamic growth visualization method by incorporating spatial structural parameters based on convolutional neural network. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 17:3471−3488 doi: 10.1109/JSTARS.2023.3342445 CrossRef Google Scholar
[24]	Hui G, Zhang G, Zhao Z, Yang A. 2019. Methods of forest structure research: a review. Current Forestry Reports 5:142−154 doi: 10.1007/s40725-019-00090-7 CrossRef Google Scholar
[25]	Wang T, Wang G, Innes JL, Seely B, Chen B. 2017. ClimateAP: an application for dynamic local downscaling of historical and future climate data in Asia Pacific. Frontiers of Agricultural Science and Engineering 4(4):448−458 doi: 10.15302/j-fase-2017172 CrossRef Google Scholar
[26]	Zhang X, Wang H, Chhin S, Zhang J. 2020. Effects of competition, age and climate on tree slenderness of Chinese fir plantations in southern China. Forest Ecology and Management 458:117815 doi: 10.1016/j.foreco.2019.117815 CrossRef Google Scholar
[27]	Liu Z, Huang T, Wu Y, Zhang X, Liu C, et al. 2024. Aboveground biomass inversion of forestland in a Jinsha River dry-hot valley by integrating high and medium spatial resolution optical images: a case study on Yuanmou County of Southwest China. Ecological Informatics 83:102796 doi: 10.1016/j.ecoinf.2024.102796 CrossRef Google Scholar
[28]	Ahmad Anees S, Mehmood K, Khan WR, Sajjad M, Alahmadi TA, et al. 2024. Integration of machine learning and remote sensing for above ground biomass estimation through Landsat-9 and field data in temperate forests of the Himalayan region. Ecological Informatics 82:102732 doi: 10.1016/j.ecoinf.2024.102732 CrossRef Google Scholar
[29]	Qadeer A, Shakir M, Wang L, Talha SM. 2024. Evaluating machine learning approaches for aboveground biomass prediction in fragmented high-elevated forests using multi-sensor satellite data. Remote Sensing Applications: Society and Environment 36:101291 doi: 10.1016/j.rsase.2024.101291 CrossRef Google Scholar
[30]	Prokhorenkova L, Gusev G, Vorobev A, Dorogush AV, Gulin A. 2018. CatBoost: unbiased boosting with categorical features. Advances in Neural Information Processing Systems 31 (NeurIPS 2018), Montréal, Canada. https://proceedings.neurips.cc/paper_files/paper/2018/file/14491b756b3a51daac41c24863285549-Paper.pdf
[31]	Ke G, Meng Q, Finley T, Wang T, Chen W, et al. 2017. LightGBM: a highly efficient gradient boosting decision tree. Advances in Neural Information Processing Systems 30 (NIPS 2017), Long Beach, CA, USA. https://proceedings.neurips.cc/paper_files/paper/2017/file/6449f44a102fde848669bdd9eb6b76fa-Paper.pdf
[32]	Bayat M, Bettinger P, Hassani M, Heidari S. 2021. Ten-year estimation of Oriental beech (Fagus orientalis Lipsky) volume increment in natural forests: a comparison of an artificial neural networks model, multiple linear regression and actual increment. Forestry 94(4):598−609 doi: 10.1093/forestry/cpab001 CrossRef Google Scholar
[33]	Yan X, Li J, Smith AR, Yang D, Ma T, et al. 2023. Evaluation of machine learning methods and multi-source remote sensing data combinations to construct forest above-ground biomass models. International Journal of Digital Earth 16(2):4471−4491 doi: 10.1080/17538947.2023.2270459 CrossRef Google Scholar
[34]	Liang Y, Kou W, Lai H, Wang J, Wang Q, et al. 2022. Improved estimation of aboveground biomass in rubber plantations by fusing spectral and textural information from UAV-based RGB imagery. Ecological Indicators 142:109286 doi: 10.1016/j.ecolind.2022.109286 CrossRef Google Scholar
[35]	Borisov V, Leemann T, Seßler K, Haug J, Pawelczyk M, et al. 2024. Deep neural networks and tabular data: a survey. IEEE Transactions on Neural Networks and Learning Systems 35:7499−7519 doi: 10.1109/TNNLS.2022.3229161 CrossRef Google Scholar
[36]	Shwartz-Ziv R, Armon A. 2022. Tabular data: deep learning is not all you need. Information Fusion 81:84−90 doi: 10.1016/j.inffus.2021.11.011 CrossRef Google Scholar
[37]	Tinkham WT, Swayze NC, Hoffman CM, Lad LE, Battaglia MA. 2022. Modeling the missing DBHs: Influence of model form on UAV DBH characterization. Forests 13(12):2077 doi: 10.3390/f13122077 CrossRef Google Scholar
[38]	Erfanifard Y, Hosingholizade A, Griess VC, Millan VEG, Pirasteh S. 2025. Estimating tree diameter at breast height (DBH) from UAV data: a comparison of oblique–Vertical imagery fusion and allometric modeling. Science of Remote Sensing 2025:100331 doi: 10.1016/j.srs.2025.100331 CrossRef Google Scholar
[39]	Heskes T, Sijben E, Bucur IG, Claassen T. 2020. Causal shapley values: Exploiting causal knowledge to explain individual predictions of complex models. Advances in Neural Information Processing Systems 33 (NeurIPS 2020), Vancouver, Canada. pp. 4778−4789 https://proceedings.neurips.cc/paper_files/paper/2020/file/32e54441e6382a7fbacbbbaf3c450059-Paper.pdf
[40]	Roberts DR, Bahn V, Ciuti S, Boyce MS, Elith J, et al. 2017. Cross-validation strategies for data with temporal, spatial, hierarchical, or phylogenetic structure. Ecography 40(8):913−929 doi: 10.1111/ecog.02881 CrossRef Google Scholar
[41]	Ploton P, Mortier F, Réjou-Méchain M, Barbier N, Picard N, et al. 2020. Spatial validation reveals poor predictive performance of large-scale ecological mapping models. Nature Communications 11:4540 doi: 10.1038/s41467-020-18321-y CrossRef Google Scholar
[42]	Geirhos R, Jacobsen JH, Michaelis C, Zemel R, Brendel W, et al. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence 2(11):665−673 doi: 10.1038/s42256-020-00257-z CrossRef Google Scholar
[43]	Zhou K, Liu Z, Qiao Y, Xiang T, Loy CC. 2023. Domain generalization: a survey. IEEE Transactions on Pattern Analysis and Machine Intelligence 45(4):4396−4415 doi: 10.1109/TPAMI.2022.3195549 CrossRef Google Scholar

Dataset	Number of images	Numbers of crowns
Train	348	43,848
Validation	116	20,548
Test	116	19,738

VI	Name	Algorithm formula
VARI	Visible atmospherically resistant index	$ (g-r)/(g+r-b) $
ExR	Excess red vegetation index	$ 1.4r-g $
ExB	Excess blue vegetation index	$ 1.4b-g $
ExG	Excess Green Vegetation Index	$ 2g-r-b $
GBRI	Green–blue ratio index	$ g/b $
RBRI	Red–blue ratio index	$ r/b $
WI	Woebbecke index	$ (g-b)/(r-g) $
GLI	Green leaf index	$ (2g-b-r)/(2g+b+r) $
NDI	Normalized difference index	$ (r-g)/(r+g+0.01) $
MGRVI	Modified green red vegetation index	$ ({g}^{2}-{r}^{2})/({g}^{2}+{r}^{2}) $

TF	Texture features	Algorithm formula
MEA	Mean	$ mea=\displaystyle\sum \limits_{i,j}^{N-1}{iP}_{i,j} $
VAR	Variance	$ var=\displaystyle\sum \limits_{i,j=0}^{N-1}{iP}_{i,j}{(i-mea)}^{2} $
HOM	Homogeneity	$ hom=\displaystyle\sum \limits_{i,j=0}^{N-1}i\dfrac{{P}_{i,j}}{1+{i-j}^{2}} $
CON	Contrast	$ con=\displaystyle\sum \limits_{i,j=0}^{N-1}i{P}_{i,j}{(i-j)}^{2} $
DIS	Dissimilarity	$ dis=\displaystyle\sum \limits_{i,j=0}^{N-1}i{P}_{i,j}\left\| i-j\right\| $
ENT	Entropy	$ ent=\displaystyle\sum \limits_{i,j=0}^{N-1}i{P}_{i,j}\left(-\ln {P}_{i,j}\right) $
ASM	Angular second moment	$ asm=\displaystyle\sum \limits_{i,j=0}^{N-1}i{{{P}^{2}}}_{i,j} $
COR	Correlation	$ cor=\displaystyle\sum \limits_{i,j=0}^{N-1}i{P}_{i,j}\left[\dfrac{(i-mea)(j-mea)}{\sqrt{{var}_{i}*{var}_{j}}}\right] $

Model	Hyper-parameter values
RF	max_samples: 0.2, 0.5, 0.8; max_depth:1, 5, 10, 15; min_samples_split: 2, 5, 7, 10
XGBoost	max_depth: 3, 5, 10, 15; reg_lambda: 0.1, 0.5, 1, 2; min_child_weight: 1, 5, 10, 15; learning_rate: 0.001, 0.01, 0.1, 0.05
CatBoost	depth: 3, 5, 7, 10; subsample: 0.2, 0.5, 0.8; min_data_in_leaf: 1, 3, 5, 10; learning_rate: 0.001, 0.01, 0.1, 0.5
LightGBM	subsample: 0.2, 0.5, 0.8; max_depth: −1, 5, 10, 15; num_leaves: 5, 10, 15, 20; min_child_sample: 5, 10, 15, 20; learning_rate: 0.001, 0.01, 0.1, 0.05
SVR	gamma: scale, auto; kernel: rbf, sigmoid; epsilon: 0.1, 0.3, 0.5, 0.7; C: 0.1, 1, 10, 50, 90, 100, 110
MLP	solver: adam, sgd; alpha: 0.01, 0.1, 1, 2, 5, 10; learning_rate_init: 0.1, 0.01, 0.001, 0.05; hidden_layer_sizes: (64, ), (64, 32), (128, ), (128,64), (128, 64, 32), (128, 65, 32, 16)

{{lists.name}}

Improving the accuracy of DBH estimation in Chinese fir using multi-source data fusion and interpretable machine learning algorithms

Abstract

Rights and permissions

References

About this article

Cite this article

Article Metrics

Access History

Other Articles By Authors