可解释性
随机森林
协变量
物种丰富度
广义线性模型
广义加性模型
生物多样性
机器学习
统计
生态学
线性模型
计算机科学
数学
生物
作者
Lian Brugere,Youngsang Kwon,Amy E. Frazier,Peter Kedron
标识
DOI:10.1016/j.foreco.2023.120972
摘要
Biodiversity is in decline globally and predicting species diversity is critically important if current trends are to be reversed. Tree species richness (TSR) has long been a key measure of biodiversity, but considerable uncertainties exist in current models, particularly given the classic statistical assumptions and poor ecological interpretability of machine learning outcomes. Here, we test several ecologically interpretable machine learning approaches to predict TSR and interpret the driving environmental factors in the continental United States. We develop two artificial neural networks (ANN) and one random forest (RF) model to predict TSR using Forest Inventory and Analysis data and 20 environmental covariates and compare them to a classic generalized linear model (GLM). Models were evaluated on an independent, unseen testing dataset using R2 and Mean Absolute Error (MAE) and residual spatial autocorrelation analysis. An Interpretable Machine Learning approach, SHapley Additive exPlanations (SHAP), was adopted to explain the major environmental factors driving TSR. Compared to a baseline GLM (R2 = 0.7; MAE = 4.7), the ANN and RF models achieved R2 greater than 0.9 and MAE<3.1. Additionally, the ANN and RF models produced less spatially clustered TSR residuals than the GLM. SHAP analysis suggested that TSR is best predicted by Aridity Index, Forest Area, Altitude, Mean Precipitation of the Driest Quarter and Mean Annual Temperature. SHAP further revealed a non-linear relationship of environmental covariates with TSR and complex interactions that were not revealed by the GLM. The study highlights the need for conservation efforts of forest areas and reducing precipitation-related physiological stress on tree species in low forested but arid regions. The machine learning approach used here is transferrable for studies of biodiversity for other organisms or prediction of TSR under future climatic scenarios.
科研通智能强力驱动
Strongly Powered by AbleSci AI