作者
Azamat Suleymanov,Anne C Richer-De-Forges,Nicolas Saby,Dominique Arrouays,Manuel Martín,Antonio Bispo
摘要
Accurate soil property and class predictions through spatial modelling necessitate a thoughtful selection of explanatory variables and sample size, as their choice greatly impacts model performance. Within the framework of Global Soil Nutrient and Nutrient Budgets maps (GSNmap), the FAO Global Soil Partnership (GSP) launched a country-driven digital soil mapping (DSM) approach. The GSP asked the countries if they could implement the DSM prediction of ten soil properties, using their national point data and a set of widely available covariates (GSP_Cov). In this study, we examined the effect of including additional national-based covariates and soil observations on the performance of the prediction models using mainland France as a pilot. The learning soil dataset was based on a systematic 16-to-16 km grid. For a subset of soil properties, we also assessed using repeated k-fold cross-validation the effect of adding to this dataset many other irregularly spread measurements. The GSP_Cov included common widely available covariates that represented information about terrain, climate, and organisms. The second set of covariates consisted of the GSP_Cov, extended to extra covariates available at a national level, such as previously existing soil maps, geological maps, remote sensing products and others. Random Forest approach in combination with the Boruta selection method was employed for mapping ten soil properties: soil organic carbon (SOC), pH (water), total nitrogen (N), available phosphorus (P), available potassium (K), cation exchange capacity (CEC), bulk density (BD), and texture (clay, silt, and sand). The results revealed noteworthy enhancements in prediction performance for more than half of the properties, although, for some of them, the improvements were negligible. The most significant improvements were obtained for pH, CEC and texture, where geological variables and a previous pH map significantly contributed to the increase in accuracy. Adding numerous points (around 25,000) to the learning dataset improved the performance of soil particle-size fractions predictions. By broadening the spectrum of covariates and better covering the feature and geographical spaces considered in soil prediction models, this research underscores the importance of implementing a more diverse range of covariates at a national scale and of densifying soil information to enlarge the feature and geographical spaces of multidimensional soil/covariates combinations. This information should be taken into account in national and continental digital soil mapping endeavours.