Mapping Key Soil Properties in Low Relief Areas Using Integrated Machine Learning and Geostatistics

37 Pages Posted: 14 Oct 2024

See all articles by Jiangheng Qiu

Jiangheng Qiu

Henan Agricultural University

Feng Liu

affiliation not provided to SSRN

Decai Wang

Henan Agricultural University

Kun Yan

Henan Agricultural University

Junhui Guo

affiliation not provided to SSRN

Yongkang Feng

Henan Agricultural University

Weijie Huang

Henan Agricultural University

Abstract

Digital soil mapping based on the soil-landscape model can infer soil information using readily available environmental covariates such as topography and vegetation. However, in low-relief areas where topographic factors are relatively uniform and vegetation conditions are similar, there is a big challenge for its application. Meanwhile, geostatistical models often have better performance in low relief areas compared to topographically complex areas. Therefore, we speculate that the method of combining geostatistical modelling with soil-landscape modelling can achieve higher prediction accuracy. We comprehensively selected multiple environmental covariates suitable for the study area and compared four models: Inverse Distance Weighting (IDW), Ordinary Kriging (OK), Random Forest (RF), and Random Forest Regression Kriging (RF-RK). These models were used to predict six soil properties (clay, silt, sand content, pH, cation exchange capacity, and soil organic matter content) in the study area, and their prediction accuracies were evaluated. The results were as follows: (1) When the spatial autocorrelation of soil property data was weak, the RF model, which does not consider spatial autocorrelation, yielded more accurate predictions for clay content and soil pH, with an R2 of 0.62 and 0.30, and an RMSE of 5.19% and 0.58, respectively. (2) When the spatial autocorrelation was strong, the RF-RK model, which accounts for both the soil-environment relationship and spatial autocorrelation, provided more accurate results for sand, silt content, cation exchange capacity, and soil organic matter. The RF-RK model achieved R² values of 0.81, 0.82, 0.79, and 0.69, and RMSE values of 3.43%, 5.23%, 1.68 cmol(+)/kg, and 6.02 g/kg, respectively. (3) Soil type and distance from the Yangtze River were the most important environmental variables explaining the spatial distribution of soil properties. The spatial heterogeneity of soil type and the geographical influence of the Yangtze River explained soil property variations better than other variables. This study highlights the potential of the hybrid modeling approach for digital soil mapping in low relief areas. High-precision spatial maps of key soil attributes in the study area can provide critical data support for land planning and sustainable agricultural development.

Keywords: Digital soil mapping, Soil texture, Soil organic matter, Soil-landscape model, Geostatistical models

Suggested Citation

Qiu, Jiangheng and Liu, Feng and Wang, Decai and Yan, Kun and Guo, Junhui and Feng, Yongkang and Huang, Weijie, Mapping Key Soil Properties in Low Relief Areas Using Integrated Machine Learning and Geostatistics. Available at SSRN: https://ssrn.com/abstract=4986895 or http://dx.doi.org/10.2139/ssrn.4986895

Jiangheng Qiu

Henan Agricultural University ( email )

Zhengzhou
China

Feng Liu

affiliation not provided to SSRN ( email )

No Address Available

Decai Wang (Contact Author)

Henan Agricultural University ( email )

Zhengzhou
China

Kun Yan

Henan Agricultural University ( email )

Zhengzhou
China

Junhui Guo

affiliation not provided to SSRN ( email )

No Address Available

Yongkang Feng

Henan Agricultural University ( email )

Zhengzhou
China

Weijie Huang

Henan Agricultural University ( email )

Zhengzhou
China

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
19
Abstract Views
126
PlumX Metrics