Publications

Wang, Y; Qin, RZ; Cheng, HZ; Liang, TG; Zhang, KP; Chai, N; Gao, JL; Feng, QS; Hou, MJ; Liu, J; Liu, CL; Zhang, WJ; Fang, YJ; Huang, J; Zhang, F (2022). Can Machine Learning Algorithms Successfully Predict Grassland Aboveground Biomass?. REMOTE SENSING, 14(16), 3843.

Abstract
The timely and accurate estimation of grassland aboveground biomass (AGB) is important. Machine learning (ML) has been widely used in the past few decades to deal with complex relationships. In this study, based on an 11-year period (2005-2015) of AGB data (1620 valid AGB measurements) on the Three-River Headwaters Region (TRHR), combined with remote sensing data, weather data, terrain data, and soil data, we compared the predictive performance of a linear statistical method, machine learning (ML) methods, and evaluated their temporal and spatial scalability. The results show that machine learning can predict grassland biomass well, and the existence of an independent validation set can help us better understand the prediction performance of the model. Our findings show the following: (1) The random forest (RF) based on variables obtained through stepwise regression analysis (SRA) was the best model (R-vad(2) = 0.60, RMSEvad = 1245.85 kg DW (dry matter weight)/ha, AIC = 5583.51, and BIC = 5631.10). It also had the best predictive capability of years with unknown areas (R-indep(2) = 0.50, RMSEindep = 1332.59 kg DW/ha). (2) Variable screening improved the accuracy of all of the models. (3) All models' predictive accuracy varied between 0.45 and 0.60, and the RMSE values were lower than 1457.26 kg DW/ha, indicating that the results were reliably accurate.

DOI:
10.3390/rs14163843

ISSN:
2072-4292