Publications

Wang, YX; Yin, Y; Gao, BB; Zeng, YL; Zhao, YY; Chen, ZY; Feng, QL; Xu, H; Yang, JY (2025). A Novel Spatial Prediction Method Integrating Exploratory Spatial Data Analysis Into Random Forest for Large-Scale Daily Air Temperature Mapping. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 63, 3000318.

Abstract
Accurately predicting spatially continuous daily air temperature (Ta) is critical for agriculture, environmental management, and ecology. While meteorological stations provide precise Ta data, their spatial coverage is limited. Remotely sensed land surface temperature (LST), often fused with meteorological data, offers broader spatial coverage but struggles due to complex relationships between Ta and LST, influenced by factors like topography and human activities. Traditional supervised learning methods often fail to capture the spatial autocorrelation and heterogeneity inherent in the relationships, indicating the need for a more robust approach that integrates geographic knowledge. This study proposes the spatially varying coefficients random forest (SVCRF) model, to integrate exploratory spatial data analysis (ESDAs) into random forest (RF) to capture spatially nonstationary relationships. It first stratifies the study area based on bivariate Local Indicators of Spatial Association and geographical detector, then builds several spatial RFs with specific spatial positions and extent. In each spatial RF, the distance from observation/prediction sites to its position is added as a key predictor variable to model the local spatial variations of the relationships within the spatial extent. Applied to daily Ta mapping at 1 km resolution across China using data from 5425 meteorological stations, the SVCRF model demonstrated superior accuracy, achieving root-mean-squared error (RMSE) of 1.315 degrees C and mean absolute error (MAE) of 1.014 degrees C . Compared to RF, regression kriging (RK), and geographically weighted regression (GWR), it reduced MAE by 0.351 degrees C , 0.786 degrees C , and 0.831 degrees C , respectively. The model also offers high interpretability, with uncertainty estimates aligning with actual errors and spatially resolved variable importance highlighting spatial patterns.

DOI:
10.1109/TGRS.2025.3550573

ISSN:
1558-0644