Das, A; Sahu, M (2024). Leveraging Satellite Data for Predicting PM10 Concentration with Machine Learning Models: A Study in the Plains of North Bengal, India. AEROSOL AND AIR QUALITY RESEARCH, 24(12), 240066.
Abstract
The current air quality monitoring network is sparse and economically impractical in remote areas. Remote sensing offers an effective solution, providing real-time observations with high spatial and temporal resolution. This study aimed to estimate PM10 concentrations in Siliguri City, West Bengal, from 2019 to 2022, using Aerosol Optical Depth (AOD) at a 10 x 10 km spatial resolution. During the study period, the average PM10 level was 141.89 mu g m(-)(3), surpassing India's National Ambient Air Quality Standards (NAAQS). Five different machine learning regression models, namely linear regression (LR), Support Vector Regression (SVR), Random Forest (RF), Gradient Boosting (GB), and Extreme Gradient Boosting (XGB), were employed and evaluated using Root Mean Squared Error (RMSE) and Mean Absolute Error (MAE) along with R-2 for predicting the daily ground-level PM10 concentration using AOD, land cover data, and meteorological parameters. Through statistical testing, it was determined that dew point temperature, precipitation, and Normalized Difference Vegetation Index (NDVI) were statistically significant (p < 0.05) with AOD. Tree-based regression models, particularly RF, outperformed other models, achieving an R-2 value of 0.83 and RMSE of 25.51
DOI:
10.4209/aaqr.240066
ISSN:
1680-8584