Publications

Zhao, FH; Xia, JW; Zhu, L; Sun, HF; Zhao, DX (2023). Retrieval of Volcanic Ash Cloud Base Height Using Machine Learning Algorithms. ATMOSPHERE, 14(2), 228.

Abstract
There are distinct differences between radiation characteristics of volcanic ash and meteorological clouds, and conventional retrieval methods for cloud base height (CBH) of the latter are difficult to apply to volcanic ash without substantial parameterisation and model correction. Furthermore, existing CBH inversion methods have limitations, including the involvement of many empirical formulae and a dependence on the accuracy of upstream cloud products. A machine learning (ML) method was developed for the retrieval of volcanic ash cloud base height (VBH) to reduce uncertainties in physical CBH retrieval methods. This new methodology takes advantage of polar-orbit active remote-sensing data from the Cloud-Aerosol Lidar with Orthogonal Polarization (CALIOP), from vertical profile information and from geostationary passive remote-sensing measurements from the Spinning Enhanced Visible and Infrared Imager (SEVIRI) and the Advanced Geostationary Radiation Imager (AGRI) aboard the Meteosat Second Generation (MSG) and FengYun-4B (FY-4B) satellites, respectively. The methodology involves a statistics-based algorithm with hybrid use of principal component analysis (PCA) and one of four ML algorithms including the k-nearest neighbour (KNN), extreme gradient boosting (XGBoost), random forest (RF), and gradient boosting decision tree (GBDT) methods. Eruptions of the Eyjafjallajokull volcano (Iceland) during April-May 2010, the Puyehue-Cordon Caulle volcanic complex (Chilean Andes) in June 2011, and the Hunga Tonga-Hunga Ha'apai volcano (Tonga) in January 2022 were selected as typical cases for the construction of the training and validation sample sets. We demonstrate that a combination of PCA and GBDT performs more accurately than other combinations, with a mean absolute error (MAE) of 1.152 km, a root mean square error (RMSE) of 1.529 km, and a Pearson's correlation coefficient (r) of 0.724. Use of PCA as an additional process before training reduces feature relevance between input predictors and improves algorithm accuracy. Although the ML algorithm performs well under relatively simple single-layer volcanic ash cloud conditions, it tends to overestimate VBH in multi-layer conditions, which is an unresolved problem in meteorological CBH retrieval.

DOI:
10.3390/atmos14020228

ISSN:
2073-4433