Estimation of surface-level NO2 and O3 concentrations using TROPOMI data and machine learning over East Asia
2021
Kang, Yoojin | Choi, Hyunyoung | Im, Jungho | Park, Seohui | Shin, Minso | Song, Chang-Keun | Kim, Sangmin
In East Asia, air quality is recognized as an important public health problem, and the surface concentrations of air pollutants in particular are closely related to human life. This study aims to develop models for estimating surface concentrations of NO₂ and O₃ at high spatial resolution from TROPOspheric Monitoring Instrument (TROPOMI) data over East Asia. Machine learning was used to fuse various satellite-based variables, numerical model-based meteorological variables, and land-use variables. Four machine learning approaches, namely Support Vector Regression (SVR), Random Forest (RF), eXtreme Gradient Boosting (XGB), and Light Gradient Boosting Machine (LGBM), were evaluated and compared with Multiple Linear Regression (MLR) as a baseline statistical method. In addition to the land model (scheme 1), this study modeled NO₂ and O₃ concentrations over the ocean surface with an ocean model (scheme 2). The estimated surface concentrations were validated through three cross-validation approaches (random, temporal, and spatial). For scheme 1, the NO₂ model produced an R² of 0.63–0.70 and a normalized root-mean-square error (nRMSE) of 38.3–42.2%, and the O₃ model produced an R² of 0.65–0.78 and an nRMSE of 19.6–24.7%. Indirect validation of scheme 2, based on stations near the coastline, showed a slight decrease (~0.3–2.4%) in nRMSE compared with scheme 1. The contributions of the input variables to the models were analyzed using SHapley Additive exPlanations (SHAP) values. Among the TROPOMI-derived variables, the NO₂ vertical column density showed the largest contribution in both the NO₂ and O₃ models.
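The validation design described in the abstract (random, temporal, and spatial cross-validation, scored with R² and nRMSE) can be illustrated with a minimal sketch. This is not the authors' code. It assumes that nRMSE is the RMSE divided by the mean of the observations, that temporal folds hold out whole dates, and that spatial folds hold out whole stations. The record keys `station` and `date` and the function names are hypothetical.

```python
import math
import random


def r_squared(observed, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


def nrmse_percent(observed, predicted):
    """RMSE normalized by the observed mean, in percent.

    Assumption: the abstract does not state the normalization, so the
    observed mean is used here as one common choice.
    """
    mean_obs = sum(observed) / len(observed)
    mse = sum((o - p) ** 2 for o, p in zip(observed, predicted)) / len(observed)
    return 100.0 * math.sqrt(mse) / mean_obs


def assign_folds(samples, mode, k=5, seed=0):
    """Return one fold index (0..k-1) per sample.

    samples: list of dicts with hypothetical keys 'station' and 'date'.
    mode:    'random'   -> each sample is assigned independently,
             'temporal' -> all samples of a date share a fold,
             'spatial'  -> all samples of a station share a fold.
    """
    if mode == "random":
        keys = list(range(len(samples)))
    elif mode == "temporal":
        keys = [s["date"] for s in samples]
    elif mode == "spatial":
        keys = [s["station"] for s in samples]
    else:
        raise ValueError(f"unknown mode: {mode}")

    # Shuffle the unique groups, then deal them out to folds round-robin.
    groups = sorted(set(keys))
    random.Random(seed).shuffle(groups)
    fold_of_group = {g: i % k for i, g in enumerate(groups)}
    return [fold_of_group[key] for key in keys]
```

Grouping by station or date is what separates the spatial and temporal schemes from the random one: a held-out fold then contains only stations (or days) the model never saw during training, which tests generalization to unmonitored locations or periods rather than interpolation between nearby samples.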
Bibliographic information
This bibliographic record has been provided by the National Agricultural Library.