Site-scale modeling of surface ozone in Northern Bavaria using machine learning algorithms, regional dynamic models, and a hybrid model. (1st January 2021)
- Record Type:
- Journal Article
- Title:
- Site-scale modeling of surface ozone in Northern Bavaria using machine learning algorithms, regional dynamic models, and a hybrid model. (1st January 2021)
- Main Title:
- Site-scale modeling of surface ozone in Northern Bavaria using machine learning algorithms, regional dynamic models, and a hybrid model
- Authors:
- Nabavi, Seyed Omid
Nölscher, Anke C.
Samimi, Cyrus
Thomas, Christoph
Haimberger, Leopold
Lüers, Johannes
Held, Andreas - Abstract:
- Abstract: Ozone (O3 ) is a harmful pollutant when present in the lowermost layer of the atmosphere. Therefore, the European Commission formulated directives to regulate O3 concentrations in near-surface air. However, almost 50% of the 5068 air quality stations in Europe do not monitor O3 concentrations. This study aims to provide a hybrid modeling system that fills these gaps in the hourly surface O3 observations on a site scale with much higher accuracy than existing O3 models. This hybrid model was developed using estimations from multiple linear regression-based eXtreme Gradient Boosting Machines (MLR-XGBM) and O3 reanalysis from European regional air quality models (CAMS-EU). The binary classification of extremely high O3 events and the 1- and 24-h forecasts of hourly O3 were investigated as secondary aims. In this study thirteen stations in Northern Bavaria, out of which six do not monitor O3, were chosen as test sites. Considering the computational complexity of machine learning algorithms (MLAs), we also applied two recent MLA interpretation methods, namely SHapley Additive exPlanations (SHAP) and Local interpretable model-agnostic explanations (LIME). With SHAP, we showed an increasing effect of temperature on O3 concentrations which intensifies for temperatures exceeding 17 °C. According to LIME, O3 concentration peaks are mainly governed by meteorological factors under dry and warm conditions on a regional scale, whereas local nitrogen oxide concentrations controlAbstract: Ozone (O3 ) is a harmful pollutant when present in the lowermost layer of the atmosphere. Therefore, the European Commission formulated directives to regulate O3 concentrations in near-surface air. However, almost 50% of the 5068 air quality stations in Europe do not monitor O3 concentrations. This study aims to provide a hybrid modeling system that fills these gaps in the hourly surface O3 observations on a site scale with much higher accuracy than existing O3 models. This hybrid model was developed using estimations from multiple linear regression-based eXtreme Gradient Boosting Machines (MLR-XGBM) and O3 reanalysis from European regional air quality models (CAMS-EU). The binary classification of extremely high O3 events and the 1- and 24-h forecasts of hourly O3 were investigated as secondary aims. In this study thirteen stations in Northern Bavaria, out of which six do not monitor O3, were chosen as test sites. Considering the computational complexity of machine learning algorithms (MLAs), we also applied two recent MLA interpretation methods, namely SHapley Additive exPlanations (SHAP) and Local interpretable model-agnostic explanations (LIME). With SHAP, we showed an increasing effect of temperature on O3 concentrations which intensifies for temperatures exceeding 17 °C. According to LIME, O3 concentration peaks are mainly governed by meteorological factors under dry and warm conditions on a regional scale, whereas local nitrogen oxide concentrations control base O3 concentrations during cold and wet periods. While recently developed MLAs for the spatial estimation of hourly O3 concentrations had a station-based root-mean-square error (RMSE) above 27 μg/m 3, our proposed model significantly reduced the estimation errors by about 66% with an RMSE of 9.49 μg/m 3 . We also found that logistic regression (LR) and MLR-XGBM performed best in the site-scale classification and 24-h forecast of O3 concentrations (with a station-averaged accuracy and RMSE of 0.95 and 19.34 μg/m 3, respectively). Graphical abstract: Image 1 Highlights: Hourly ozone can be reconstructed using observations in the surrounding stations. Compared to existing models, our model yielded a threefold reduction of errors. High ozone levels (>70 μg/m 3 ) are mainly governed by meteorological factors. Ozone formation intensifies when temperature values are above 17 °C. … (more)
- Is Part Of:
- Environmental pollution. Volume 268(2021)Part A
- Journal:
- Environmental pollution
- Issue:
- Volume 268(2021)Part A
- Issue Display:
- Volume 268, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 268
- Issue:
- 2021
- Issue Sort Value:
- 2021-0268-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-01-01
- Subjects:
- Downscaling -- Surface ozone -- Ensemble learning -- Simulation interpretability
Pollution -- Periodicals
Pollution -- Environmental aspects -- Periodicals
Environmental Pollution -- Periodicals
Pollution -- Périodiques
Pollution -- Aspect de l'environnement -- Périodiques
Pollution -- Effets physiologiques -- Périodiques
Pollution
Pollution -- Environmental aspects
Periodicals
Electronic journals
363.73 - Journal URLs:
- http://www.sciencedirect.com/science/journal/02697491 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.envpol.2020.115736 ↗
- Languages:
- English
- ISSNs:
- 0269-7491
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3791.539000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 15190.xml