Predicting monthly high-resolution PM2.5 concentrations with random forest model in the North China Plain. (November 2018)
- Record Type:
- Journal Article
- Title:
- Predicting monthly high-resolution PM2.5 concentrations with random forest model in the North China Plain. (November 2018)
- Main Title:
- Predicting monthly high-resolution PM2.5 concentrations with random forest model in the North China Plain
- Authors:
- Huang, Keyong
Xiao, Qingyang
Meng, Xia
Geng, Guannan
Wang, Yujie
Lyapustin, Alexei
Gu, Dongfeng
Liu, Yang - Abstract:
- Abstract: Exposure to fine particulate matter (PM2.5 ) remains a worldwide public health issue. However, epidemiological studies on the chronic health impacts of PM2.5 in the developing countries are hindered by the lack of monitoring data. Despite the recent development of using satellite remote sensing to predict ground-level PM2.5 concentrations in China, methods for generating reliable historical PM2.5 exposure, especially prior to the construction of PM2.5 monitoring network in 2013, are still very rare. In this study, a high-performance machine-learning model was developed directly at monthly level to estimate PM2.5 levels in North China Plain. We developed a random forest model using the latest Multi-angle implementation of atmospheric correction (MAIAC) aerosol optical depth (AOD), meteorological parameters, land cover and ground PM2.5 measurements from 2013 to 2015. A multiple imputation method was applied to fill the missing values of AOD. We used 10-fold cross-validation (CV) to evaluate model performance and a separate time period, January 2016 to December 2016, was used to validate our model's capability of predicting historical PM2.5 concentrations. The overall model CV R 2 and relative prediction error (RPE) were 0.88 and 18.7%, respectively. Validation results beyond the modeling period (2013–2015) shown that this model can accurately predict historical PM2.5 concentrations at the monthly (R 2 = 0.74, RPE = 27.6%), seasonal (R 2 = 0.78, RPE = 21.2%) andAbstract: Exposure to fine particulate matter (PM2.5 ) remains a worldwide public health issue. However, epidemiological studies on the chronic health impacts of PM2.5 in the developing countries are hindered by the lack of monitoring data. Despite the recent development of using satellite remote sensing to predict ground-level PM2.5 concentrations in China, methods for generating reliable historical PM2.5 exposure, especially prior to the construction of PM2.5 monitoring network in 2013, are still very rare. In this study, a high-performance machine-learning model was developed directly at monthly level to estimate PM2.5 levels in North China Plain. We developed a random forest model using the latest Multi-angle implementation of atmospheric correction (MAIAC) aerosol optical depth (AOD), meteorological parameters, land cover and ground PM2.5 measurements from 2013 to 2015. A multiple imputation method was applied to fill the missing values of AOD. We used 10-fold cross-validation (CV) to evaluate model performance and a separate time period, January 2016 to December 2016, was used to validate our model's capability of predicting historical PM2.5 concentrations. The overall model CV R 2 and relative prediction error (RPE) were 0.88 and 18.7%, respectively. Validation results beyond the modeling period (2013–2015) shown that this model can accurately predict historical PM2.5 concentrations at the monthly (R 2 = 0.74, RPE = 27.6%), seasonal (R 2 = 0.78, RPE = 21.2%) and annual (R 2 = 0.76, RPE = 16.9%) level. The annual mean predicted PM2.5 concentration from 2013 to 2016 in our study domain was 67.7 μg/m3 and Southern Hebei, Western Shandong and Northern Henan were the most polluted areas. Using this computationally efficient, monthly and high-resolution model, we can provide reliable historical PM2.5 concentrations for epidemiological studies on PM2.5 health effects in China. Graphical abstract: Image 1 Highlights: MAIAC AOD at 1-km resolution was used to predict PM2.5 levels in North China Plain. A high performance machine learning model was developed directly at monthly level. This model can predict historical PM2.5 with high accuracy at monthly, seasonal and annual level. The multiple imputation method substantially increased PM2.5 coverage to 100%. Abstract : Random forest model developed at monthly level using satellite data can be applied to estimate long-term PM2.5 concentrations in North China Plain. … (more)
- Is Part Of:
- Environmental pollution. Volume 242(2018)Part A
- Journal:
- Environmental pollution
- Issue:
- Volume 242(2018)Part A
- Issue Display:
- Volume 242, Issue 1 (2018)
- Year:
- 2018
- Volume:
- 242
- Issue:
- 1
- Issue Sort Value:
- 2018-0242-0001-0000
- Page Start:
- 675
- Page End:
- 683
- Publication Date:
- 2018-11
- Subjects:
- PM2.5 -- MAIAC AOD -- Machine learning -- North China Plain
Pollution -- Periodicals
Pollution -- Environmental aspects -- Periodicals
Environmental Pollution -- Periodicals
Pollution -- Périodiques
Pollution -- Aspect de l'environnement -- Périodiques
Pollution -- Effets physiologiques -- Périodiques
Pollution
Pollution -- Environmental aspects
Periodicals
Electronic journals
363.73 - Journal URLs:
- http://www.sciencedirect.com/science/journal/02697491 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.envpol.2018.07.016 ↗
- Languages:
- English
- ISSNs:
- 0269-7491
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3791.539000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 19281.xml