A machine learning-based approach for fusing measurements from standard sites, low-cost sensors, and satellite retrievals: Application to NO2 pollution hotspot identification. (1st June 2023)
- Record Type:
- Journal Article
- Title:
- A machine learning-based approach for fusing measurements from standard sites, low-cost sensors, and satellite retrievals: Application to NO2 pollution hotspot identification. (1st June 2023)
- Main Title:
- A machine learning-based approach for fusing measurements from standard sites, low-cost sensors, and satellite retrievals: Application to NO2 pollution hotspot identification
- Authors:
- Fu, Jianbo
Tang, Die
Grieneisen, Michael L.
Yang, Fumo
Yang, Jianzhao
Wu, Guanghui
Wang, Chunying
Zhan, Yu - Abstract:
- Abstract: While low-cost sensors (LCSs) and satellite retrievals are valuable supplements to regulatory air quality monitoring stations (AQMs), measurements from LCSs and satellite retrievals suffer from considerable bias and uncertainty. Here, we proposed a machine learning-based approach named the Fusion-Imputation-Gradient-Boosting-Machine (FI-GBM) model which fused the NO2 measurements from AQM, LCS, and the TROPOspheric Monitoring Instrument (TROPOMI) for mapping hourly ground-level NO2 at 1 km resolution. Based on the machine-learned relationships among AQM, LCS, TROPOMI measurements, and environmental covariates, the LCS and TROPOMI data were assimilated into AQM data. We selected Tangshan, an industrial city in North China, for the demonstration. The FI-GBM model showed high predictive performance in the sample-based cross-validation ( R 2 = 0.89). The R 2 values of the cell-, area-, and month-based cross-validations were 0.67, 0.59, and 0.64, respectively. Fusing LCS and TROPOMI data improved the predictive performance compared to the benchmark models using neither or only one of them. The FI-GBM model showed decent utilization of the strengths of TROPOMI and LCS in regional and local-scale monitoring, respectively. It is noteworthy that the FI-GBM model could automatically filter noisy samples from LCS data, which was critical for discriminating between true and false-positive pollution hotspots. This study provides a data-noise-reduction approach for fusingAbstract: While low-cost sensors (LCSs) and satellite retrievals are valuable supplements to regulatory air quality monitoring stations (AQMs), measurements from LCSs and satellite retrievals suffer from considerable bias and uncertainty. Here, we proposed a machine learning-based approach named the Fusion-Imputation-Gradient-Boosting-Machine (FI-GBM) model which fused the NO2 measurements from AQM, LCS, and the TROPOspheric Monitoring Instrument (TROPOMI) for mapping hourly ground-level NO2 at 1 km resolution. Based on the machine-learned relationships among AQM, LCS, TROPOMI measurements, and environmental covariates, the LCS and TROPOMI data were assimilated into AQM data. We selected Tangshan, an industrial city in North China, for the demonstration. The FI-GBM model showed high predictive performance in the sample-based cross-validation ( R 2 = 0.89). The R 2 values of the cell-, area-, and month-based cross-validations were 0.67, 0.59, and 0.64, respectively. Fusing LCS and TROPOMI data improved the predictive performance compared to the benchmark models using neither or only one of them. The FI-GBM model showed decent utilization of the strengths of TROPOMI and LCS in regional and local-scale monitoring, respectively. It is noteworthy that the FI-GBM model could automatically filter noisy samples from LCS data, which was critical for discriminating between true and false-positive pollution hotspots. This study provides a data-noise-reduction approach for fusing multisource measurements in order to identify pollution hotspots and trace pollutant sources, thereby promoting cleaner production. Graphical abstract: Image 1 Highlights: NO2 data from standard sites, satellite retrievals, and low-cost sensors were fused. Hourly NO2 was mapped at 1-km resolution for an industrial city in North China. Satellite retrievals mitigated overall estimation bias. The new algorithm can automatically screen noise in low-cost sensor data. Low-cost sensor data helped identify pollution hotspots in industrial areas. … (more)
- Is Part Of:
- Atmospheric environment. Volume 302(2023)
- Journal:
- Atmospheric environment
- Issue:
- Volume 302(2023)
- Issue Display:
- Volume 302, Issue 2023 (2023)
- Year:
- 2023
- Volume:
- 302
- Issue:
- 2023
- Issue Sort Value:
- 2023-0302-2023-0000
- Page Start:
- Page End:
- Publication Date:
- 2023-06-01
- Subjects:
- Data fusion -- Nitrogen dioxide -- Low-cost sensor -- Dense monitoring network -- TROPOMI -- Machine learning
Air -- Pollution -- Periodicals
Air -- Pollution -- Meteorological aspects -- Periodicals
551.51 - Journal URLs:
- http://www.sciencedirect.com/web-editions/journal/13522310 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.atmosenv.2023.119756 ↗
- Languages:
- English
- ISSNs:
- 1352-2310
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 1767.120000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 26845.xml