A practical framework for predicting residential indoor PM2.5 concentration using land-use regression and machine learning methods. (February 2021)
- Record Type:
- Journal Article
- Title:
- A practical framework for predicting residential indoor PM2.5 concentration using land-use regression and machine learning methods. (February 2021)
- Main Title:
- A practical framework for predicting residential indoor PM2.5 concentration using land-use regression and machine learning methods
- Authors:
- Li, Zhiyuan
Tong, Xinning
Ho, Jason Man Wai
Kwok, Timothy C.Y.
Dong, Guanghui
Ho, Kin-Fai
Yim, Steve Hung Lam - Abstract:
- Abstract: People typically spend most of their time indoors. It is of importance to establish prediction models to estimate PM2.5 concentration in indoor environments (e.g., residential households) to allow accurate assessments of exposure in epidemiological studies. This study aimed to develop models to predict PM2.5 concentration in residential households. PM2.5 concentration and related parameters (e.g., basic information about the households and ventilation settings) were collected in 116 households during the winter and summer seasons in Hong Kong. Outdoor PM2.5 concentration at households was estimated using a land-use regression model. The random forest machine learning algorithm was then applied to develop indoor PM2.5 prediction models. The results show that the random forest model achieved a promising predictive accuracy, with R 2 and cross-validation R 2 values of 0.93 and 0.65, respectively. Outdoor PM2.5 concentration was the most important predictor variable, followed in descending order by the household marked number, outdoor temperature, outdoor relative humidity, average household area and air conditioning. The external validation result using an independent dataset confirmed the potential application of the random forest model, with an R 2 value of 0.47. Overall, this study shows the value of a combined land-use regression and machine learning approach in establishing indoor PM2.5 prediction models that provide a relatively accurate assessment of exposureAbstract: People typically spend most of their time indoors. It is of importance to establish prediction models to estimate PM2.5 concentration in indoor environments (e.g., residential households) to allow accurate assessments of exposure in epidemiological studies. This study aimed to develop models to predict PM2.5 concentration in residential households. PM2.5 concentration and related parameters (e.g., basic information about the households and ventilation settings) were collected in 116 households during the winter and summer seasons in Hong Kong. Outdoor PM2.5 concentration at households was estimated using a land-use regression model. The random forest machine learning algorithm was then applied to develop indoor PM2.5 prediction models. The results show that the random forest model achieved a promising predictive accuracy, with R 2 and cross-validation R 2 values of 0.93 and 0.65, respectively. Outdoor PM2.5 concentration was the most important predictor variable, followed in descending order by the household marked number, outdoor temperature, outdoor relative humidity, average household area and air conditioning. The external validation result using an independent dataset confirmed the potential application of the random forest model, with an R 2 value of 0.47. Overall, this study shows the value of a combined land-use regression and machine learning approach in establishing indoor PM2.5 prediction models that provide a relatively accurate assessment of exposure for use in epidemiological studies. Graphical abstract: Image 1 Highlights: A practical indoor PM2.5 prediction framework was proposed. The combined land-use regression and machine learning approach was applied. External validation was proposed to evaluate the prediction model. The random forest model outperformed the linear mixed-effect regression model. The outdoor PM2.5 concentration is the most important predictor variable. … (more)
- Is Part Of:
- Chemosphere. Volume 265(2021)
- Journal:
- Chemosphere
- Issue:
- Volume 265(2021)
- Issue Display:
- Volume 265, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 265
- Issue:
- 2021
- Issue Sort Value:
- 2021-0265-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-02
- Subjects:
- Indoor air -- PM2.5 -- Households -- Prediction model -- Random forest
Pollution -- Periodicals
Pollution -- Physiological effect -- Periodicals
Environmental sciences -- Periodicals
Atmospheric chemistry -- Periodicals
551.511 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00456535/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.chemosphere.2020.129140 ↗
- Languages:
- English
- ISSNs:
- 0045-6535
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3172.280000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 15371.xml