Estimating hourly average indoor PM2.5 using the random forest approach in two megacities, China. (August 2020)
- Record Type:
- Journal Article
- Title:
- Estimating hourly average indoor PM2.5 using the random forest approach in two megacities, China. (August 2020)
- Main Title:
- Estimating hourly average indoor PM2.5 using the random forest approach in two megacities, China
- Authors:
- Xu, Chunyu
Xu, Dongqun
Liu, Zhe
Li, Yunpu
Li, Ning
Chartier, Ryan
Chang, Junrui
Wang, Qin
Wu, Yaxi
Li, Na - Abstract:
- Abstract: This study developed a predictive model for hourly indoor fine particulate matter (PM2.5 ) concentration based on the random forest regression (RFR) method and compared its performance with the traditional multiple linear regression (MLR) method. The concentrations of indoor and outdoor PM2.5 were monitored at a total of 66 apartments in Nanjing (NJ) and Beijing (BJ), China, during both the heating and non-heating seasons. In total, 14, 442 pairs of hourly indoor and outdoor PM2.5 were measured by light-scattering nephelometer, while potential influencing factors were obtained via questionnaires. Hourly indoor PM2.5 prediction were developed based on either the RFR or MLR method. A ten-fold cross-validation (10-fold CV) analysis was used to evaluate the predictive power of the models. The 10-fold CV results revealed the MLR models agree fairly well with the measured data, with coefficients of determination ( R 2 ) ranging from 0.70 (BJ) to 0.73 (NJ), while the root mean square error (RMSE) ranged from 28.0 μg/m 3 (NJ) to 28.2 μg/m 3 (BJ). Overall, the RFR models outperformed the reference MLR method as indicated by higher CV R 2 (0.82 in BJ and 0.78 in NJ, respectively) and lower CV RMSE (20.4 μg/m 3 in BJ and 24.3 μg/m 3 in NJ, respectively). Our results show that the RFR approach can exceed the predictive power of the classic MLR method and is a promising methodology for estimating indoor PM2.5 concentrations in Chinese megacities when direct PM2.5 measurementsAbstract: This study developed a predictive model for hourly indoor fine particulate matter (PM2.5 ) concentration based on the random forest regression (RFR) method and compared its performance with the traditional multiple linear regression (MLR) method. The concentrations of indoor and outdoor PM2.5 were monitored at a total of 66 apartments in Nanjing (NJ) and Beijing (BJ), China, during both the heating and non-heating seasons. In total, 14, 442 pairs of hourly indoor and outdoor PM2.5 were measured by light-scattering nephelometer, while potential influencing factors were obtained via questionnaires. Hourly indoor PM2.5 prediction were developed based on either the RFR or MLR method. A ten-fold cross-validation (10-fold CV) analysis was used to evaluate the predictive power of the models. The 10-fold CV results revealed the MLR models agree fairly well with the measured data, with coefficients of determination ( R 2 ) ranging from 0.70 (BJ) to 0.73 (NJ), while the root mean square error (RMSE) ranged from 28.0 μg/m 3 (NJ) to 28.2 μg/m 3 (BJ). Overall, the RFR models outperformed the reference MLR method as indicated by higher CV R 2 (0.82 in BJ and 0.78 in NJ, respectively) and lower CV RMSE (20.4 μg/m 3 in BJ and 24.3 μg/m 3 in NJ, respectively). Our results show that the RFR approach can exceed the predictive power of the classic MLR method and is a promising methodology for estimating indoor PM2.5 concentrations in Chinese megacities when direct PM2.5 measurements are not possible. Graphical abstract: Image 1 Highlights: High intraday variations of hourly indoor PM2.5 were detected. Random forest regression (RFR) was applied to modeling the hourly indoor PM2.5 . RFR performed better than the traditional multiple linear regression (MLR) model. The outdoor PM2.5 levels were the most important predictor of indoor PM2.5 . … (more)
- Is Part Of:
- Building and environment. Volume 180(2020)
- Journal:
- Building and environment
- Issue:
- Volume 180(2020)
- Issue Display:
- Volume 180, Issue 2020 (2020)
- Year:
- 2020
- Volume:
- 180
- Issue:
- 2020
- Issue Sort Value:
- 2020-0180-2020-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-08
- Subjects:
- Indoor air -- Fine particulate matter -- Prediction model -- Random forest regression -- Machine learning
PM2.5 fine particulate matter -- NHS non-heating season -- HS heating season -- AC air conditioning -- RFR random forest regression -- MLR multiple linear regression
Buildings -- Environmental engineering -- Periodicals
Building -- Research -- Periodicals
Constructions -- Technique de l'environnement -- Périodiques
Electronic journals
696 - Journal URLs:
- http://www.sciencedirect.com/science/journal/03601323 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.buildenv.2020.107025 ↗
- Languages:
- English
- ISSNs:
- 0360-1323
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2359.355000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 13926.xml