A strategy for modelling heavy-tailed greenhouse gases (GHG) data using the generalised extreme value distribution: Are we overestimating GHG flux using the sample mean?. (15th September 2020)
- Record Type:
- Journal Article
- Title:
- A strategy for modelling heavy-tailed greenhouse gases (GHG) data using the generalised extreme value distribution: Are we overestimating GHG flux using the sample mean?. (15th September 2020)
- Main Title:
- A strategy for modelling heavy-tailed greenhouse gases (GHG) data using the generalised extreme value distribution: Are we overestimating GHG flux using the sample mean?
- Authors:
- Dhanoa, M.S.
Louro, A.
Cardenas, L.M.
Shepherd, A.
Sanderson, R.
López, S.
France, J. - Abstract:
- Abstract: In this study, we draw up a strategy for analysis of greenhouse gas (GHG) field data. The distribution of GHG flux data generally exhibits excessive skewness and kurtosis. This results in a heavy tailed distribution that is much longer than the tail of a log-normal distribution or outlier induced skewness. The generalised extreme value (GEV) distribution is well-suited to model such data. We evaluated GEV as a model for the analysis and a means of extraction of a robust average of carbon dioxide (CO2 ) and nitrous oxide (N2 O) flux data measured in an agricultural field. The option of transforming CO2 flux data to the Box-Cox scale in order to make the distribution normal was also investigated. The results showed that average CO2 estimates from GEV are less affected by data in the long tail compared to the sample mean. The data for N2 O flux were much more complex than CO2 flux data due to the presence of negative fluxes. The estimate of the average value from GEV was much more consistent with maximum data frequency position. The analysis of GEV, which considers the effects of hot-spot-like observations, suggests that sample means and log-means may overestimate GHG fluxes from agricultural fields. In this study, the arithmetic CO2 sample mean of 65.6 (mean log-scale 65.9) kg CO2 –C ha −1 d −1 was reduced to GEV mean of 60.1 kg CO2 –C ha −1 d −1 . The arithmetic N2 O sample mean of 1.038 (mean log-scale 1.038) kg N2 O–N ha −1 d −1 was substantially reduced to GEVAbstract: In this study, we draw up a strategy for analysis of greenhouse gas (GHG) field data. The distribution of GHG flux data generally exhibits excessive skewness and kurtosis. This results in a heavy tailed distribution that is much longer than the tail of a log-normal distribution or outlier induced skewness. The generalised extreme value (GEV) distribution is well-suited to model such data. We evaluated GEV as a model for the analysis and a means of extraction of a robust average of carbon dioxide (CO2 ) and nitrous oxide (N2 O) flux data measured in an agricultural field. The option of transforming CO2 flux data to the Box-Cox scale in order to make the distribution normal was also investigated. The results showed that average CO2 estimates from GEV are less affected by data in the long tail compared to the sample mean. The data for N2 O flux were much more complex than CO2 flux data due to the presence of negative fluxes. The estimate of the average value from GEV was much more consistent with maximum data frequency position. The analysis of GEV, which considers the effects of hot-spot-like observations, suggests that sample means and log-means may overestimate GHG fluxes from agricultural fields. In this study, the arithmetic CO2 sample mean of 65.6 (mean log-scale 65.9) kg CO2 –C ha −1 d −1 was reduced to GEV mean of 60.1 kg CO2 –C ha −1 d −1 . The arithmetic N2 O sample mean of 1.038 (mean log-scale 1.038) kg N2 O–N ha −1 d −1 was substantially reduced to GEV mean of 0.0157 kg N2 O–N ha −1 d −1 . Our analysis suggests that GHG data should be analysed assuming a GEV distribution of the data, including a Box-Cox transformation when negative data are observed, rather than only calculating basic log and log-normal summaries. Results of GHG studies may end up in national inventories. Thus, it is necessary and important to follow all procedures that contribute to minimise any bias in the data. Highlights: Using sample means may be overestimating GHG fluxes. GEV solves excessive skewness and kurtosis of greenhouse gas flux data. Strategy of options for analysing GHG data rather than black-box approach. CO2 estimates from GEV less affected by data in the long tail than sample mean. CO2 estimates from Box-Cox are more affected by long-tail data than from GEV. … (more)
- Is Part Of:
- Atmospheric environment. Volume 237(2020)
- Journal:
- Atmospheric environment
- Issue:
- Volume 237(2020)
- Issue Display:
- Volume 237, Issue 2020 (2020)
- Year:
- 2020
- Volume:
- 237
- Issue:
- 2020
- Issue Sort Value:
- 2020-0237-2020-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-09-15
- Subjects:
- Nitrous oxide -- Carbon dioxide -- Generalised extreme value -- Finney correction -- Heavy-tailed data -- Skewness correction
Air -- Pollution -- Periodicals
Air -- Pollution -- Meteorological aspects -- Periodicals
551.51 - Journal URLs:
- http://www.sciencedirect.com/web-editions/journal/13522310 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.atmosenv.2020.117500 ↗
- Languages:
- English
- ISSNs:
- 1352-2310
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 1767.120000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 18806.xml