Machine learning models inaccurately predict current and future high-latitude C balances. (1st January 2023)
- Record Type:
- Journal Article
- Title:
- Machine learning models inaccurately predict current and future high-latitude C balances. (1st January 2023)
- Main Title:
- Machine learning models inaccurately predict current and future high-latitude C balances
- Authors:
- Shirley, Ian A
Mekonnen, Zelalem A
Grant, Robert F
Dafflon, Baptiste
Riley, William J - Abstract:
- Abstract: The high-latitude carbon (C) cycle is a key feedback to the global climate system, yet because of system complexity and data limitations, there is currently disagreement over whether the region is a source or sink of C. Recent advances in big data analytics and computing power have popularized the use of machine learning (ML) algorithms to upscale site measurements of ecosystem processes, and in some cases forecast the response of these processes to climate change. Due to data limitations, however, ML model predictions of these processes are almost never validated with independent datasets. To better understand and characterize the limitations of these methods, we develop an approach to independently evaluate ML upscaling and forecasting. We mimic data-driven upscaling and forecasting efforts by applying ML algorithms to different subsets of regional process-model simulation gridcells, and then test ML performance using the remaining gridcells. In this study, we simulate C fluxes and environmental data across Alaska using ecosys, a process-rich terrestrial ecosystem model, and then apply boosted regression tree ML algorithms to training data configurations that mirror and expand upon existing AmeriFLUX eddy-covariance data availability. We first show that a ML model trained using ecosys outputs from currently-available Alaska AmeriFLUX sites incorrectly predicts that Alaska is presently a modeled net C source. Increased spatial coverage of the training datasetAbstract: The high-latitude carbon (C) cycle is a key feedback to the global climate system, yet because of system complexity and data limitations, there is currently disagreement over whether the region is a source or sink of C. Recent advances in big data analytics and computing power have popularized the use of machine learning (ML) algorithms to upscale site measurements of ecosystem processes, and in some cases forecast the response of these processes to climate change. Due to data limitations, however, ML model predictions of these processes are almost never validated with independent datasets. To better understand and characterize the limitations of these methods, we develop an approach to independently evaluate ML upscaling and forecasting. We mimic data-driven upscaling and forecasting efforts by applying ML algorithms to different subsets of regional process-model simulation gridcells, and then test ML performance using the remaining gridcells. In this study, we simulate C fluxes and environmental data across Alaska using ecosys, a process-rich terrestrial ecosystem model, and then apply boosted regression tree ML algorithms to training data configurations that mirror and expand upon existing AmeriFLUX eddy-covariance data availability. We first show that a ML model trained using ecosys outputs from currently-available Alaska AmeriFLUX sites incorrectly predicts that Alaska is presently a modeled net C source. Increased spatial coverage of the training dataset improves ML predictions, halving the bias when 240 modeled sites are used instead of 15. However, even this more accurate ML model incorrectly predicts Alaska C fluxes under 21st century climate change because of changes in atmospheric CO2, litter inputs, and vegetation composition that have impacts on C fluxes which cannot be inferred from the training data. Our results provide key insights to future C flux upscaling efforts and expose the potential for inaccurate ML upscaling and forecasting of high-latitude C cycle dynamics. … (more)
- Is Part Of:
- Environmental research letters. Volume 18:Number 1(2023)
- Journal:
- Environmental research letters
- Issue:
- Volume 18:Number 1(2023)
- Issue Display:
- Volume 18, Issue 1 (2023)
- Year:
- 2023
- Volume:
- 18
- Issue:
- 1
- Issue Sort Value:
- 2023-0018-0001-0000
- Page Start:
- Page End:
- Publication Date:
- 2023-01-01
- Subjects:
- machine learning -- upscaling -- forecasting -- independent evaluation -- carbon cycle -- high-latitudes -- climate change
Environmental sciences -- Periodicals
Human ecology -- Research -- Periodicals
Environmental health -- Periodicals
333.7 - Journal URLs:
- http://iopscience.iop.org/1748-9326 ↗
http://www.iop.org/EJ/toc/1748-9326 ↗
http://ioppublishing.org/ ↗ - DOI:
- 10.1088/1748-9326/acacb2 ↗
- Languages:
- English
- ISSNs:
- 1748-9326
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3791.592955
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 25639.xml