A generalised approach for identifying influential data in hydrological modelling. (January 2019)
- Record Type:
- Journal Article
- Title:
- A generalised approach for identifying influential data in hydrological modelling. (January 2019)
- Main Title:
- A generalised approach for identifying influential data in hydrological modelling
- Authors:
- Wright, David P.
Thyer, Mark
Westra, Seth
Renard, Benjamin
McInerney, David - Abstract:
- Abstract: Influence diagnostics are used to identify data points that have a disproportionate impact on model parameters, performance and/or predictions, providing valuable information for use in model calibration. Regression-theory influence diagnostics identify influential data by combining the leverage and the standardised residuals, and are computationally more efficient than case-deletion approaches. This study evaluates the performance of a range of regression-theory influence diagnostics on ten case studies with a variety of model structures and inference scenarios including: nonlinear model response, heteroscedastic residual errors, data uncertainty and Bayesian priors. A new technique is developed, generalised Cook's distance, that is able to accurately identify the same influential data as standard case deletion approaches (Spearman rank correlation: 0.93–1.00) at a fraction of the computational cost (<1%). This is because generalised Cook's distance uses a generalised leverage formulation which outperforms linear and nonlinear leverage formulations due to less restrictive assumptions. Generalised Cook's distance has the potential to enable influential data to be efficiently identified on a wide variety of hydrological and environmental modelling problems. Highlights: Influential data points have a disproportionate impact on model predictions. A new generalised Cook's distance accurately identifies influential data points. More efficient (<1% computational cost)Abstract: Influence diagnostics are used to identify data points that have a disproportionate impact on model parameters, performance and/or predictions, providing valuable information for use in model calibration. Regression-theory influence diagnostics identify influential data by combining the leverage and the standardised residuals, and are computationally more efficient than case-deletion approaches. This study evaluates the performance of a range of regression-theory influence diagnostics on ten case studies with a variety of model structures and inference scenarios including: nonlinear model response, heteroscedastic residual errors, data uncertainty and Bayesian priors. A new technique is developed, generalised Cook's distance, that is able to accurately identify the same influential data as standard case deletion approaches (Spearman rank correlation: 0.93–1.00) at a fraction of the computational cost (<1%). This is because generalised Cook's distance uses a generalised leverage formulation which outperforms linear and nonlinear leverage formulations due to less restrictive assumptions. Generalised Cook's distance has the potential to enable influential data to be efficiently identified on a wide variety of hydrological and environmental modelling problems. Highlights: Influential data points have a disproportionate impact on model predictions. A new generalised Cook's distance accurately identifies influential data points. More efficient (<1% computational cost) than standard case-deletion approaches. Applies to nonlinear regression and hydrological models with heteroscedastic errors. Can be used in a Bayesian framework with priors or data uncertainty. … (more)
- Is Part Of:
- Environmental modelling & software. Volume 111(2019)
- Journal:
- Environmental modelling & software
- Issue:
- Volume 111(2019)
- Issue Display:
- Volume 111, Issue 2019 (2019)
- Year:
- 2019
- Volume:
- 111
- Issue:
- 2019
- Issue Sort Value:
- 2019-0111-2019-0000
- Page Start:
- 231
- Page End:
- 247
- Publication Date:
- 2019-01
- Subjects:
- Hydrologic model calibration -- Influence diagnostics -- Cook's distance -- Generalised leverage
Environmental monitoring -- Computer programs -- Periodicals
Ecology -- Computer simulation -- Periodicals
Digital computer simulation -- Periodicals
Computer software -- Periodicals
Environmental Monitoring -- Periodicals
Computer Simulation -- Periodicals
Environnement -- Surveillance -- Logiciels -- Périodiques
Écologie -- Simulation, Méthodes de -- Périodiques
Simulation par ordinateur -- Périodiques
Logiciels -- Périodiques
Computer software
Digital computer simulation
Ecology -- Computer simulation
Environmental monitoring -- Computer programs
Periodicals
Electronic journals
363.70015118 - Journal URLs:
- http://www.sciencedirect.com/science/journal/13648152 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.envsoft.2018.03.004 ↗
- Languages:
- English
- ISSNs:
- 1364-8152
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3791.522800
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 21602.xml