P65 A new index to assess the impact of collinearity in epidemiological research. (16th November 2010)
- Record Type:
- Journal Article
- Title:
- P65 A new index to assess the impact of collinearity in epidemiological research. (16th November 2010)
- Main Title:
- P65 A new index to assess the impact of collinearity in epidemiological research
- Authors:
- Woolston, A
Tu, Y-K
Baxter, P D
Gilthorpe, M S
- Abstract:
- Background: The problem of collinearity due to high correlations between explanatory variables in multiple regression is often overlooked in epidemiological research. The assumption that covariates are independent implies that all pair-wise covariate associations should be negligible, an unlikely scenario for biological and epidemiological data. Small but significant departures from the assumption of independence can severely distort the interpretation of a model and the role of each covariate. If the relative impact of collinearity on the estimates is not understood, these effects can obscure the conclusions of the study.
Methods: The impact of collinearity must be assessed in relation to the model environment. Factors such as the relation of the response to the predictors, the sample size and the variation of the covariates each have the potential to exacerbate or relieve the symptoms of collinearity. We present a novel approach to assessing the overall uncertainty in the model estimates, which adjusts in relation to these factors. The index will aid the researcher in deciding whether a result is of biological relevance or a consequence of the uncertainty generated by collinearity.
Results: We consider data from a paper by Lipkin (1988) in the American Journal of Clinical Nutrition. The study examines the role of factors associated with substantial calciuresis. A hypothetical model is proposed involving measures of calcium and protein in the diet, two highly correlated predictors. Both produce positive coefficients when entered individually, but the sign of the coefficient for dietary protein becomes negative when both are entered simultaneously. The variance inflation factor (VIF) of 4.51 suggests that the collinearity is not considerable (Belsley, 1991). However, when the VIF is adjusted using the model R², the impact appears more substantial than first thought. We propose an alternative diagnostic that utilises the additional influences as a basis to assess the impact of collinearity on the model estimates.
Conclusions: The results of significance testing for collinear variables within multiple regression should not be the only criterion by which we judge whether collinearity is a problem. The role of collinearity must be carefully assessed and understood using an appropriate index. Measuring the impact of collinearity using overly simplistic diagnostics, such as the VIF, may lure a researcher into false assurance about the results. Conversely, a model consisting of highly collinear predictors may be relatively unaffected when considered in relation to other factors in the model.
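The abstract does not define the proposed index, so the following is only a minimal sketch of the standard quantities it refers to, not the authors' diagnostic. It assumes the two-predictor case, where the VIF equals 1 / (1 - r²) for predictor correlation r, and uses the textbook OLS result that the standard error of a standardised coefficient is sqrt(VIF × (1 - R²) / (n - k - 1)). The function names and the R² and sample-size values in the loop are hypothetical, chosen for illustration.

```python
import math


def vif_from_correlation(r):
    """VIF for a model with exactly two predictors whose correlation is r."""
    return 1.0 / (1.0 - r ** 2)


def correlation_from_vif(vif):
    """Absolute predictor correlation implied by a VIF (two-predictor case)."""
    return math.sqrt(1.0 - 1.0 / vif)


def standardized_se(vif, r2_model, n, k):
    """Textbook OLS standard error of a standardised coefficient.

    vif      -- variance inflation factor of the predictor
    r2_model -- R^2 of the full model
    n        -- sample size
    k        -- number of predictors
    """
    return math.sqrt(vif * (1.0 - r2_model) / (n - k - 1))


# The VIF of 4.51 quoted in the abstract, read as a two-predictor VIF.
implied_r = correlation_from_vif(4.51)
print("implied |r| between predictors:", round(implied_r, 3))

# Hypothetical model R^2 and sample sizes (not taken from the abstract):
# the same VIF leads to very different coefficient uncertainty.
for r2_model, n in [(0.2, 30), (0.8, 30), (0.2, 300)]:
    se = standardized_se(4.51, r2_model, n, k=2)
    print("R2 =", r2_model, "n =", n, "SE =", round(se, 3))
```

Under these assumptions a VIF of 4.51 corresponds to a predictor correlation of roughly 0.88, and the loop shows why a VIF read in isolation can mislead: the resulting uncertainty shrinks as the model R² or the sample size grows.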
- Is Part Of:
- Journal of epidemiology and community health. Volume 64(2010)Supplement 1
- Journal:
- Journal of epidemiology and community health
- Issue:
- Volume 64(2010)Supplement 1
- Issue Display:
- Volume 64, Issue 1 (2010)
- Year:
- 2010
- Volume:
- 64
- Issue:
- 1
- Issue Sort Value:
- 2010-0064-0001-0000
- Page Start:
- A59
- Page End:
- A59
- Publication Date:
- 2010-11-16
- Subjects:
- Public health -- Periodicals
Epidemiology -- Periodicals
614.4
- Journal URLs:
- http://jech.bmj.com/
http://www.jstor.org/journals/0143005X.html
http://www.pubmedcentral.nih.gov/tocrender.fcgi?journal=165&action=archive
http://www.bmj.com/archive
- DOI:
- 10.1136/jech.2010.120477.65
- Languages:
- English
- ISSNs:
- 0143-005X
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 18758.xml