Random forest and variable importance rankings for correlated survival data, with applications to tooth loss. (December 2014)
- Record Type:
- Journal Article
- Title:
- Random forest and variable importance rankings for correlated survival data, with applications to tooth loss. (December 2014)
- Main Title:
- Random forest and variable importance rankings for correlated survival data, with applications to tooth loss
- Authors:
- Hallett, M.J.
Fan, J.J.
Su, X.G.
Levine, R.A.
Nunn, M.E. - Other Names:
- Lesaffre Emmanuel guest-editor.
Declerck Dominique guest-editor. - Abstract:
- Oral health is a significant issue for adults because of its relationship to quality of life, as well as systematic health and well being. Impaired oral health can lead to significant health problems, such as pain and infection. This article considers a tree-based method to assess tooth loss. In particular, a variable importance measure based on extremely randomized trees (Geurts et al ., 2006 ) is proposed for correlated survival data, and is applied to the VA Dental Longitudinal Study. This new variable importance method aims to remove the bias of the traditional random forest variable selection, which may favour input variables with more categories, as shown by Strobl et al . (2007 ). The multivariate exponential tree algorithm of Fan et al . (2009 ) is used to build trees, as it has superior prediction accuracy and computational efficiency compared to marginal and semiparametric frailty model-based trees (Nunn et al., 2011 ). Simulation studies for assessing various variable importance methods are presented. To limit the final number of meaningful prognostic groups, an amalgamation procedure is used to develop tooth prognostic groups from a forest of trees. The resulting prognosis rules and variable importance rankings may be used in clinical practice to increase tooth retention and establish rational treatment plans. By ranking the relative importance of various clinical and genetic factors for tooth loss, we are able to provide clinicians with critical information soOral health is a significant issue for adults because of its relationship to quality of life, as well as systematic health and well being. Impaired oral health can lead to significant health problems, such as pain and infection. This article considers a tree-based method to assess tooth loss. In particular, a variable importance measure based on extremely randomized trees (Geurts et al ., 2006 ) is proposed for correlated survival data, and is applied to the VA Dental Longitudinal Study. This new variable importance method aims to remove the bias of the traditional random forest variable selection, which may favour input variables with more categories, as shown by Strobl et al . (2007 ). The multivariate exponential tree algorithm of Fan et al . (2009 ) is used to build trees, as it has superior prediction accuracy and computational efficiency compared to marginal and semiparametric frailty model-based trees (Nunn et al., 2011 ). Simulation studies for assessing various variable importance methods are presented. To limit the final number of meaningful prognostic groups, an amalgamation procedure is used to develop tooth prognostic groups from a forest of trees. The resulting prognosis rules and variable importance rankings may be used in clinical practice to increase tooth retention and establish rational treatment plans. By ranking the relative importance of various clinical and genetic factors for tooth loss, we are able to provide clinicians with critical information so that they can develop and implement an effective treatment plan. … (more)
- Is Part Of:
- Statistical modelling. Volume 14:Number 6(2014)
- Journal:
- Statistical modelling
- Issue:
- Volume 14:Number 6(2014)
- Issue Display:
- Volume 14, Issue 6 (2014)
- Year:
- 2014
- Volume:
- 14
- Issue:
- 6
- Issue Sort Value:
- 2014-0014-0006-0000
- Page Start:
- 523
- Page End:
- 547
- Publication Date:
- 2014-12
- Subjects:
- Random forest -- variable importance -- correlated survival data -- prognostic rules -- dental applications -- VA Dental Longitudinal Study
Linear models (Statistics) -- Periodicals
Mathematical models -- Periodicals
Modèles linéaires (Statistique) -- Périodiques
Modèles mathématiques -- Périodiques
Modèle statistique
Modèle linéaire
Modélisation statistique
Périodique électronique (Descripteur de forme)
Ressource Internet (Descripteur de forme)
519.5011 - Journal URLs:
- http://www.uk.sagepub.com/home.nav ↗
http://firstsearch.oclc.org ↗
http://firstsearch.oclc.org/journal=1471-082x;screen=info;ECOIP ↗ - DOI:
- 10.1177/1471082X14535517 ↗
- Languages:
- English
- ISSNs:
- 1471-082X
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 6164.xml