A weighted random forests approach to improve predictive performance. (8th July 2013)
- Record Type:
- Journal Article
- Title:
- A weighted random forests approach to improve predictive performance. (8th July 2013)
- Main Title:
- A weighted random forests approach to improve predictive performance
- Authors:
- Winham, Stacey J.
Freimuth, Robert R.
Biernacka, Joanna M. - Abstract:
- <abstract abstract-type="main" xml:lang="en"> <title>Abstract</title> <p>Identifying genetic variants associated with complex disease in high‐dimensional data is a challenging problem, and complicated etiologies such as gene–gene interactions are often ignored in analyses. The data‐mining method random forests (RF) can handle high dimensions; however, in high‐dimensional data, RF is not an effective filter for identifying risk factors associated with the disease trait via complex genetic models such as gene–gene interactions without strong marginal components. In this article we propose an extension called weighted random forests (wRF), which incorporates tree‐level weights to emphasize more accurate trees in prediction and calculation of variable importance. We demonstrate through simulation and application to data from a genetic study of addiction that wRF can outperform RF in high‐dimensional data, although the improvements are modest and limited to situations with effect sizes that are larger than what is realistic in genetics of complex disease. Thus, the current implementation of wRF is unlikely to improve detection of relevant predictors in high‐dimensional genetic data, but may be applicable in other situations where larger effect sizes are anticipated. © 2013 Wiley Periodicals, Inc. Statistical Analysis and Data Mining, 2013</p> </abstract>
- Is Part Of:
- Statistical analysis and data mining. Volume 6:Number 6(2013)
- Journal:
- Statistical analysis and data mining
- Issue:
- Volume 6:Number 6(2013)
- Issue Display:
- Volume 6, Issue 6 (2013)
- Year:
- 2013
- Volume:
- 6
- Issue:
- 6
- Issue Sort Value:
- 2013-0006-0006-0000
- Page Start:
- 496
- Page End:
- 505
- Publication Date:
- 2013-07-08
- Subjects:
- Data mining -- Statistical methods -- Periodicals
006.312 - Journal URLs:
- http://www3.interscience.wiley.com/journal/112701062/home ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1002/sam.11196 ↗
- Languages:
- English
- ISSNs:
- 1932-1864
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 8447.424100
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 3162.xml