High dimensional variable selection with clustered data: an application of random multivariate survival forests for detection of outlier medical device components. Issue 8 (24th May 2019)
- Record Type:
- Journal Article
- Title:
- High dimensional variable selection with clustered data: an application of random multivariate survival forests for detection of outlier medical device components. Issue 8 (24th May 2019)
- Main Title:
- High dimensional variable selection with clustered data: an application of random multivariate survival forests for detection of outlier medical device components
- Authors:
- Cafri, Guy
Calhoun, Peter
Fan, Juanjuan
- Abstract:
- ABSTRACT: In many medical studies, patients are nested or clustered within doctors. With many explanatory variables, variable selection with clustered data can be challenging. We propose a method for variable selection based on random forests that addresses clustered data through stratified binary splits. Our motivating example involves the detection of orthopedic device components from a large pool of candidates, where each patient belongs to a surgeon. Simulations compare the performance of survival forests grown using the stratified logrank statistic with that of forests grown using conventional and robust logrank statistics, and also evaluate a method for selecting variables using a threshold value based on a variable's empirical null distribution. The stratified logrank test outperforms conventional and robust methods when data are generated to have cluster-specific effects and, when cluster sizes are sufficiently large, performs comparably to the splitting alternatives in the absence of cluster-specific effects. Thresholding was effective at distinguishing between important and unimportant variables.
- Is Part Of:
- Journal of statistical computation and simulation. Volume 89:Issue 8(2019)
- Journal:
- Journal of statistical computation and simulation
- Issue:
- Volume 89:Issue 8(2019)
- Issue Display:
- Volume 89, Issue 8 (2019)
- Year:
- 2019
- Volume:
- 89
- Issue:
- 8
- Issue Sort Value:
- 2019-0089-0008-0000
- Page Start:
- 1410
- Page End:
- 1422
- Publication Date:
- 2019-05-24
- Subjects:
- Medical devices -- multivariate -- random forest -- stratification -- survival
Mathematical statistics -- Data processing -- Periodicals
Digital computer simulation -- Periodicals
519.5028505
- Journal URLs:
- http://www.tandfonline.com/loi/gscs20
http://www.tandfonline.com/
- DOI:
- 10.1080/00949655.2019.1584198
- Languages:
- English
- ISSNs:
- 0094-9655
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 5066.820000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 9680.xml
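
The abstract refers to choosing binary splits with a stratified logrank statistic, where the strata are clusters such as surgeons. This record does not include the authors' implementation, so what follows is only a minimal sketch of the textbook stratified logrank statistic for one candidate split. The function name `stratified_logrank` and its arguments are hypothetical, chosen for illustration: observed-minus-expected event counts and hypergeometric variances are computed within each stratum, summed across strata, and combined into a single chi-square-type score.

```python
import numpy as np

def stratified_logrank(time, event, group, stratum):
    """Stratified logrank statistic for a binary split (illustrative sketch).

    time    : follow-up times
    event   : 1 if the event was observed, 0 if censored
    group   : 1 if the observation falls in the candidate left node, else 0
    stratum : cluster label (for example, a surgeon identifier)
    """
    time = np.asarray(time, dtype=float)
    event = np.asarray(event, dtype=bool)
    group = np.asarray(group, dtype=bool)
    stratum = np.asarray(stratum)

    o_minus_e = 0.0  # observed minus expected events in group 1, over strata
    var = 0.0        # summed hypergeometric variances

    for s in np.unique(stratum):
        in_s = stratum == s
        t_s, e_s, g_s = time[in_s], event[in_s], group[in_s]
        # Loop over the distinct event times inside this stratum only.
        for t in np.unique(t_s[e_s]):
            at_risk = t_s >= t
            n = at_risk.sum()              # at risk in the stratum
            n1 = (at_risk & g_s).sum()     # at risk in group 1
            died = (t_s == t) & e_s
            d = died.sum()                 # events at time t
            d1 = (died & g_s).sum()        # events at time t in group 1
            o_minus_e += d1 - d * n1 / n
            if n > 1:
                var += d * (n1 / n) * (1 - n1 / n) * (n - d) / (n - 1)

    # With zero variance the split carries no within-stratum information.
    return o_minus_e ** 2 / var if var > 0 else 0.0
```

Because every comparison is made within a stratum, a candidate variable that only separates one cluster from another contributes nothing to the score. In a forest, a larger value would indicate a better candidate split; how the authors handle ties, small clusters, or the robust alternative is not stated in this record.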