Analyzing discrete competing risks data with partially overlapping or independent data sources and nonstandard sampling schemes, with application to cancer registries. (28th October 2019)
- Record Type:
- Journal Article
- Title:
- Analyzing discrete competing risks data with partially overlapping or independent data sources and nonstandard sampling schemes, with application to cancer registries. (28th October 2019)
- Main Title:
- Analyzing discrete competing risks data with partially overlapping or independent data sources and nonstandard sampling schemes, with application to cancer registries
- Authors:
- Lee, Minjung
Feuer, Eric J.
Wang, Zhuoqiao
Cho, Hyunsoon
Zou, Zhaohui
Hankey, Benjamin F.
Mariotto, Angela B.
Fine, Jason P. - Abstract:
- Abstract : This paper demonstrates the flexibility of a general approach for the analysis of discrete time competing risks data that can accommodate complex data structures, different time scales for different causes, and nonstandard sampling schemes. The data may involve a single data source where all individuals contribute to analyses of both cause‐specific hazard functions, overlapping datasets where some individuals contribute to the analysis of the cause‐specific hazard function of only one cause while other individuals contribute to analyses of both cause‐specific hazard functions, or separate data sources where each individual contributes to the analysis of the cause‐specific hazard function of only a single cause. The approach is modularized into estimation and prediction. For the estimation step, the parameters and the variance‐covariance matrix can be estimated using widely available software. The prediction step utilizes a generic program with plug‐in estimates from the estimation step. The approach is illustrated with three prognostic models for stage IV male oral cancer using different data structures. The first model uses only men with stage IV oral cancer from population‐based registry data. The second model strategically extends the cohort to improve the efficiency of the estimates. The third model improves the accuracy for those with a lower risk of other causes of death, by bringing in an independent data source collected under a complex sampling designAbstract : This paper demonstrates the flexibility of a general approach for the analysis of discrete time competing risks data that can accommodate complex data structures, different time scales for different causes, and nonstandard sampling schemes. The data may involve a single data source where all individuals contribute to analyses of both cause‐specific hazard functions, overlapping datasets where some individuals contribute to the analysis of the cause‐specific hazard function of only one cause while other individuals contribute to analyses of both cause‐specific hazard functions, or separate data sources where each individual contributes to the analysis of the cause‐specific hazard function of only a single cause. The approach is modularized into estimation and prediction. For the estimation step, the parameters and the variance‐covariance matrix can be estimated using widely available software. The prediction step utilizes a generic program with plug‐in estimates from the estimation step. The approach is illustrated with three prognostic models for stage IV male oral cancer using different data structures. The first model uses only men with stage IV oral cancer from population‐based registry data. The second model strategically extends the cohort to improve the efficiency of the estimates. The third model improves the accuracy for those with a lower risk of other causes of death, by bringing in an independent data source collected under a complex sampling design with additional other‐cause covariates. These analyses represent novel extensions of existing methodology, broadly applicable for the development of prognostic models capturing both the cancer and noncancer aspects of a patient's health. … (more)
- Is Part Of:
- Statistics in medicine. Volume 38:Number 29(2019)
- Journal:
- Statistics in medicine
- Issue:
- Volume 38:Number 29(2019)
- Issue Display:
- Volume 38, Issue 29 (2019)
- Year:
- 2019
- Volume:
- 38
- Issue:
- 29
- Issue Sort Value:
- 2019-0038-0029-0000
- Page Start:
- 5528
- Page End:
- 5546
- Publication Date:
- 2019-10-28
- Subjects:
- absolute risk prediction -- cause‐specific hazard function -- discrete time -- likelihood inference -- multiple time scales -- survey sampling
Medical statistics -- Periodicals
Statistique médicale -- Périodiques
Statistiques médicales -- Périodiques
610.727 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/sim.8381 ↗
- Languages:
- English
- ISSNs:
- 0277-6715
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 8453.576000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 12114.xml