Empirical extensions of the lasso penalty to reduce the false discovery rate in high‐dimensional Cox regression models. (10th March 2016)
- Record Type:
- Journal Article
- Title:
- Empirical extensions of the lasso penalty to reduce the false discovery rate in high‐dimensional Cox regression models. (10th March 2016)
- Main Title:
- Empirical extensions of the lasso penalty to reduce the false discovery rate in high‐dimensional Cox regression models
- Authors:
- Ternès, Nils
Rotolo, Federico
Michiels, Stefan - Abstract:
- Abstract : Correct selection of prognostic biomarkers among multiple candidates is becoming increasingly challenging as the dimensionality of biological data becomes higher. Therefore, minimizing the false discovery rate (FDR) is of primary importance, while a low false negative rate (FNR) is a complementary measure. The lasso is a popular selection method in Cox regression, but its results depend heavily on the penalty parameter λ. Usually, λ is chosen using maximum cross‐validated log‐likelihood (max‐ cvl ). However, this method has often a very high FDR. We review methods for a more conservative choice of λ. We propose an empirical extension of the cvl by adding a penalization term, which trades off between the goodness‐of‐fit and the parsimony of the model, leading to the selection of fewer biomarkers and, as we show, to the reduction of the FDR without large increase in FNR. We conducted a simulation study considering null and moderately sparse alternative scenarios and compared our approach with the standard lasso and 10 other competitors: Akaike information criterion (AIC), corrected AIC, Bayesian information criterion (BIC), extended BIC, Hannan and Quinn information criterion (HQIC), risk information criterion (RIC), one‐standard‐error rule, adaptive lasso, stability selection, and percentile lasso. Our extension achieved the best compromise across all the scenarios between a reduction of the FDR and a limited raise of the FNR, followed by the AIC, the RIC, and theAbstract : Correct selection of prognostic biomarkers among multiple candidates is becoming increasingly challenging as the dimensionality of biological data becomes higher. Therefore, minimizing the false discovery rate (FDR) is of primary importance, while a low false negative rate (FNR) is a complementary measure. The lasso is a popular selection method in Cox regression, but its results depend heavily on the penalty parameter λ. Usually, λ is chosen using maximum cross‐validated log‐likelihood (max‐ cvl ). However, this method has often a very high FDR. We review methods for a more conservative choice of λ. We propose an empirical extension of the cvl by adding a penalization term, which trades off between the goodness‐of‐fit and the parsimony of the model, leading to the selection of fewer biomarkers and, as we show, to the reduction of the FDR without large increase in FNR. We conducted a simulation study considering null and moderately sparse alternative scenarios and compared our approach with the standard lasso and 10 other competitors: Akaike information criterion (AIC), corrected AIC, Bayesian information criterion (BIC), extended BIC, Hannan and Quinn information criterion (HQIC), risk information criterion (RIC), one‐standard‐error rule, adaptive lasso, stability selection, and percentile lasso. Our extension achieved the best compromise across all the scenarios between a reduction of the FDR and a limited raise of the FNR, followed by the AIC, the RIC, and the adaptive lasso, which performed well in some settings. We illustrate the methods using gene expression data of 523 breast cancer patients. In conclusion, we propose to apply our extension to the lasso whenever a stringent FDR with a limited FNR is targeted. Copyright © 2016 John Wiley & Sons, Ltd. … (more)
- Is Part Of:
- Statistics in medicine. Volume 35:Number 15(2016)
- Journal:
- Statistics in medicine
- Issue:
- Volume 35:Number 15(2016)
- Issue Display:
- Volume 35, Issue 15 (2016)
- Year:
- 2016
- Volume:
- 35
- Issue:
- 15
- Issue Sort Value:
- 2016-0035-0015-0000
- Page Start:
- 2561
- Page End:
- 2573
- Publication Date:
- 2016-03-10
- Subjects:
- lasso penalty -- variable selection -- Cox model -- prognostic biomarkers -- false discovery rate
Medical statistics -- Periodicals
Statistique médicale -- Périodiques
Statistiques médicales -- Périodiques
610.727 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/sim.6927 ↗
- Languages:
- English
- ISSNs:
- 0277-6715
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 8453.576000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 1501.xml