Class-imbalanced subsampling lasso algorithm for discovering adverse drug reactions. (March 2018)
- Record Type:
- Journal Article
- Title:
- Class-imbalanced subsampling lasso algorithm for discovering adverse drug reactions. (March 2018)
- Main Title:
- Class-imbalanced subsampling lasso algorithm for discovering adverse drug reactions
- Authors:
- Ahmed, Ismaïl
Pariente, Antoine
Tubert-Bitter, Pascale - Other Names:
- Nakas Christos T guest-editor.
Reiser Benjamin guest-editor. - Abstract:
- Background: All methods routinely used to generate safety signals from pharmacovigilance databases rely on disproportionality analyses of counts aggregating patients' spontaneous reports. Recently, it was proposed to analyze individual spontaneous reports directly using Bayesian lasso logistic regressions. Nevertheless, this raises the issue of choosing an adequate regularization parameter in a variable selection framework while accounting for computational constraints due to the high dimension of the data. Purpose: Our main objective is to propose a method, which exploits the subsampling idea from Stability Selection, a variable selection procedure combining subsampling with a high-dimensional selection algorithm, and adapts it to the specificities of the spontaneous reporting data, the latter being characterized by their large size, their binary nature and their sparsity. Materials and method: Given the large imbalance existing between the presence and absence of a given adverse event, we propose an alternative subsampling scheme to that of Stability Selection resulting in an over-representation of the minority class and a drastic reduction in the number of observations in each subsample. Simulations are used to help define the detection threshold as regards the average proportion of false signals. They are also used to compare the performances of the proposed sampling scheme with that originally proposed for Stability Selection. Finally, we compare the proposed method toBackground: All methods routinely used to generate safety signals from pharmacovigilance databases rely on disproportionality analyses of counts aggregating patients' spontaneous reports. Recently, it was proposed to analyze individual spontaneous reports directly using Bayesian lasso logistic regressions. Nevertheless, this raises the issue of choosing an adequate regularization parameter in a variable selection framework while accounting for computational constraints due to the high dimension of the data. Purpose: Our main objective is to propose a method, which exploits the subsampling idea from Stability Selection, a variable selection procedure combining subsampling with a high-dimensional selection algorithm, and adapts it to the specificities of the spontaneous reporting data, the latter being characterized by their large size, their binary nature and their sparsity. Materials and method: Given the large imbalance existing between the presence and absence of a given adverse event, we propose an alternative subsampling scheme to that of Stability Selection resulting in an over-representation of the minority class and a drastic reduction in the number of observations in each subsample. Simulations are used to help define the detection threshold as regards the average proportion of false signals. They are also used to compare the performances of the proposed sampling scheme with that originally proposed for Stability Selection. Finally, we compare the proposed method to the gamma Poisson shrinker, a disproportionality method, and to a lasso logistic regression approach through an empirical study conducted on the French national pharmacovigilance database and two sets of reference signals. Results: Simulations show that the proposed sampling strategy performs better in terms of false discoveries and is faster than the equiprobable sampling of Stability Selection. The empirical evaluation illustrates the better performances of the proposed method compared with gamma Poisson shrinker and the lasso in terms of number of reference signals retrieved. … (more)
- Is Part Of:
- Statistical methods in medical research. Volume 27:Number 3(2018)
- Journal:
- Statistical methods in medical research
- Issue:
- Volume 27:Number 3(2018)
- Issue Display:
- Volume 27, Issue 3 (2018)
- Year:
- 2018
- Volume:
- 27
- Issue:
- 3
- Issue Sort Value:
- 2018-0027-0003-0000
- Page Start:
- 785
- Page End:
- 797
- Publication Date:
- 2018-03
- Subjects:
- Medicine -- Research -- Statistical methods -- Periodicals
Research -- Periodicals
Review Literature -- Periodicals
Statistics -- methods -- Periodicals
Médecine -- Recherche -- Méthodes statistiques -- Périodiques
610.727 - Journal URLs:
- http://smm.sagepub.com/ ↗
http://www.ingentaselect.com/rpsv/cw/arn/09622802/contp1.htm ↗
http://www.uk.sagepub.com/home.nav ↗
http://firstsearch.oclc.org ↗
http://firstsearch.oclc.org/journal=0962-2802;screen=info;ECOIP ↗ - DOI:
- 10.1177/0962280216643116 ↗
- Languages:
- English
- ISSNs:
- 0962-2802
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 8091.xml