Development of a classifier to identify patients with probable Lennox–Gastaut syndrome in health insurance claims databases via random forest methodology. (3rd August 2019)
- Record Type:
- Journal Article
- Title:
- Development of a classifier to identify patients with probable Lennox–Gastaut syndrome in health insurance claims databases via random forest methodology. (3rd August 2019)
- Main Title:
- Development of a classifier to identify patients with probable Lennox–Gastaut syndrome in health insurance claims databases via random forest methodology
- Authors:
- Vekeman, Francis
Piña-Garza, Jesus Eric
Cheng, Wendy Y.
Tuttle, Edward
Giguère-Duval, Philippe
Oganisian, Arman
Damron, Joseph
Sheng Duh, Mei
Shen, Vivienne
Saurer, Timothy B.
Montouris, Georgia D.
Isojarvi, Jouko - Abstract:
- Abstract: Objective: Describe the development of a claims-based classifier utilizing machine learning to identify patients with probable Lennox–Gastaut syndrome (LGS) from six state Medicaid programs. Methods: Patients were included if they had ≥2 medical claims ≥30 days apart for specified or unspecified epilepsy, excluding those with ≥1 claim for petit mal status. The LGS classifier utilized a random forest algorithm, a compilation of thousands of binary decision trees in which machine-generated predictor variables split the data set into branches that predict the presence or absence of LGS. To construct the splitting rules, the importance of each candidate variable was determined by calculating the mean decrease in Gini impurity. Training and testing were performed on two data sets (30% and 70%) using a "true" LGS and non-LGS patient population. Performance was compared with logistic regression and single tree methodology. Results: Using a 60% probability threshold, which yielded the highest sensitivity (97.3%) and specificity (95.6%), the classifier identified approximately 4% of patients with epilepsy as probable LGS. The most important input variables included number of distinct antiepileptic drugs received, epilepsy-related outpatient/inpatient visits, electroencephalogram procedures and claims for delayed development. The random forest methodology outperformed logistic regression and single tree methodology. Most of the important LGS predictor characteristicsAbstract: Objective: Describe the development of a claims-based classifier utilizing machine learning to identify patients with probable Lennox–Gastaut syndrome (LGS) from six state Medicaid programs. Methods: Patients were included if they had ≥2 medical claims ≥30 days apart for specified or unspecified epilepsy, excluding those with ≥1 claim for petit mal status. The LGS classifier utilized a random forest algorithm, a compilation of thousands of binary decision trees in which machine-generated predictor variables split the data set into branches that predict the presence or absence of LGS. To construct the splitting rules, the importance of each candidate variable was determined by calculating the mean decrease in Gini impurity. Training and testing were performed on two data sets (30% and 70%) using a "true" LGS and non-LGS patient population. Performance was compared with logistic regression and single tree methodology. Results: Using a 60% probability threshold, which yielded the highest sensitivity (97.3%) and specificity (95.6%), the classifier identified approximately 4% of patients with epilepsy as probable LGS. The most important input variables included number of distinct antiepileptic drugs received, epilepsy-related outpatient/inpatient visits, electroencephalogram procedures and claims for delayed development. The random forest methodology outperformed logistic regression and single tree methodology. Most of the important LGS predictor characteristics identified by the classifier were statistically significantly associated with LGS status ( p < .05). Conclusions: The claims-based LGS classifier showed high sensitivity and specificity, outperformed single tree and logistic regression methodologies and identified a prevalence of probable LGS that was similar to previously published estimates. … (more)
- Is Part Of:
- Current medical research and opinion. Volume 35:Number 8(2019)
- Journal:
- Current medical research and opinion
- Issue:
- Volume 35:Number 8(2019)
- Issue Display:
- Volume 35, Issue 8 (2019)
- Year:
- 2019
- Volume:
- 35
- Issue:
- 8
- Issue Sort Value:
- 2019-0035-0008-0000
- Page Start:
- 1415
- Page End:
- 1420
- Publication Date:
- 2019-08-03
- Subjects:
- Epilepsy -- Lennox–Gastaut syndrome -- LGS -- LGS classifier -- machine learning -- random forest methodology
Clinical medicine -- Periodicals
Therapeutics -- Periodicals
615.5 - Journal URLs:
- http://informahealthcare.com ↗
- DOI:
- 10.1080/03007995.2019.1595552 ↗
- Languages:
- English
- ISSNs:
- 0300-7995
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3500.301000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 17167.xml