Constrained binary classification using ensemble learning: an application to cost‐efficient targeted PrEP strategies. (6th April 2017)
- Record Type:
- Journal Article
- Title:
- Constrained binary classification using ensemble learning: an application to cost‐efficient targeted PrEP strategies. (6th April 2017)
- Main Title:
- Constrained binary classification using ensemble learning: an application to cost‐efficient targeted PrEP strategies
- Authors:
- Zheng, Wenjing
Balzer, Laura
van der Laan, Mark
Petersen, Maya - Other Names:
- Gezmu Misrak guestEditor.
Qin Jing guestEditor. - Abstract:
- Abstract : Binary classification problems are ubiquitous in health and social sciences. In many cases, one wishes to balance two competing optimality considerations for a binary classifier. For instance, in resource‐limited settings, an human immunodeficiency virus prevention program based on offering pre‐exposure prophylaxis (PrEP) to select high‐risk individuals must balance the sensitivity of the binary classifier in detecting future seroconverters (and hence offering them PrEP regimens) with the total number of PrEP regimens that is financially and logistically feasible for the program. In this article, we consider a general class of constrained binary classification problems wherein the objective function and the constraint are both monotonic with respect to a threshold. These include the minimization of the rate of positive predictions subject to a minimum sensitivity, the maximization of sensitivity subject to a maximum rate of positive predictions, and the Neyman–Pearson paradigm, which minimizes the type II error subject to an upper bound on the type I error. We propose an ensemble approach to these binary classification problems based on the Super Learner methodology. This approach linearly combines a user‐supplied library of scoring algorithms, with combination weights and a discriminating threshold chosen to minimize the constrained optimality criterion. We then illustrate the application of the proposed classifier to develop an individualized PrEP targetingAbstract : Binary classification problems are ubiquitous in health and social sciences. In many cases, one wishes to balance two competing optimality considerations for a binary classifier. For instance, in resource‐limited settings, an human immunodeficiency virus prevention program based on offering pre‐exposure prophylaxis (PrEP) to select high‐risk individuals must balance the sensitivity of the binary classifier in detecting future seroconverters (and hence offering them PrEP regimens) with the total number of PrEP regimens that is financially and logistically feasible for the program. In this article, we consider a general class of constrained binary classification problems wherein the objective function and the constraint are both monotonic with respect to a threshold. These include the minimization of the rate of positive predictions subject to a minimum sensitivity, the maximization of sensitivity subject to a maximum rate of positive predictions, and the Neyman–Pearson paradigm, which minimizes the type II error subject to an upper bound on the type I error. We propose an ensemble approach to these binary classification problems based on the Super Learner methodology. This approach linearly combines a user‐supplied library of scoring algorithms, with combination weights and a discriminating threshold chosen to minimize the constrained optimality criterion. We then illustrate the application of the proposed classifier to develop an individualized PrEP targeting strategy in a resource‐limited setting, with the goal of minimizing the number of PrEP offerings while achieving a minimum required sensitivity. This proof of concept data analysis uses baseline data from the ongoing Sustainable East Africa Research in Community Health study. Copyright © 2017 John Wiley & Sons, Ltd. … (more)
- Is Part Of:
- Statistics in medicine. Volume 37:Number 2(2018)
- Journal:
- Statistics in medicine
- Issue:
- Volume 37:Number 2(2018)
- Issue Display:
- Volume 37, Issue 2 (2018)
- Year:
- 2018
- Volume:
- 37
- Issue:
- 2
- Issue Sort Value:
- 2018-0037-0002-0000
- Page Start:
- 261
- Page End:
- 279
- Publication Date:
- 2017-04-06
- Subjects:
- Super Learner -- constrained binary classification -- Neyman–Pearson, sensitivity -- rate of positive predictions -- PrEP -- ensemble classification, cross‐validation
Medical statistics -- Periodicals
Statistique médicale -- Périodiques
Statistiques médicales -- Périodiques
610.727 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/sim.7296 ↗
- Languages:
- English
- ISSNs:
- 0277-6715
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 8453.576000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 5545.xml