Estimating the relative probability of direct transmission between infectious disease patients. (24th March 2020)
- Record Type:
- Journal Article
- Title:
- Estimating the relative probability of direct transmission between infectious disease patients. (24th March 2020)
- Main Title:
- Estimating the relative probability of direct transmission between infectious disease patients
- Authors:
- Leavitt, Sarah V
Lee, Robyn S
Sebastiani, Paola
Horsburgh, C Robert
Jenkins, Helen E
White, Laura F - Abstract:
- Abstract: Background: Estimating infectious disease parameters such as the serial interval (time between symptom onset in primary and secondary cases) and reproductive number (average number of secondary cases produced by a primary case) are important in understanding infectious disease dynamics. Many estimation methods require linking cases by direct transmission, a difficult task for most diseases. Methods: Using a subset of cases with detailed genetic and/or contact investigation data to develop a training set of probable transmission events, we build a model to estimate the relative transmission probability for all case-pairs from demographic, spatial and clinical data. Our method is based on naive Bayes, a machine learning classification algorithm which uses the observed frequencies in the training dataset to estimate the probability that a pair is linked given a set of covariates. Results: In simulations, we find that the probabilities estimated using genetic distance between cases to define training transmission events are able to distinguish between truly linked and unlinked pairs with high accuracy (area under the receiver operating curve value of 95%). Additionally, only a subset of the cases, 10–50% depending on sample size, need to have detailed genetic data for our method to perform well. We show how these probabilities can be used to estimate the average effective reproductive number and apply our method to a tuberculosis outbreak in Hamburg, Germany.Abstract: Background: Estimating infectious disease parameters such as the serial interval (time between symptom onset in primary and secondary cases) and reproductive number (average number of secondary cases produced by a primary case) are important in understanding infectious disease dynamics. Many estimation methods require linking cases by direct transmission, a difficult task for most diseases. Methods: Using a subset of cases with detailed genetic and/or contact investigation data to develop a training set of probable transmission events, we build a model to estimate the relative transmission probability for all case-pairs from demographic, spatial and clinical data. Our method is based on naive Bayes, a machine learning classification algorithm which uses the observed frequencies in the training dataset to estimate the probability that a pair is linked given a set of covariates. Results: In simulations, we find that the probabilities estimated using genetic distance between cases to define training transmission events are able to distinguish between truly linked and unlinked pairs with high accuracy (area under the receiver operating curve value of 95%). Additionally, only a subset of the cases, 10–50% depending on sample size, need to have detailed genetic data for our method to perform well. We show how these probabilities can be used to estimate the average effective reproductive number and apply our method to a tuberculosis outbreak in Hamburg, Germany. Conclusions: Our method is a novel way to infer transmission dynamics in any dataset when only a subset of cases has rich contact investigation and/or genetic data. … (more)
- Is Part Of:
- International journal of epidemiology. Volume 49:Number 3(2020)
- Journal:
- International journal of epidemiology
- Issue:
- Volume 49:Number 3(2020)
- Issue Display:
- Volume 49, Issue 3 (2020)
- Year:
- 2020
- Volume:
- 49
- Issue:
- 3
- Issue Sort Value:
- 2020-0049-0003-0000
- Page Start:
- 764
- Page End:
- 775
- Publication Date:
- 2020-03-24
- Subjects:
- Tuberculosis -- reproductive number -- naive Bayes -- machine learning
Epidemiology -- Periodicals
614.4 - Journal URLs:
- http://ije.oxfordjournals.org/ ↗
http://ukcatalogue.oup.com/ ↗ - DOI:
- 10.1093/ije/dyaa031 ↗
- Languages:
- English
- ISSNs:
- 0300-5771
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4542.244000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 15066.xml