Beyond the SNP Threshold: Identifying Outbreak Clusters Using Inferred Transmissions. (25th January 2019)
- Record Type:
- Journal Article
- Title:
- Beyond the SNP Threshold: Identifying Outbreak Clusters Using Inferred Transmissions. (25th January 2019)
- Main Title:
- Beyond the SNP Threshold: Identifying Outbreak Clusters Using Inferred Transmissions
- Authors:
- Stimson, James
Gardy, Jennifer
Mathema, Barun
Crudu, Valeriu
Cohen, Ted
Colijn, Caroline - Editors:
- Leitner, Thomas
- Abstract:
- Abstract: Whole-genome sequencing (WGS) is increasingly used to aid the understanding of pathogen transmission. A first step in analyzing WGS data is usually to define "transmission clusters, " sets of cases that are potentially linked by direct transmission. This is often done by including two cases in the same cluster if they are separated by fewer single-nucleotide polymorphisms (SNPs) than a specified threshold. However, there is little agreement as to what an appropriate threshold should be. We propose a probabilistic alternative, suggesting that the key inferential target for transmission clusters is the number of transmissions separating cases. We characterize this by combining the number of SNP differences and the length of time over which those differences have accumulated, using information about case timing, molecular clock, and transmission processes. Our framework has the advantage of allowing for variable mutation rates across the genome and can incorporate other epidemiological data. We use two tuberculosis studies to illustrate the impact of our approach: with British Columbia data by using spatial divisions; with Republic of Moldova data by incorporating antibiotic resistance. Simulation results indicate that our transmission-based method is better in identifying direct transmissions than a SNP threshold, with dissimilarity between clusterings of on average 0.27 bits compared with 0.37 bits for the SNP-threshold method and 0.84 bits for randomly permutedAbstract: Whole-genome sequencing (WGS) is increasingly used to aid the understanding of pathogen transmission. A first step in analyzing WGS data is usually to define "transmission clusters, " sets of cases that are potentially linked by direct transmission. This is often done by including two cases in the same cluster if they are separated by fewer single-nucleotide polymorphisms (SNPs) than a specified threshold. However, there is little agreement as to what an appropriate threshold should be. We propose a probabilistic alternative, suggesting that the key inferential target for transmission clusters is the number of transmissions separating cases. We characterize this by combining the number of SNP differences and the length of time over which those differences have accumulated, using information about case timing, molecular clock, and transmission processes. Our framework has the advantage of allowing for variable mutation rates across the genome and can incorporate other epidemiological data. We use two tuberculosis studies to illustrate the impact of our approach: with British Columbia data by using spatial divisions; with Republic of Moldova data by incorporating antibiotic resistance. Simulation results indicate that our transmission-based method is better in identifying direct transmissions than a SNP threshold, with dissimilarity between clusterings of on average 0.27 bits compared with 0.37 bits for the SNP-threshold method and 0.84 bits for randomly permuted data. These results show that it is likely to outperform the SNP-threshold method where clock rates are variable and sample collection times are spread out. We implement the method in the R package transcluster. … (more)
- Is Part Of:
- Molecular biology and evolution. Volume 36:Number 3(2019)
- Journal:
- Molecular biology and evolution
- Issue:
- Volume 36:Number 3(2019)
- Issue Display:
- Volume 36, Issue 3 (2019)
- Year:
- 2019
- Volume:
- 36
- Issue:
- 3
- Issue Sort Value:
- 2019-0036-0003-0000
- Page Start:
- 587
- Page End:
- 603
- Publication Date:
- 2019-01-25
- Subjects:
- SNP -- transmission clusters -- whole-genome sequencing -- public health
Molecular biology -- Periodicals
Molecular evolution -- Periodicals
Evolution, Molecular -- Periodicals
Molecular Biology -- Periodicals
572.8 - Journal URLs:
- http://mbe.oxfordjournals.org/ ↗
http://www.molbiolevol.org/ ↗
http://ukcatalogue.oup.com/ ↗
http://firstsearch.oclc.org ↗
http://firstsearch.oclc.org/journal=0737-7038;screen=info;ECOIP ↗ - DOI:
- 10.1093/molbev/msy242 ↗
- Languages:
- English
- ISSNs:
- 0737-4038
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 5900.782000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 11803.xml