Matching whole genomes to rare genetic disorders: Identification of potential causative variants using phenotype‐weighted knowledge in the CAGI SickKids5 clinical genomes challenge. Issue 2 (15th November 2019)
- Record Type:
- Journal Article
- Title:
- Matching whole genomes to rare genetic disorders: Identification of potential causative variants using phenotype‐weighted knowledge in the CAGI SickKids5 clinical genomes challenge. Issue 2 (15th November 2019)
- Main Title:
- Matching whole genomes to rare genetic disorders: Identification of potential causative variants using phenotype‐weighted knowledge in the CAGI SickKids5 clinical genomes challenge
- Authors:
- Pal, Lipika R.
Kundu, Kunal
Yin, Yizhou
Moult, John - Abstract:
- Abstract: Precise identification of causative variants from whole‐genome sequencing data, including both coding and noncoding variants, is challenging. The Critical Assessment of Genome Interpretation 5 SickKids clinical genome challenge provided an opportunity to assess our ability to extract such information. Participants in the challenge were required to match each of the 24 whole‐genome sequences to the correct phenotypic profile and to identify the disease class of each genome. These are all rare disease cases that have resisted genetic diagnosis in a state‐of‐the‐art pipeline. The patients have a range of eye, neurological, and connective‐tissue disorders. We used a gene‐centric approach to address this problem, assigning each gene a multiphenotype‐matching score. Mutations in the top‐scoring genes for each phenotype profile were ranked on a 6‐point scale of pathogenicity probability, resulting in an approximately equal number of top‐ranked coding and noncoding candidate variants overall. We were able to assign the correct disease class for 12 cases and the correct genome to a clinical profile for five cases. The challenge assessor found genes in three of these five cases as likely appropriate. In the postsubmission phase, after careful screening of the genes in the correct genome, we identified additional potential diagnostic variants, a high proportion of which are noncoding. Abstract : We describe a pipeline for identifying rare disease causative variants from theAbstract: Precise identification of causative variants from whole‐genome sequencing data, including both coding and noncoding variants, is challenging. The Critical Assessment of Genome Interpretation 5 SickKids clinical genome challenge provided an opportunity to assess our ability to extract such information. Participants in the challenge were required to match each of the 24 whole‐genome sequences to the correct phenotypic profile and to identify the disease class of each genome. These are all rare disease cases that have resisted genetic diagnosis in a state‐of‐the‐art pipeline. The patients have a range of eye, neurological, and connective‐tissue disorders. We used a gene‐centric approach to address this problem, assigning each gene a multiphenotype‐matching score. Mutations in the top‐scoring genes for each phenotype profile were ranked on a 6‐point scale of pathogenicity probability, resulting in an approximately equal number of top‐ranked coding and noncoding candidate variants overall. We were able to assign the correct disease class for 12 cases and the correct genome to a clinical profile for five cases. The challenge assessor found genes in three of these five cases as likely appropriate. In the postsubmission phase, after careful screening of the genes in the correct genome, we identified additional potential diagnostic variants, a high proportion of which are noncoding. Abstract : We describe a pipeline for identifying rare disease causative variants from the whole‐genome sequence. The pipeline was developed for the Critical Assessment of Genome Interpretation 5 (CAGI5) challenge of matching clinical phenotypes to genomes for a set of pediatric cases but has general applicability. Novel features include a phenotype‐gene matching algorithm using weighted Human Phenotype Ontology terms and six categories of potential impact variants. The pipeline was demonstrated to be effective with the CAGI data. … (more)
- Is Part Of:
- Human mutation. Volume 41:Issue 2(2020)
- Journal:
- Human mutation
- Issue:
- Volume 41:Issue 2(2020)
- Issue Display:
- Volume 41, Issue 2 (2020)
- Year:
- 2020
- Volume:
- 41
- Issue:
- 2
- Issue Sort Value:
- 2020-0041-0002-0000
- Page Start:
- 347
- Page End:
- 362
- Publication Date:
- 2019-11-15
- Subjects:
- CAGI5 -- connective‐tissue disorder -- diagnostic variants -- eye disorder -- Human Phenotype Ontology (HPO) -- neurological diseases -- whole‐genome sequencing data
Human chromosome abnormalities -- Periodicals
Mutation (Biology) -- Periodicals
616.04205 - Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)1098-1004 ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1002/humu.23933 ↗
- Languages:
- English
- ISSNs:
- 1059-7794
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4336.217000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 18819.xml