Matching phenotypes to whole genomes: Lessons learned from four iterations of the personal genome project community challenges. Issue 9 (19th June 2017)
- Record Type:
- Journal Article
- Title:
- Matching phenotypes to whole genomes: Lessons learned from four iterations of the personal genome project community challenges. Issue 9 (19th June 2017)
- Main Title:
- Matching phenotypes to whole genomes: Lessons learned from four iterations of the personal genome project community challenges
- Authors:
- Cai, Binghuang
Li, Biao
Kiga, Nikki
Thusberg, Janita
Bergquist, Timothy
Chen, Yun‐Ching
Niknafs, Noushin
Carter, Hannah
Tokheim, Collin
Beleva‐Guthrie, Violeta
Douville, Christopher
Bhattacharya, Rohit
Yeo, Hui Ting Grace
Fan, Jean
Sengupta, Sohini
Kim, Dewey
Cline, Melissa
Turner, Tychele
Diekhans, Mark
Zaucha, Jan
Pal, Lipika R.
Cao, Chen
Yu, Chen‐Hsin
Yin, Yizhou
Carraro, Marco
Giollo, Manuel
Ferrari, Carlo
Leonardi, Emanuela
Tosatto, Silvio C.E.
Bobe, Jason
Ball, Madeleine
Hoskins, Roger A.
Repo, Susanna
Church, George
Brenner, Steven E.
Moult, John
Gough, Julian
Stanke, Mario
Karchin, Rachel
Mooney, Sean D.
… (more) - Abstract:
- Abstract : PGP provides unrestricted access to genomes of individuals and their associated phenotypes. The CAGI PGP challenge is to predict whether an individual had a particular trait or phenotype profile based on their whole genome. Assessment results show prediction of individual traits is difficult, relying on a strong knowledge of trait frequency within general population, while matching genomes to trait profiles relies heavily upon a small number of common traits. Abstract: The advent of next‐generation sequencing has dramatically decreased the cost for whole‐genome sequencing and increased the viability for its application in research and clinical care. The Personal Genome Project (PGP) provides unrestricted access to genomes of individuals and their associated phenotypes. This resource enabled the Critical Assessment of Genome Interpretation (CAGI) to create a community challenge to assess the bioinformatics community's ability to predict traits from whole genomes. In the CAGI PGP challenge, researchers were asked to predict whether an individual had a particular trait or profile based on their whole genome. Several approaches were used to assess submissions, including ROC AUC (area under receiver operating characteristic curve), probability rankings, the number of correct predictions, and statistical significance simulations. Overall, we found that prediction of individual traits is difficult, relying on a strong knowledge of trait frequency within the generalAbstract : PGP provides unrestricted access to genomes of individuals and their associated phenotypes. The CAGI PGP challenge is to predict whether an individual had a particular trait or phenotype profile based on their whole genome. Assessment results show prediction of individual traits is difficult, relying on a strong knowledge of trait frequency within general population, while matching genomes to trait profiles relies heavily upon a small number of common traits. Abstract: The advent of next‐generation sequencing has dramatically decreased the cost for whole‐genome sequencing and increased the viability for its application in research and clinical care. The Personal Genome Project (PGP) provides unrestricted access to genomes of individuals and their associated phenotypes. This resource enabled the Critical Assessment of Genome Interpretation (CAGI) to create a community challenge to assess the bioinformatics community's ability to predict traits from whole genomes. In the CAGI PGP challenge, researchers were asked to predict whether an individual had a particular trait or profile based on their whole genome. Several approaches were used to assess submissions, including ROC AUC (area under receiver operating characteristic curve), probability rankings, the number of correct predictions, and statistical significance simulations. Overall, we found that prediction of individual traits is difficult, relying on a strong knowledge of trait frequency within the general population, whereas matching genomes to trait profiles relies heavily upon a small number of common traits including ancestry, blood type, and eye color. When a rare genetic disorder is present, profiles can be matched when one or more pathogenic variants are identified. Prediction accuracy has improved substantially over the last 6 years due to improved methodology and a better understanding of features. … (more)
- Is Part Of:
- Human mutation. Volume 38:Issue 9(2017)
- Journal:
- Human mutation
- Issue:
- Volume 38:Issue 9(2017)
- Issue Display:
- Volume 38, Issue 9 (2017)
- Year:
- 2017
- Volume:
- 38
- Issue:
- 9
- Issue Sort Value:
- 2017-0038-0009-0000
- Page Start:
- 1266
- Page End:
- 1276
- Publication Date:
- 2017-06-19
- Subjects:
- biomedical informatics -- community challenge -- critical assessment -- genome -- genome interpretation -- open consent -- personal genome project (PGP) -- phenotype
Human chromosome abnormalities -- Periodicals
Mutation (Biology) -- Periodicals
616.04205 - Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)1098-1004 ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1002/humu.23265 ↗
- Languages:
- English
- ISSNs:
- 1059-7794
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4336.217000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 14205.xml