PheKB: a catalog and workflow for creating electronic phenotype algorithms for transportability. (28th March 2016)
- Record Type:
- Journal Article
- Title:
- PheKB: a catalog and workflow for creating electronic phenotype algorithms for transportability. (28th March 2016)
- Main Title:
- PheKB: a catalog and workflow for creating electronic phenotype algorithms for transportability
- Authors:
- Kirby, Jacqueline C
Speltz, Peter
Rasmussen, Luke V
Basford, Melissa
Gottesman, Omri
Peissig, Peggy L
Pacheco, Jennifer A
Tromp, Gerard
Pathak, Jyotishman
Carrell, David S
Ellis, Stephen B
Lingren, Todd
Thompson, Will K
Savova, Guergana
Haines, Jonathan
Roden, Dan M
Harris, Paul A
Denny, Joshua C - Abstract:
- Abstract : Objective Health care generated data have become an important source for clinical and genomic research. Often, investigators create and iteratively refine phenotype algorithms to achieve high positive predictive values (PPVs) or sensitivity, thereby identifying valid cases and controls. These algorithms achieve the greatest utility when validated and shared by multiple health care systems. Materials and Methods We report the current status and impact of the Phenotype KnowledgeBase (PheKB, http://phekb.org ), an online environment supporting the workflow of building, sharing, and validating electronic phenotype algorithms. We analyze the most frequent components used in algorithms and their performance at authoring institutions and secondary implementation sites. Results As of June 2015, PheKB contained 30 finalized phenotype algorithms and 62 algorithms in development spanning a range of traits and diseases. Phenotypes have had over 3500 unique views in a 6-month period and have been reused by other institutions. International Classification of Disease codes were the most frequently used component, followed by medications and natural language processing. Among algorithms with published performance data, the median PPV was nearly identical when evaluated at the authoring institutions (n = 44; case 96.0%, control 100%) compared to implementation sites (n = 40; case 97.5%, control 100%). Discussion These results demonstrate that a broad range of algorithms to mineAbstract : Objective Health care generated data have become an important source for clinical and genomic research. Often, investigators create and iteratively refine phenotype algorithms to achieve high positive predictive values (PPVs) or sensitivity, thereby identifying valid cases and controls. These algorithms achieve the greatest utility when validated and shared by multiple health care systems. Materials and Methods We report the current status and impact of the Phenotype KnowledgeBase (PheKB, http://phekb.org ), an online environment supporting the workflow of building, sharing, and validating electronic phenotype algorithms. We analyze the most frequent components used in algorithms and their performance at authoring institutions and secondary implementation sites. Results As of June 2015, PheKB contained 30 finalized phenotype algorithms and 62 algorithms in development spanning a range of traits and diseases. Phenotypes have had over 3500 unique views in a 6-month period and have been reused by other institutions. International Classification of Disease codes were the most frequently used component, followed by medications and natural language processing. Among algorithms with published performance data, the median PPV was nearly identical when evaluated at the authoring institutions (n = 44; case 96.0%, control 100%) compared to implementation sites (n = 40; case 97.5%, control 100%). Discussion These results demonstrate that a broad range of algorithms to mine electronic health record data from different health systems can be developed with high PPV, and algorithms developed at one site are generally transportable to others. Conclusion By providing a central repository, PheKB enables improved development, transportability, and validity of algorithms for research-grade phenotypes using health care generated data. … (more)
- Is Part Of:
- Journal of the American Medical Informatics Association. Volume 23:Number 6(2016:Nov.)
- Journal:
- Journal of the American Medical Informatics Association
- Issue:
- Volume 23:Number 6(2016:Nov.)
- Issue Display:
- Volume 23, Issue 6 (2016)
- Year:
- 2016
- Volume:
- 23
- Issue:
- 6
- Issue Sort Value:
- 2016-0023-0006-0000
- Page Start:
- 1046
- Page End:
- 1052
- Publication Date:
- 2016-03-28
- Subjects:
- electronic health records -- electronic phenotyping -- natural language processing -- genomic research -- clinical research
Medical informatics -- Periodicals
Information Services -- Periodicals
Medical Informatics -- Periodicals
Médecine -- Informatique -- Périodiques
Informatica
Geneeskunde
Informatique médicale
Computer network resources
Electronic journals
610.285 - Journal URLs:
- http://jamia.bmj.com/ ↗
http://www.jamia.org ↗
http://www.pubmedcentral.nih.gov/tocrender.fcgi?journal=76 ↗
http://www.sciencedirect.com/science/journal/10675027 ↗
http://jamia.oxfordjournals.org/ ↗
http://www.oxfordjournals.org/en/ ↗ - DOI:
- 10.1093/jamia/ocv202 ↗
- Languages:
- English
- ISSNs:
- 1067-5027
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4689.025000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 15455.xml