Characterizing variability of electronic health record-driven phenotype definitions. (6th December 2022)
- Record Type:
- Journal Article
- Title:
- Characterizing variability of electronic health record-driven phenotype definitions. (6th December 2022)
- Main Title:
- Characterizing variability of electronic health record-driven phenotype definitions
- Authors:
- Brandt, Pascal S
Kho, Abel
Luo, Yuan
Pacheco, Jennifer A
Walunas, Theresa L
Hakonarson, Hakon
Hripcsak, George
Liu, Cong
Shang, Ning
Weng, Chunhua
Walton, Nephi
Carrell, David S
Crane, Paul K
Larson, Eric B
Chute, Christopher G
Kullo, Iftikhar J
Carroll, Robert
Denny, Josh
Ramirez, Andrea
Wei, Wei-Qi
Pathak, Jyoti
Wiley, Laura K
Richesson, Rachel
Starren, Justin B
Rasmussen, Luke V - Abstract:
- Abstract: Objective: The aim of this study was to analyze a publicly available sample of rule-based phenotype definitions to characterize and evaluate the variability of logical constructs used. Materials and Methods: A sample of 33 preexisting phenotype definitions used in research that are represented using Fast Healthcare Interoperability Resources and Clinical Quality Language (CQL) was analyzed using automated analysis of the computable representation of the CQL libraries. Results: Most of the phenotype definitions include narrative descriptions and flowcharts, while few provide pseudocode or executable artifacts. Most use 4 or fewer medical terminologies. The number of codes used ranges from 5 to 6865, and value sets from 1 to 19. We found that the most common expressions used were literal, data, and logical expressions. Aggregate and arithmetic expressions are the least common. Expression depth ranges from 4 to 27. Discussion: Despite the range of conditions, we found that all of the phenotype definitions consisted of logical criteria, representing both clinical and operational logic, and tabular data, consisting of codes from standard terminologies and keywords for natural language processing. The total number and variety of expressions are low, which may be to simplify implementation, or authors may limit complexity due to data availability constraints. Conclusions: The phenotype definitions analyzed show significant variation in specific logical, arithmetic, andAbstract: Objective: The aim of this study was to analyze a publicly available sample of rule-based phenotype definitions to characterize and evaluate the variability of logical constructs used. Materials and Methods: A sample of 33 preexisting phenotype definitions used in research that are represented using Fast Healthcare Interoperability Resources and Clinical Quality Language (CQL) was analyzed using automated analysis of the computable representation of the CQL libraries. Results: Most of the phenotype definitions include narrative descriptions and flowcharts, while few provide pseudocode or executable artifacts. Most use 4 or fewer medical terminologies. The number of codes used ranges from 5 to 6865, and value sets from 1 to 19. We found that the most common expressions used were literal, data, and logical expressions. Aggregate and arithmetic expressions are the least common. Expression depth ranges from 4 to 27. Discussion: Despite the range of conditions, we found that all of the phenotype definitions consisted of logical criteria, representing both clinical and operational logic, and tabular data, consisting of codes from standard terminologies and keywords for natural language processing. The total number and variety of expressions are low, which may be to simplify implementation, or authors may limit complexity due to data availability constraints. Conclusions: The phenotype definitions analyzed show significant variation in specific logical, arithmetic, and other operators but are all composed of the same high-level components, namely tabular data and logical expressions. A standard representation for phenotype definitions should support these formats and be modular to support localization and shared logic. … (more)
- Is Part Of:
- Journal of the American Medical Informatics Association. Volume 30:Number 3(2023)
- Journal:
- Journal of the American Medical Informatics Association
- Issue:
- Volume 30:Number 3(2023)
- Issue Display:
- Volume 30, Issue 3 (2023)
- Year:
- 2023
- Volume:
- 30
- Issue:
- 3
- Issue Sort Value:
- 2023-0030-0003-0000
- Page Start:
- 427
- Page End:
- 437
- Publication Date:
- 2022-12-06
- Subjects:
- FHIR -- CQL -- EHR-driven phenotyping -- cohort identification
Medical informatics -- Periodicals
Information Services -- Periodicals
Medical Informatics -- Periodicals
Médecine -- Informatique -- Périodiques
Informatica
Geneeskunde
Informatique médicale
Computer network resources
Electronic journals
610.285 - Journal URLs:
- http://jamia.bmj.com/ ↗
http://www.jamia.org ↗
http://www.pubmedcentral.nih.gov/tocrender.fcgi?journal=76 ↗
http://www.sciencedirect.com/science/journal/10675027 ↗
http://jamia.oxfordjournals.org/ ↗
http://www.oxfordjournals.org/en/ ↗ - DOI:
- 10.1093/jamia/ocac235 ↗
- Languages:
- English
- ISSNs:
- 1067-5027
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4689.025000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 25954.xml