A System for Phenotype Harmonization in the National Heart, Lung, and Blood Institute Trans-Omics for Precision Medicine (TOPMed) Program. Issue 10 (16th April 2021)
- Record Type:
- Journal Article
- Title:
- A System for Phenotype Harmonization in the National Heart, Lung, and Blood Institute Trans-Omics for Precision Medicine (TOPMed) Program. Issue 10 (16th April 2021)
- Main Title:
- A System for Phenotype Harmonization in the National Heart, Lung, and Blood Institute Trans-Omics for Precision Medicine (TOPMed) Program
- Authors:
- Stilp, Adrienne M
Emery, Leslie S
Broome, Jai G
Buth, Erin J
Khan, Alyna T
Laurie, Cecelia A
Wang, Fei Fei
Wong, Quenna
Chen, Dongquan
D'Augustine, Catherine M
Heard-Costa, Nancy L
Hohensee, Chancellor R
Johnson, William Craig
Juarez, Lucia D
Liu, Jingmin
Mutalik, Karen M
Raffield, Laura M
Wiggins, Kerri L
de Vries, Paul S
Kelly, Tanika N
Kooperberg, Charles
Natarajan, Pradeep
Peloso, Gina M
Peyser, Patricia A
Reiner, Alex P
Arnett, Donna K
Aslibekyan, Stella
Barnes, Kathleen C
Bielak, Lawrence F
Bis, Joshua C
Cade, Brian E
Chen, Ming-Huei
Correa, Adolfo
Cupples, L Adrienne
de Andrade, Mariza
Ellinor, Patrick T
Fornage, Myriam
Franceschini, Nora
Gan, Weiniu
Ganesh, Santhi K
Graffelman, Jan
Grove, Megan L
Guo, Xiuqing
Hawley, Nicola L
Hsu, Wan-Ling
Jackson, Rebecca D
Jaquish, Cashell E
Johnson, Andrew D
Kardia, Sharon L R
Kelly, Shannon
Lee, Jiwon
Mathias, Rasika A
McGarvey, Stephen T
Mitchell, Braxton D
Montasser, May E
Morrison, Alanna C
North, Kari E
Nouraie, Seyed Mehdi
Oelsner, Elizabeth C
Pankratz, Nathan
Rich, Stephen S
Rotter, Jerome I
Smith, Jennifer A
Taylor, Kent D
Vasan, Ramachandran S
Weeks, Daniel E
Weiss, Scott T
Wilson, Carla G
Yanek, Lisa R
Psaty, Bruce M
Heckbert, Susan R
Laurie, Cathy C
… (more) - Abstract:
- Abstract: Genotype-phenotype association studies often combine phenotype data from multiple studies to increase statistical power. Harmonization of the data usually requires substantial effort due to heterogeneity in phenotype definitions, study design, data collection procedures, and data-set organization. Here we describe a centralized system for phenotype harmonization that includes input from phenotype domain and study experts, quality control, documentation, reproducible results, and data-sharing mechanisms. This system was developed for the National Heart, Lung, and Blood Institute's Trans-Omics for Precision Medicine (TOPMed) program, which is generating genomic and other -omics data for more than 80 studies with extensive phenotype data. To date, 63 phenotypes have been harmonized across thousands of participants (recruited in 1948–2012) from up to 17 studies per phenotype. Here we discuss challenges in this undertaking and how they were addressed. The harmonized phenotype data and associated documentation have been submitted to National Institutes of Health data repositories for controlled access by the scientific community. We also provide materials to facilitate future harmonization efforts by the community, which include 1) the software code used to generate the 63 harmonized phenotypes, enabling others to reproduce, modify, or extend these harmonizations to additional studies, and 2) the results of labeling thousands of phenotype variables with controlledAbstract: Genotype-phenotype association studies often combine phenotype data from multiple studies to increase statistical power. Harmonization of the data usually requires substantial effort due to heterogeneity in phenotype definitions, study design, data collection procedures, and data-set organization. Here we describe a centralized system for phenotype harmonization that includes input from phenotype domain and study experts, quality control, documentation, reproducible results, and data-sharing mechanisms. This system was developed for the National Heart, Lung, and Blood Institute's Trans-Omics for Precision Medicine (TOPMed) program, which is generating genomic and other -omics data for more than 80 studies with extensive phenotype data. To date, 63 phenotypes have been harmonized across thousands of participants (recruited in 1948–2012) from up to 17 studies per phenotype. Here we discuss challenges in this undertaking and how they were addressed. The harmonized phenotype data and associated documentation have been submitted to National Institutes of Health data repositories for controlled access by the scientific community. We also provide materials to facilitate future harmonization efforts by the community, which include 1) the software code used to generate the 63 harmonized phenotypes, enabling others to reproduce, modify, or extend these harmonizations to additional studies, and 2) the results of labeling thousands of phenotype variables with controlled vocabulary terms. … (more)
- Is Part Of:
- American journal of epidemiology. Volume 190:Issue 10(2021)
- Journal:
- American journal of epidemiology
- Issue:
- Volume 190:Issue 10(2021)
- Issue Display:
- Volume 190, Issue 10 (2021)
- Year:
- 2021
- Volume:
- 190
- Issue:
- 10
- Issue Sort Value:
- 2021-0190-0010-0000
- Page Start:
- 1977
- Page End:
- 1992
- Publication Date:
- 2021-04-16
- Subjects:
- cardiovascular disease -- common data elements -- hematologic disease -- information dissemination -- lung diseases -- phenotypes -- sleep-wake disorders
Epidemiology -- Periodicals
Public health -- Periodicals
614.4 - Journal URLs:
- http://aje.oxfordjournals.org/ ↗
http://ukcatalogue.oup.com/ ↗ - DOI:
- 10.1093/aje/kwab115 ↗
- Languages:
- English
- ISSNs:
- 0002-9262
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 0824.600000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 26011.xml