The National COVID Cohort Collaborative (N3C): Rationale, design, infrastructure, and deployment. (17th August 2020)
- Record Type:
- Journal Article
- Title:
- The National COVID Cohort Collaborative (N3C): Rationale, design, infrastructure, and deployment. (17th August 2020)
- Main Title:
- The National COVID Cohort Collaborative (N3C): Rationale, design, infrastructure, and deployment
- Authors:
- Haendel, Melissa A
Chute, Christopher G
Bennett, Tellen D
Eichmann, David A
Guinney, Justin
Kibbe, Warren A
Payne, Philip R O
Pfaff, Emily R
Robinson, Peter N
Saltz, Joel H
Spratt, Heidi
Suver, Christine
Wilbanks, John
Wilcox, Adam B
Williams, Andrew E
Wu, Chunlei
Blacketer, Clair
Bradford, Robert L
Cimino, James J
Clark, Marshall
Colmenares, Evan W
Francis, Patricia A
Gabriel, Davera
Graves, Alexis
Hemadri, Raju
Hong, Stephanie S
Hripscak, George
Jiao, Dazhi
Klann, Jeffrey G
Kostka, Kristin
Lee, Adam M
Lehmann, Harold P
Lingrey, Lora
Miller, Robert T
Morris, Michele
Murphy, Shawn N
Natarajan, Karthik
Palchuk, Matvey B
Sheikh, Usman
Solbrig, Harold
Visweswaran, Shyam
Walden, Anita
Walters, Kellie M
Weber, Griffin M
Zhang, Xiaohan Tanner
Zhu, Richard L
Amor, Benjamin
Girvin, Andrew T
Manna, Amin
Qureshi, Nabeel
Kurilla, Michael G
Michael, Sam G
Portilla, Lili M
Rutter, Joni L
Austin, Christopher P
Gersing, Ken R
… (more) - Abstract:
- Abstract: Objective: Coronavirus disease 2019 (COVID-19) poses societal challenges that require expeditious data and knowledge sharing. Though organizational clinical data are abundant, these are largely inaccessible to outside researchers. Statistical, machine learning, and causal analyses are most successful with large-scale data beyond what is available in any given organization. Here, we introduce the National COVID Cohort Collaborative (N3C), an open science community focused on analyzing patient-level data from many centers. Materials and Methods: The Clinical and Translational Science Award Program and scientific community created N3C to overcome technical, regulatory, policy, and governance barriers to sharing and harmonizing individual-level clinical data. We developed solutions to extract, aggregate, and harmonize data across organizations and data models, and created a secure data enclave to enable efficient, transparent, and reproducible collaborative analytics. Results: Organized in inclusive workstreams, we created legal agreements and governance for organizations and researchers; data extraction scripts to identify and ingest positive, negative, and possible COVID-19 cases; a data quality assurance and harmonization pipeline to create a single harmonized dataset; population of the secure data enclave with data, machine learning, and statistical analytics tools; dissemination mechanisms; and a synthetic data pilot to democratize data access. Conclusions: TheAbstract: Objective: Coronavirus disease 2019 (COVID-19) poses societal challenges that require expeditious data and knowledge sharing. Though organizational clinical data are abundant, these are largely inaccessible to outside researchers. Statistical, machine learning, and causal analyses are most successful with large-scale data beyond what is available in any given organization. Here, we introduce the National COVID Cohort Collaborative (N3C), an open science community focused on analyzing patient-level data from many centers. Materials and Methods: The Clinical and Translational Science Award Program and scientific community created N3C to overcome technical, regulatory, policy, and governance barriers to sharing and harmonizing individual-level clinical data. We developed solutions to extract, aggregate, and harmonize data across organizations and data models, and created a secure data enclave to enable efficient, transparent, and reproducible collaborative analytics. Results: Organized in inclusive workstreams, we created legal agreements and governance for organizations and researchers; data extraction scripts to identify and ingest positive, negative, and possible COVID-19 cases; a data quality assurance and harmonization pipeline to create a single harmonized dataset; population of the secure data enclave with data, machine learning, and statistical analytics tools; dissemination mechanisms; and a synthetic data pilot to democratize data access. Conclusions: The N3C has demonstrated that a multisite collaborative learning health network can overcome barriers to rapidly build a scalable infrastructure incorporating multiorganizational clinical data for COVID-19 analytics. We expect this effort to save lives by enabling rapid collaboration among clinicians, researchers, and data scientists to identify treatments and specialized care and thereby reduce the immediate and long-term impacts of COVID-19. … (more)
- Is Part Of:
- Journal of the American Medical Informatics Association. Volume 28:Number 3(2021)
- Journal:
- Journal of the American Medical Informatics Association
- Issue:
- Volume 28:Number 3(2021)
- Issue Display:
- Volume 28, Issue 3 (2021)
- Year:
- 2021
- Volume:
- 28
- Issue:
- 3
- Issue Sort Value:
- 2021-0028-0003-0000
- Page Start:
- 427
- Page End:
- 443
- Publication Date:
- 2020-08-17
- Subjects:
- COVID-19 -- open science -- clinical data model harmonization -- EHR data -- collaborative analytics -- SARS-CoV-2
Medical informatics -- Periodicals
Information Services -- Periodicals
Medical Informatics -- Periodicals
Médecine -- Informatique -- Périodiques
Informatica
Geneeskunde
Informatique médicale
Computer network resources
Electronic journals
610.285 - Journal URLs:
- http://jamia.bmj.com/ ↗
http://www.jamia.org ↗
http://www.pubmedcentral.nih.gov/tocrender.fcgi?journal=76 ↗
http://www.sciencedirect.com/science/journal/10675027 ↗
http://jamia.oxfordjournals.org/ ↗
http://www.oxfordjournals.org/en/ ↗ - DOI:
- 10.1093/jamia/ocaa196 ↗
- Languages:
- English
- ISSNs:
- 1067-5027
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4689.025000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 22238.xml