The DataHarmonizer: a tool for faster data harmonization, validation, aggregation and analysis of pathogen genomics contextual information. Issue 1 (23rd January 2023)
- Record Type:
- Journal Article
- Title:
- The DataHarmonizer: a tool for faster data harmonization, validation, aggregation and analysis of pathogen genomics contextual information. Issue 1 (23rd January 2023)
- Main Title:
- The DataHarmonizer: a tool for faster data harmonization, validation, aggregation and analysis of pathogen genomics contextual information
- Authors:
- Gill, Ivan S.
Griffiths, Emma J.
Dooley, Damion
Cameron, Rhiannon
Savić Kallesøe, Sarah
John, Nithu Sara
Sehar, Anoosha
Gosal, Gurinder
Alexander, David
Chapel, Madison
Croxen, Matthew A.
Delisle, Benjamin
Di Tullio, Rachelle
Gaston, Daniel
Duggan, Ana
Guthrie, Jennifer L.
Horsman, Mark
Joshi, Esha
Kearny, Levon
Knox, Natalie
Lau, Lynette
LeBlanc, Jason J.
Li, Vincent
Lyons, Pierre
MacKenzie, Keith
McArthur, Andrew G.
Panousis, Emily M.
Palmer, John
Prystajecky, Natalie
Smith, Kerri N.
Tanner, Jennifer
Townend, Christopher
Tyler, Andrea
Van Domselaar, Gary
Hsiao, William W. L.
- Abstract:
- Pathogen genomics is a critical tool for public health surveillance, infection control, outbreak investigations as well as research. In order to make use of pathogen genomics data, they must be interpreted using contextual data (metadata). Contextual data include sample metadata, laboratory methods, patient demographics, clinical outcomes and epidemiological information. However, the variability in how contextual information is captured by different authorities and how it is encoded in different databases poses challenges for data interpretation, integration and their use/re-use. The DataHarmonizer is a template-driven spreadsheet application for harmonizing, validating and transforming genomics contextual data into submission-ready formats for public or private repositories. The tool's web browser-based JavaScript environment enables validation and its offline functionality and local installation increases data security. The DataHarmonizer was developed to address the data sharing needs that arose during the COVID-19 pandemic, and was used by members of the Canadian COVID Genomics Network (CanCOGeN) to harmonize SARS-CoV-2 contextual data for national surveillance and for public repository submission. In order to support coordination of international surveillance efforts, we have partnered with the Public Health Alliance for Genomic Epidemiology to also provide a template conforming to its SARS-CoV-2 contextual data specification for use worldwide. Templates are also being developed for One Health and foodborne pathogens. Overall, the DataHarmonizer tool improves the effectiveness and fidelity of contextual data capture as well as its subsequent usability. Harmonization of contextual information across authorities, platforms and systems globally improves interoperability and reusability of data for concerted public health and research initiatives to fight the current pandemic and future public health emergencies. While initially developed for the COVID-19 pandemic, its expansion to other data management applications and pathogens is already underway.
- Is Part Of:
- Microbial genomics. Volume 9:Issue 1(2023)
- Journal:
- Microbial genomics
- Issue:
- Volume 9:Issue 1(2023)
- Issue Display:
- Volume 9, Issue 1 (2023)
- Year:
- 2023
- Volume:
- 9
- Issue:
- 1
- Issue Sort Value:
- 2023-0009-0001-0000
- Page Start:
- Page End:
- Publication Date:
- 2023-01-23
- Subjects:
- contextual data -- data management -- genomic surveillance -- harmonization -- metadata
Microbial genomics -- Periodicals
572.8629
- Journal URLs:
- https://www.microbiologyresearch.org/content/journal/mgen
- DOI:
- 10.1099/mgen.0.000908
- Languages:
- English
- ISSNs:
- 2057-5858
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library HMNTS - ELD Digital store
- Ingest File:
- 25135.xml