A comprehensive and high-quality collection of Escherichia coli genomes and their genes. Issue 2 (8th February 2021)
- Record Type:
- Journal Article
- Title:
- A comprehensive and high-quality collection of Escherichia coli genomes and their genes. Issue 2 (8th February 2021)
- Main Title:
- A comprehensive and high-quality collection of Escherichia coli genomes and their genes
- Authors:
- Horesh, Gal
Blackwell, Grace A.
Tonkin-Hill, Gerry
Corander, Jukka
Heinz, Eva
Thomson, Nicholas R. - Abstract:
- Abstract : Escherichia coli is a highly diverse organism that includes a range of commensal and pathogenic variants found across a range of niches and worldwide. In addition to causing severe intestinal and extraintestinal disease, E. coli is considered a priority pathogen due to high levels of observed drug resistance. The diversity in the E. coli population is driven by high genome plasticity and a very large gene pool. All these have made E. coli one of the most well-studied organisms, as well as a commonly used laboratory strain. Today, there are thousands of sequenced E. coli genomes stored in public databases. While data is widely available, accessing the information in order to perform analyses can still be a challenge. Collecting relevant available data requires accessing different sources, where data may be stored in a range of formats, and often requires further manipulation and processing to apply various analyses and extract useful information. In this study, we collated and intensely curated a collection of over 10 000 E. coli and Shigella genomes to provide a single, uniform, high-quality dataset. Shigella were included as they are considered specialized pathovars of E. coli . We provide these data in a number of easily accessible formats that can be used as the foundation for future studies addressing the biological differences between E. coli lineages and the distribution and flow of genes in the E. coli population at a high resolution. The analysis weAbstract : Escherichia coli is a highly diverse organism that includes a range of commensal and pathogenic variants found across a range of niches and worldwide. In addition to causing severe intestinal and extraintestinal disease, E. coli is considered a priority pathogen due to high levels of observed drug resistance. The diversity in the E. coli population is driven by high genome plasticity and a very large gene pool. All these have made E. coli one of the most well-studied organisms, as well as a commonly used laboratory strain. Today, there are thousands of sequenced E. coli genomes stored in public databases. While data is widely available, accessing the information in order to perform analyses can still be a challenge. Collecting relevant available data requires accessing different sources, where data may be stored in a range of formats, and often requires further manipulation and processing to apply various analyses and extract useful information. In this study, we collated and intensely curated a collection of over 10 000 E. coli and Shigella genomes to provide a single, uniform, high-quality dataset. Shigella were included as they are considered specialized pathovars of E. coli . We provide these data in a number of easily accessible formats that can be used as the foundation for future studies addressing the biological differences between E. coli lineages and the distribution and flow of genes in the E. coli population at a high resolution. The analysis we present emphasizes our lack of understanding of the true diversity of the E. coli species, and the biased nature of our current understanding of the genetic diversity of such a key pathogen. … (more)
- Is Part Of:
- Microbial genomics. Volume 7:Issue 2(2021)
- Journal:
- Microbial genomics
- Issue:
- Volume 7:Issue 2(2021)
- Issue Display:
- Volume 7, Issue 2 (2021)
- Year:
- 2021
- Volume:
- 7
- Issue:
- 2
- Issue Sort Value:
- 2021-0007-0002-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-02-08
- Subjects:
- antimicrobial resistance -- Escherichia coli -- horizontal gene transfer -- pan-genome -- Shigella
Microbial genomics -- Periodicals
572.8629 - Journal URLs:
- https://www.microbiologyresearch.org/content/journal/mgen ↗
- DOI:
- 10.1099/mgen.0.000499 ↗
- Languages:
- English
- ISSNs:
- 2057-5858
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library HMNTS - ELD Digital store
- Ingest File:
- 15935.xml