Syntenic block overlap multiplicities with a panel of reference genomes provide a signature of ancient polyploidization events. (December 2015)
- Record Type:
- Journal Article
- Title:
- Syntenic block overlap multiplicities with a panel of reference genomes provide a signature of ancient polyploidization events. (December 2015)
- Main Title:
- Syntenic block overlap multiplicities with a panel of reference genomes provide a signature of ancient polyploidization events
- Authors:
- Zheng, Chunfang
Santos Muñoz, Daniella
Albert, Victor
Sankoff, David - Abstract:
- Abstract Background Following whole genome duplication (WGD), there is a compact distribution of gene similarities within the genome reflecting duplicate pairs of all the genes in the genome. With time, the distribution broadens and loses volume due to variable decay of duplicate gene similarity and to the process of duplicate gene loss. If there are two WGD, the older one becomes so reduced and broad that it merges with the tail of the distributions resulting from more recent events, and it becomes difficult to distinguish them. The goal of this paper is to advance statistical methods of identifying, or at least counting, the WGD events in the lineage of a given genome. Methods For a set of 15 angiosperm genomes, we analyze all 15× 14 = 210 ordered pairs oftarget genome versus reference genome, using SynMap to find syntenic blocks. We consider all sets ofB ≥ 2 syntenic blocks in the target genome that overlap in the reference genome as evidence of WGD activity in the target, whether it be one event or several. We hypothesize that in fitting an exponential function to the tail of the empirical distributionf (B ) of block multiplicities, the size of the exponent will reflect the amount of WGD in the history of the target genome. Results By amalgamating the results from all reference genomes, a range of values of SynMap parameters, and alternative cutoff points for the tail, we find a clear pattern whereby multiple-WGD core eudicots have the smallest (negative) exponents,Abstract Background Following whole genome duplication (WGD), there is a compact distribution of gene similarities within the genome reflecting duplicate pairs of all the genes in the genome. With time, the distribution broadens and loses volume due to variable decay of duplicate gene similarity and to the process of duplicate gene loss. If there are two WGD, the older one becomes so reduced and broad that it merges with the tail of the distributions resulting from more recent events, and it becomes difficult to distinguish them. The goal of this paper is to advance statistical methods of identifying, or at least counting, the WGD events in the lineage of a given genome. Methods For a set of 15 angiosperm genomes, we analyze all 15× 14 = 210 ordered pairs oftarget genome versus reference genome, using SynMap to find syntenic blocks. We consider all sets ofB ≥ 2 syntenic blocks in the target genome that overlap in the reference genome as evidence of WGD activity in the target, whether it be one event or several. We hypothesize that in fitting an exponential function to the tail of the empirical distributionf (B ) of block multiplicities, the size of the exponent will reflect the amount of WGD in the history of the target genome. Results By amalgamating the results from all reference genomes, a range of values of SynMap parameters, and alternative cutoff points for the tail, we find a clear pattern whereby multiple-WGD core eudicots have the smallest (negative) exponents, followed by core eudicots with only the single "γ " triplication in their history, followed by a non-core eudicot with a single WGD, followed by the monocots, with a basal angiosperm, the WGD-freeAmborella having the largest exponent. Conclusion The hypothesis that the exponent of the fit to the tail of the multiplicity distribution is a signature of the amount of WGD is verified, but there is also a clear complicating factor in the monocot clade, where a history of multiple WGD is not reflected in a small exponent. … (more)
- Is Part Of:
- BMC genomics. Volume 16:Number 10(2015)
- Journal:
- BMC genomics
- Issue:
- Volume 16:Number 10(2015)
- Issue Display:
- Volume 16, Issue 10 (2015)
- Year:
- 2015
- Volume:
- 16
- Issue:
- 10
- Issue Sort Value:
- 2015-0016-0010-0000
- Page Start:
- 1
- Page End:
- 6
- Publication Date:
- 2015-12
- Subjects:
- whole genome duplication -- angiosperms -- mixture of distributions
Genomes -- Periodicals
Gene mapping -- Periodicals
Genomics -- Periodicals
Base Sequence -- Periodicals
Chromosome Mapping -- Periodicals
Genetic Techniques -- Periodicals
Sequence Analysis, DNA -- Periodicals
572.8605 - Journal URLs:
- http://www.biomedcentral.com/bmcgenomics/ ↗
http://www.pubmedcentral.nih.gov/tocrender.fcgi?journal=32 ↗
http://link.springer.com/ ↗ - DOI:
- 10.1186/1471-2164-16-S10-S8 ↗
- Languages:
- English
- ISSNs:
- 1471-2164
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 9854.xml