Study of the optimum haplotype length to build genomic relationship matrices. Issue 1 (December 2016)
- Record Type:
- Journal Article
- Title:
- Study of the optimum haplotype length to build genomic relationship matrices. Issue 1 (December 2016)
- Main Title:
- Study of the optimum haplotype length to build genomic relationship matrices
- Authors:
- Ferdosi, Mohammad
Henshall, John
Tier, Bruce - Abstract:
- Abstract Background As genomic data becomes more abundant, genomic prediction is more routinely used to estimate breeding values. In genomic prediction, the relationship matrix ( $${\mathbf{A}}$$ A ), which is traditionally used in genetic evaluations is replaced by the genomic relationship matrix ( $${\mathbf{G}}$$ G ). This paper considers alternative ways of building relationship matrices either using single markers or haplotypes of different lengths. We compared the prediction accuracies and log-likelihoods when using these alternative relationship matrices and the traditional $${\mathbf{G}}$$ G matrix, for real and simulated data. Methods For real data, we built relationship matrices using 50k genotype data for a population of Brahman cattle to analyze three traits: scrotal circumference (SC), age at puberty (AGECL) and weight at first corpus luteum (WTCL). Haplotypes were phased with hsphase and imputed with BEAGLE. The relationship matrices were built using three methods based on haplotypes of different lengths. The log-likelihood was considered to define the optimum haplotype lengths for each trait and each haplotype-based relationship matrix. Results Based on simulated data, we showed that the inverse of $${\mathbf{G}}$$ G matrix and the inverse of the haplotype relationship matrices for methods using one-single nucleotide polymorphism (SNP) phased haplotypes provided coefficients of determination (R2 ) close to 1, although the estimated genetic variances differedAbstract Background As genomic data becomes more abundant, genomic prediction is more routinely used to estimate breeding values. In genomic prediction, the relationship matrix ( $${\mathbf{A}}$$ A ), which is traditionally used in genetic evaluations is replaced by the genomic relationship matrix ( $${\mathbf{G}}$$ G ). This paper considers alternative ways of building relationship matrices either using single markers or haplotypes of different lengths. We compared the prediction accuracies and log-likelihoods when using these alternative relationship matrices and the traditional $${\mathbf{G}}$$ G matrix, for real and simulated data. Methods For real data, we built relationship matrices using 50k genotype data for a population of Brahman cattle to analyze three traits: scrotal circumference (SC), age at puberty (AGECL) and weight at first corpus luteum (WTCL). Haplotypes were phased with hsphase and imputed with BEAGLE. The relationship matrices were built using three methods based on haplotypes of different lengths. The log-likelihood was considered to define the optimum haplotype lengths for each trait and each haplotype-based relationship matrix. Results Based on simulated data, we showed that the inverse of $${\mathbf{G}}$$ G matrix and the inverse of the haplotype relationship matrices for methods using one-single nucleotide polymorphism (SNP) phased haplotypes provided coefficients of determination (R2 ) close to 1, although the estimated genetic variances differed across methods. Using real data and multiple SNPs in the haplotype segments to build the relationship matrices provided better results than the $${\mathbf{G}}$$ G matrix based on one-SNP haplotypes. However, the optimal haplotype length to achieve the highest log-likelihood depended on the method used and the trait. The optimal haplotype length (7 to 8 SNPs) was similar for SC and AGECL. One of the haplotype-based methods achieved the largest increase in log-likelihood for SC, i.e. from −1330 when using $${\mathbf{G}}$$ G to −1325 when using haplotypes with eight SNPs. Conclusions Building the relationship matrix by using haplotypes that comprise multiple SNPs will increase the accuracy of estimated breeding values. However, the optimum haplotype length that shows the correct relationship among individuals for each trait can be derived from the data. … (more)
- Is Part Of:
- Genetics, selection, evolution. Volume 48:Issue 1(2016)
- Journal:
- Genetics, selection, evolution
- Issue:
- Volume 48:Issue 1(2016)
- Issue Display:
- Volume 48, Issue 1 (2016)
- Year:
- 2016
- Volume:
- 48
- Issue:
- 1
- Issue Sort Value:
- 2016-0048-0001-0000
- Page Start:
- 1
- Page End:
- 14
- Publication Date:
- 2016-12
- Subjects:
- Livestock -- Breeding -- Periodicals
Animal genetics -- Periodicals
Livestock -- Genetics -- Periodicals
Evolution -- Periodicals
576.505 - Journal URLs:
- http://www.edpsciences.com/docinfos/INRA-GENETICS/ ↗
http://www.gsejournal.org/ ↗
http://www.pubmedcentral.nih.gov/tocrender.fcgi?action=archive&journal=847 ↗
http://link.springer.com/ ↗ - DOI:
- 10.1186/s12711-016-0253-6 ↗
- Languages:
- English
- ISSNs:
- 1297-9686
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 10184.xml