LocStra: Fast analysis of regional/global stratification in whole‐genome sequencing studies. Issue 1 (14th September 2020)
- Record Type:
- Journal Article
- Title:
- LocStra: Fast analysis of regional/global stratification in whole‐genome sequencing studies. Issue 1 (14th September 2020)
- Main Title:
- LocStra: Fast analysis of regional/global stratification in whole‐genome sequencing studies
- Authors:
- Hahn, Georg
Lutz, Sharon M.
Hecker, Julian
Prokopenko, Dmitry
Cho, Michael H.
Silverman, Edwin K.
Weiss, Scott T.
Lange, Christoph
- Abstract:
- locStra is an R‐package for the analysis of regional and global population stratification in whole‐genome sequencing (WGS) studies, where regional stratification refers to the substructure defined by the loci in a particular region of the genome. Population substructure can be assessed based on the genetic covariance matrix, the genomic relationship matrix, and the unweighted/weighted genetic Jaccard similarity matrix. Using a sliding window approach, the regional similarity matrices are compared with the global ones, based on user‐defined window sizes and metrics, for example, the correlation between regional and global eigenvectors. An algorithm for the specification of the window size is provided. As the implementation fully exploits sparse matrix algebra and is written in C++, the analysis is highly efficient. Even on single cores, for realistic study sizes (several thousand subjects, several million rare variants per subject), the runtime for the genome‐wide computation of all regional similarity matrices typically does not exceed one hour, enabling an unprecedented investigation of regional stratification across the entire genome. The package is applied to three WGS studies, illustrating the varying patterns of regional substructure across the genome and its beneficial effects on association testing.
- Is Part Of:
- Genetic epidemiology. Volume 45:Issue 1(2021)
- Journal:
- Genetic epidemiology
- Issue:
- Volume 45:Issue 1(2021)
- Issue Display:
- Volume 45, Issue 1 (2021)
- Year:
- 2021
- Volume:
- 45
- Issue:
- 1
- Issue Sort Value:
- 2021-0045-0001-0000
- Page Start:
- 82
- Page End:
- 98
- Publication Date:
- 2020-09-14
- Subjects:
- regional analysis -- population stratification -- population substructure -- similarity matrix -- whole‐genome sequencing
Genetic epidemiology -- Periodicals
Heredity -- Periodicals
Medical geography -- Periodicals
614
- Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)1098-2272
http://onlinelibrary.wiley.com/
- DOI:
- 10.1002/gepi.22356
- Languages:
- English
- ISSNs:
- 0741-0395
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 4111.848000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 23100.xml