Adaptive weighted sum tests via LASSO method in multi-locus family-based association analysis. (October 2020)
- Record Type:
- Journal Article
- Title:
- Adaptive weighted sum tests via LASSO method in multi-locus family-based association analysis. (October 2020)
- Main Title:
- Adaptive weighted sum tests via LASSO method in multi-locus family-based association analysis
- Authors:
- Liu, Rui
Yuan, Min
Xu, Huang
Chen, Pinzhong
Xu, Xu Steven
Yang, Yaning - Abstract:
- Graphical abstract: Highlights: Family-based association test (FBAT) are robust against the population stratification. Multi-SNP tests are more powerful than single-SNP methods as taking the covariance structure into consideration. Rare variant analysis is crucial to finding genetic causes for some complex diseases. Weighted sum statisitc using LASSO regression outperforms other tests in various GWAs designs. Abstract: Family based multi-locus tests integrate information from individual loci by weighted averaging of the marginal statistics, and have been proven to be more efficient and robust than the single-locus tests in genetic association studies. The power depends on how much information the weights can extract from data. The currently published weighted sum methods are only applicable to either common or rare variants and may suffer from substantial power loss especially for rare variants. In this paper, we propose a novel data-driven weight to improve the power under both common and rare variant circumstances. We use the l 1 regularization in Least Absolute Shrinkage and Selection Operator (LASSO) regression to construct the weight serving as a simultaneously adaptive marker selection process. Simulations for a dichotomous phenotype demonstrated that our LASSO-based approach outperformed the existing multi-locus methods in the sense of providing the highest statistical power while well controlled type I error rate under different scenarios. We also applied our methodsGraphical abstract: Highlights: Family-based association test (FBAT) are robust against the population stratification. Multi-SNP tests are more powerful than single-SNP methods as taking the covariance structure into consideration. Rare variant analysis is crucial to finding genetic causes for some complex diseases. Weighted sum statisitc using LASSO regression outperforms other tests in various GWAs designs. Abstract: Family based multi-locus tests integrate information from individual loci by weighted averaging of the marginal statistics, and have been proven to be more efficient and robust than the single-locus tests in genetic association studies. The power depends on how much information the weights can extract from data. The currently published weighted sum methods are only applicable to either common or rare variants and may suffer from substantial power loss especially for rare variants. In this paper, we propose a novel data-driven weight to improve the power under both common and rare variant circumstances. We use the l 1 regularization in Least Absolute Shrinkage and Selection Operator (LASSO) regression to construct the weight serving as a simultaneously adaptive marker selection process. Simulations for a dichotomous phenotype demonstrated that our LASSO-based approach outperformed the existing multi-locus methods in the sense of providing the highest statistical power while well controlled type I error rate under different scenarios. We also applied our methods to a real dataset for rheumatoid arthritis (GAW15 Problem 2). Two groups of alleles, in which individual SNPs had only modest and non-significant effects, were detected (P < 0.00001) using our proposed methods, whereas traditional multi-locus methods failed to identify them. In conclusion, the novel LASSO-based approach represents a superior weight-choosing strategy for multi-locus tests. … (more)
- Is Part Of:
- Computational biology and chemistry. Volume 88(2020)
- Journal:
- Computational biology and chemistry
- Issue:
- Volume 88(2020)
- Issue Display:
- Volume 88, Issue 2020 (2020)
- Year:
- 2020
- Volume:
- 88
- Issue:
- 2020
- Issue Sort Value:
- 2020-0088-2020-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-10
- Subjects:
- Family based design -- Genetic association studies -- Multi-locus -- LASSO -- Robust-efficient
Chemistry -- Data processing -- Periodicals
Biology -- Data processing -- Periodicals
Biochemistry -- Data processing
Biology -- Data processing
Molecular biology -- Data processing
Periodicals
Electronic journals
542.85 - Journal URLs:
- http://www.sciencedirect.com/science/journal/14769271 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compbiolchem.2020.107320 ↗
- Languages:
- English
- ISSNs:
- 1476-9271
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3390.576700
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 15506.xml