Sparse exponential family Principal Component Analysis. (December 2016)
- Record Type:
- Journal Article
- Title:
- Sparse exponential family Principal Component Analysis. (December 2016)
- Main Title:
- Sparse exponential family Principal Component Analysis
- Authors:
- Lu, Meng
Huang, Jianhua Z.
Qian, Xiaoning - Abstract:
- Abstract: We propose a Sparse exponential family Principal Component Analysis (SePCA) method suitable for any type of data following exponential family distributions to achieve simultaneous dimension reduction and variable selection for better interpretation of the results. Because of the generality of exponential family distributions, the method can be applied to a wide range of applications, in particular when analyzing high dimensional next-generation sequencing data and genetic mutation data in genomics. The use of sparsity-inducing penalty helps produce sparse principal component loading vectors such that the principal components can focus on informative variables. By using an equivalent dual form of the formulated optimization problem for SePCA, we derive optimal solutions with efficient iterative closed-form updating rules. The results from both simulation experiments and real-world applications have demonstrated the superiority of our SePCA in reconstruction accuracy and computational efficiency over traditional exponential family PCA (ePCA), the existing Sparse PCA (SPCA) and Sparse Logistic PCA (SLPCA) algorithms. Abstract : Highlights: A Sparse exponential family Principal Component Analysis is proposed. It has wide applications to any data following exponential family distributions. Efficient solutions can be achieved with closed-form update rules. The performance is enhanced with appropriate assumption of the data distribution. This model is flexible and highlyAbstract: We propose a Sparse exponential family Principal Component Analysis (SePCA) method suitable for any type of data following exponential family distributions to achieve simultaneous dimension reduction and variable selection for better interpretation of the results. Because of the generality of exponential family distributions, the method can be applied to a wide range of applications, in particular when analyzing high dimensional next-generation sequencing data and genetic mutation data in genomics. The use of sparsity-inducing penalty helps produce sparse principal component loading vectors such that the principal components can focus on informative variables. By using an equivalent dual form of the formulated optimization problem for SePCA, we derive optimal solutions with efficient iterative closed-form updating rules. The results from both simulation experiments and real-world applications have demonstrated the superiority of our SePCA in reconstruction accuracy and computational efficiency over traditional exponential family PCA (ePCA), the existing Sparse PCA (SPCA) and Sparse Logistic PCA (SLPCA) algorithms. Abstract : Highlights: A Sparse exponential family Principal Component Analysis is proposed. It has wide applications to any data following exponential family distributions. Efficient solutions can be achieved with closed-form update rules. The performance is enhanced with appropriate assumption of the data distribution. This model is flexible and highly extensible. … (more)
- Is Part Of:
- Pattern recognition. Volume 60(2016:Dec.)
- Journal:
- Pattern recognition
- Issue:
- Volume 60(2016:Dec.)
- Issue Display:
- Volume 60 (2016)
- Year:
- 2016
- Volume:
- 60
- Issue Sort Value:
- 2016-0060-0000-0000
- Page Start:
- 681
- Page End:
- 691
- Publication Date:
- 2016-12
- Subjects:
- Dimension reduction -- Sparsity -- Exponential family principal component analysis
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2016.05.024 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 747.xml