Classification of geochemical data based on multivariate statistical analyses: Complementary roles of cluster, principal component, and independent component analyses. (17th March 2017)
- Record Type:
- Journal Article
- Title:
- Classification of geochemical data based on multivariate statistical analyses: Complementary roles of cluster, principal component, and independent component analyses. (17th March 2017)
- Main Title:
- Classification of geochemical data based on multivariate statistical analyses: Complementary roles of cluster, principal component, and independent component analyses
- Authors:
- Iwamori, Hikaru
Yoshida, Kenta
Nakamura, Hitomi
Kuwatani, Tatsu
Hamada, Morihisa
Haraguchi, Satoru
Ueki, Kenta - Abstract:
- Abstract: Identifying the data structure including trends and groups/clusters in geochemical problems is essential to discuss the origin of sources and processes from the observed variability of data. An increasing number and high dimensionality of recent geochemical data require efficient and accurate multivariate statistical analysis methods. In this paper, we show the relationship and complementary roles of k‐means cluster analysis (KCA), principal component analysis (PCA), and independent component analysis (ICA) to capture the true data structure. When the data are preprocessed by primary standardization (i.e., with the zero mean and normalized by the standard deviation), KCA and PCA provide essentially the same results, although the former returns the solution in a discretized space. When the data are preprocessed by whitening (i.e., normalized by eigenvalues along the principal components), KCA and ICA may identify a set of independent trends and groups, irrespective of the amplitude (power) of variance. As an example, basalt isotopic compositions have been analyzed with KCA on the whitened data, demonstrating clear rock type/tectonic occurrence/mantle end‐member discrimination. Therefore, the combination of these methods, particularly KCA on whitened data, is useful to capture and discuss the data structure of various geochemical systems, for which an Excel program is provided. Plain Language Summary: This paper presents a new statistical method that effectivelyAbstract: Identifying the data structure including trends and groups/clusters in geochemical problems is essential to discuss the origin of sources and processes from the observed variability of data. An increasing number and high dimensionality of recent geochemical data require efficient and accurate multivariate statistical analysis methods. In this paper, we show the relationship and complementary roles of k‐means cluster analysis (KCA), principal component analysis (PCA), and independent component analysis (ICA) to capture the true data structure. When the data are preprocessed by primary standardization (i.e., with the zero mean and normalized by the standard deviation), KCA and PCA provide essentially the same results, although the former returns the solution in a discretized space. When the data are preprocessed by whitening (i.e., normalized by eigenvalues along the principal components), KCA and ICA may identify a set of independent trends and groups, irrespective of the amplitude (power) of variance. As an example, basalt isotopic compositions have been analyzed with KCA on the whitened data, demonstrating clear rock type/tectonic occurrence/mantle end‐member discrimination. Therefore, the combination of these methods, particularly KCA on whitened data, is useful to capture and discuss the data structure of various geochemical systems, for which an Excel program is provided. Plain Language Summary: This paper presents a new statistical method that effectively captures the structures of various types of multivariate data (not only geochemical data but also any type of data). The method is based on combinations of k‐means cluster analysis, principal component analysis, and independent component analysis for preprocessed data. The corresponding Excel program file is provided. Key Points: To present statistical methods that effectively capture the structures including trends and groups inherent in geochemical data This approach is useful for any type of geochemical data, and we provide an Excel program file with which the readers can test the method We show how the isotopic compositions of basalts (MORB, OIB, arc, and continental basalts) are looked into with the methods … (more)
- Is Part Of:
- Geochemistry, geophysics, geosystems. Volume 18:Number 3(2017)
- Journal:
- Geochemistry, geophysics, geosystems
- Issue:
- Volume 18:Number 3(2017)
- Issue Display:
- Volume 18, Issue 3 (2017)
- Year:
- 2017
- Volume:
- 18
- Issue:
- 3
- Issue Sort Value:
- 2017-0018-0003-0000
- Page Start:
- 994
- Page End:
- 1012
- Publication Date:
- 2017-03-17
- Subjects:
- multivariate analysis -- cluster analysis -- principal component analysis -- independent component analysis -- mantle -- isotope
Geochemistry -- Periodicals
Geophysics -- Periodicals
Earth sciences -- Periodicals
550.5 - Journal URLs:
- http://g-cubed.org/index.html?ContentPage=main.shtml ↗
http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)1525-2027 ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1002/2016GC006663 ↗
- Languages:
- English
- ISSNs:
- 1525-2027
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4234.930000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 8311.xml