Classification of lignocellulosic biomass by weighted‐covariance factor fuzzy C‐means clustering of mid‐infrared and near‐infrared spectra. (11th January 2017)
- Record Type:
- Journal Article
- Title:
- Classification of lignocellulosic biomass by weighted‐covariance factor fuzzy C‐means clustering of mid‐infrared and near‐infrared spectra. (11th January 2017)
- Main Title:
- Classification of lignocellulosic biomass by weighted‐covariance factor fuzzy C‐means clustering of mid‐infrared and near‐infrared spectra
- Authors:
- Rammal, Abbas
Perrin, Eric
Vrabie, Valeriu
Bertrand, Isabelle
Chabbert, Brigitte - Abstract:
- Abstract : The analysis of lignocellulosic materials is crucial to optimizing the conversion efficiencies in biorefineries and to studying crop residue input to soil nutrient cycles. Mid‐infrared (MIR) and near‐infrared (NIR) spectroscopies are rapid, simple, and nondestructive methods for the determination of biomass compositions. However, the analysis of a small set of plant biomass is not generally possible with conventional methods of data processing, such as partial least squares. Additionally, IR spectra do not distribute spherically in the data space. To overcome these problems, we propose a weighted‐covariance factor fuzzy C‐means clustering method combined with bootstrapping. The algorithm can classify spherical and nonspherical clusters, in contrast to classic fuzzy C‐means, which is only adapted to spherical clusters. Bootstrapping enables resampling of the available spectra to generate several datasets on which the classification is performed. This unsupervised clustering methodology was tested to classify a small set of maize roots in soil according to genotype or period of their biodegradation process based on their NIR and MIR spectra. This methodology is applied to determine the optimal pretreatment of IR spectra, to study the contribution of the combination of MIR and NIR spectra and to compare the results on spectral and chemical data. The results show that the best methods of pretreatment are the first‐order Savitzky‐Golay derivative followed by standardAbstract : The analysis of lignocellulosic materials is crucial to optimizing the conversion efficiencies in biorefineries and to studying crop residue input to soil nutrient cycles. Mid‐infrared (MIR) and near‐infrared (NIR) spectroscopies are rapid, simple, and nondestructive methods for the determination of biomass compositions. However, the analysis of a small set of plant biomass is not generally possible with conventional methods of data processing, such as partial least squares. Additionally, IR spectra do not distribute spherically in the data space. To overcome these problems, we propose a weighted‐covariance factor fuzzy C‐means clustering method combined with bootstrapping. The algorithm can classify spherical and nonspherical clusters, in contrast to classic fuzzy C‐means, which is only adapted to spherical clusters. Bootstrapping enables resampling of the available spectra to generate several datasets on which the classification is performed. This unsupervised clustering methodology was tested to classify a small set of maize roots in soil according to genotype or period of their biodegradation process based on their NIR and MIR spectra. This methodology is applied to determine the optimal pretreatment of IR spectra, to study the contribution of the combination of MIR and NIR spectra and to compare the results on spectral and chemical data. The results show that the best methods of pretreatment are the first‐order Savitzky‐Golay derivative followed by standard normal variate. The MIR spectra produce a better result than NIR spectra for the initial characterization and for dynamic samples, while MIR spectra acquired on raw samples, without soluble extraction, provided better classification than wet chemistry. Abstract : A weighted‐covariance factor fuzzy C‐means clustering method combined with bootstrapping is proposed to classify small datasets of infrared spectra of lignocellulosic biomass. This unsupervised method allowed to classify maize roots according to genotype or period of decomposition in soil. The results show that mid‐infrared spectra clustering was generally more efficient than near‐infrared spectra clustering, and mid‐infrared acquired on raw samples (without soluble extraction) provided better classification than wet chemistry. … (more)
- Is Part Of:
- Journal of chemometrics. Volume 31:Number 2(2017)
- Journal:
- Journal of chemometrics
- Issue:
- Volume 31:Number 2(2017)
- Issue Display:
- Volume 31, Issue 2 (2017)
- Year:
- 2017
- Volume:
- 31
- Issue:
- 2
- Issue Sort Value:
- 2017-0031-0002-0000
- Page Start:
- n/a
- Page End:
- n/a
- Publication Date:
- 2017-01-11
- Subjects:
- bootstrapping -- covariance‐based weight -- fuzzy C‐means clustering -- infrared spectroscopy -- lignocellulose -- soil -- unsupervised classification
Chemistry -- Mathematics -- Periodicals
Chemistry -- Statistical methods -- Periodicals
542.85 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/cem.2865 ↗
- Languages:
- English
- ISSNs:
- 0886-9383
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4957.380000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 1625.xml