A feature-based soft sensor for spectroscopic data analysis. (June 2019)
- Record Type:
- Journal Article
- Title:
- A feature-based soft sensor for spectroscopic data analysis. (June 2019)
- Main Title:
- A feature-based soft sensor for spectroscopic data analysis
- Authors:
- Shah, Devarshi
Wang, Jin
He, Q. Peter - Abstract:
- Highlights: Variable selection removes wavelengths from modeling; SPA extracts features without excluding any wavelength from modeling. SPA outperforms full PLS, Lasso and synergy interval PLS (SiPLS) based soft sensors in terms of accuracy, robustness and bias. SPA performs especially better at extreme or boundary regions where the number of samples are usually fewer than other regions. Explored potential of kernel PLS based soft sensor approaches and discuss their pros and cons. MCVT procedure and three performance indices proposed for consistent and fair comparison of different methods across different data sets. Abstract: In the last few decades, spectroscopic techniques such as near-infrared (NIR) and UV/vis spectroscopies have gained wide applications. As a result, various soft sensors have been developed to predict sample properties from its spectroscopic readings. Because the readings at different wavelengths are highly correlated, it has been shown that variable selection could significantly improve a soft sensor's prediction performance and reduce the model complexity. Currently, almost all variable selection methods focus on how to select the variables (i.e., wavelengths or wavelength segments) that are strongly correlated with the dependent variable to improve the prediction performance. Although many successful applications have been reported, such variable selection methods do have their limitations, such as high sensitivity to the choice of training data, andHighlights: Variable selection removes wavelengths from modeling; SPA extracts features without excluding any wavelength from modeling. SPA outperforms full PLS, Lasso and synergy interval PLS (SiPLS) based soft sensors in terms of accuracy, robustness and bias. SPA performs especially better at extreme or boundary regions where the number of samples are usually fewer than other regions. Explored potential of kernel PLS based soft sensor approaches and discuss their pros and cons. MCVT procedure and three performance indices proposed for consistent and fair comparison of different methods across different data sets. Abstract: In the last few decades, spectroscopic techniques such as near-infrared (NIR) and UV/vis spectroscopies have gained wide applications. As a result, various soft sensors have been developed to predict sample properties from its spectroscopic readings. Because the readings at different wavelengths are highly correlated, it has been shown that variable selection could significantly improve a soft sensor's prediction performance and reduce the model complexity. Currently, almost all variable selection methods focus on how to select the variables (i.e., wavelengths or wavelength segments) that are strongly correlated with the dependent variable to improve the prediction performance. Although many successful applications have been reported, such variable selection methods do have their limitations, such as high sensitivity to the choice of training data, and deteriorated performance when testing on new samples. One possible reason is the removal of useful wavelengths or segments of wavelengths during the calibration process, which could be "tilted" to overfit or capture the noise or unknown disturbances contained in the calibration data. As a result, the model prediction performance may deteriorate significantly when the model is extrapolated or applied to new samples. To address this limitation, we propose a feature-based soft sensor approach utilizing statistics pattern analysis (SPA). Instead of selecting certain wavelengths or wavelength segments, the SPA-based method considers the whole spectrum which is divided into segments, and extracts different features over each spectrum segment to build the soft sensor. In other words, the SPA model contains the complete information from the full spectrum without any selection or removal, which we believe is the main reason for the high robustness of the SPA-based method. In addition, we propose a Monte Carlo validation and testing (MCVT) procedure and three MCVT-based performance indices for consistent and fair comparison of different soft sensor methods across different datasets. The MCVT procedure and indices are generally applicable for model comparison in other applications. Four case studies are presented to demonstrate the performance of the feature-based soft sensor and to compare it with a full partial least squares (PLS), a least absolute shrinkage and selection operator (Lasso), and a synergy interval PLS (SiPLS) based models following the proposed MCVT procedure. In addition, we examine the potential of kernel PLS (KPLS) based soft sensor approaches, examine their performances, and discuss their pros and cons. … (more)
- Is Part Of:
- Journal of process control. Volume 78(2019)
- Journal:
- Journal of process control
- Issue:
- Volume 78(2019)
- Issue Display:
- Volume 78, Issue 2019 (2019)
- Year:
- 2019
- Volume:
- 78
- Issue:
- 2019
- Issue Sort Value:
- 2019-0078-2019-0000
- Page Start:
- 98
- Page End:
- 107
- Publication Date:
- 2019-06
- Subjects:
- Soft sensor -- Variable selection -- Multivariate regression -- Partial least squares -- Kernel partial least squares -- Statistics pattern analysis -- NIR -- UV/Vis -- Chemometrics
Process control -- Periodicals
Fabrication -- Contrôle -- Périodiques
Process control
Periodicals
Electronic journals
660.281 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09591524 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.jprocont.2019.03.016 ↗
- Languages:
- English
- ISSNs:
- 0959-1524
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 5042.645000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 10742.xml