Reshaped Sequential Replacement for variable selection in QSPR: comparison with other reference methods. (17th February 2014)
- Record Type:
- Journal Article
- Title:
- Reshaped Sequential Replacement for variable selection in QSPR: comparison with other reference methods. (17th February 2014)
- Main Title:
- Reshaped Sequential Replacement for variable selection in QSPR: comparison with other reference methods
- Authors:
- Grisoni, F.
Cassotti, M.
Todeschini, R.
Heberger, Karoly
- Abstract:
- The objective of the present work was to compare the Reshaped Sequential Replacement (RSR) algorithm with other well‐known variable selection techniques in the field of Quantitative Structure–Property Relationship (QSPR) modelling. The RSR algorithm is based on a simple sequential replacement procedure with the addition of several 'reshaping' functions that aim to (i) ensure a faster convergence upon optimal subsets of variables and (ii) reject models affected by chance correlation, overfitting and other pathologies. In particular, three reference variable selection methods were chosen for the comparison (stepwise forward selection, genetic algorithms and particle swarm optimization), aiming to identify benefits and drawbacks of RSR with respect to these methods. To this end, several QSPR datasets regarding different physical–chemical properties and characterized by different objects/variables ratios were used to build ordinary least squares models; in addition, some well‐known (Y‐scrambling) and more recent (R‐based functions) statistical tools were used to analyse and compare the results. The study highlighted the good capability of RSR to find optimal subsets of variables in QSPR modelling, comparable to or better than those found by the other reference variable selection methods. Moreover, RSR proved to be faster than some of the analysed variable selection techniques, despite its extensive exploration of the variable space. Copyright © 2014 John Wiley & Sons, Ltd.
- Is Part Of:
- Journal of chemometrics. Volume 28:Number 4(2014:Apr.)
- Journal:
- Journal of chemometrics
- Issue:
- Volume 28:Number 4(2014:Apr.)
- Issue Display:
- Volume 28, Issue 4 (2014)
- Year:
- 2014
- Volume:
- 28
- Issue:
- 4
- Issue Sort Value:
- 2014-0028-0004-0000
- Page Start:
- 249
- Page End:
- 259
- Publication Date:
- 2014-02-17
- Subjects:
- Chemistry -- Mathematics -- Periodicals
Chemistry -- Statistical methods -- Periodicals
542.85
- Journal URLs:
- http://onlinelibrary.wiley.com/
- DOI:
- 10.1002/cem.2603
- Languages:
- English
- ISSNs:
- 0886-9383
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 4957.380000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 4001.xml