Predicting persistence in the sediment compartment with a new automatic software based on the k-Nearest Neighbor (k-NN) algorithm. (February 2016)
- Record Type:
- Journal Article
- Title:
- Predicting persistence in the sediment compartment with a new automatic software based on the k-Nearest Neighbor (k-NN) algorithm. (February 2016)
- Main Title:
- Predicting persistence in the sediment compartment with a new automatic software based on the k-Nearest Neighbor (k-NN) algorithm
- Authors:
- Manganaro, Alberto
Pizzo, Fabiola
Lombardo, Anna
Pogliaghi, Alberto
Benfenati, Emilio - Abstract:
- Abstract: The ability of a substance to resist degradation and persist in the environment needs to be readily identified in order to protect the environment and human health. Many regulations require the assessment of persistence for substances commonly manufactured and marketed. Besides laboratory-based testing methods, in silico tools may be used to obtain a computational prediction of persistence. We present a new program to develop k -Nearest Neighbor ( k- NN) models. The k- NN algorithm is a similarity-based approach that predicts the property of a substance in relation to the experimental data for its most similar compounds. We employed this software to identify persistence in the sediment compartment. Data on half-life (HL) in sediment were obtained from different sources and, after careful data pruning the final dataset, containing 297 organic compounds, was divided into four experimental classes. We developed several models giving satisfactory performances, considering that both the training and test set accuracy ranged between 0.90 and 0.96. We finally selected one model which will be made available in the near future in the freely available software platform VEGA. This model offers a valuable in silico tool that may be really useful for fast and inexpensive screening. Highlights: A software for building k -NN models is presented. The software have been used for modeling half-life in sediment. The best model obtained has good performance and can be used forAbstract: The ability of a substance to resist degradation and persist in the environment needs to be readily identified in order to protect the environment and human health. Many regulations require the assessment of persistence for substances commonly manufactured and marketed. Besides laboratory-based testing methods, in silico tools may be used to obtain a computational prediction of persistence. We present a new program to develop k -Nearest Neighbor ( k- NN) models. The k- NN algorithm is a similarity-based approach that predicts the property of a substance in relation to the experimental data for its most similar compounds. We employed this software to identify persistence in the sediment compartment. Data on half-life (HL) in sediment were obtained from different sources and, after careful data pruning the final dataset, containing 297 organic compounds, was divided into four experimental classes. We developed several models giving satisfactory performances, considering that both the training and test set accuracy ranged between 0.90 and 0.96. We finally selected one model which will be made available in the near future in the freely available software platform VEGA. This model offers a valuable in silico tool that may be really useful for fast and inexpensive screening. Highlights: A software for building k -NN models is presented. The software have been used for modeling half-life in sediment. The best model obtained has good performance and can be used for reliable predictions. … (more)
- Is Part Of:
- Chemosphere. Volume 144(2016)
- Journal:
- Chemosphere
- Issue:
- Volume 144(2016)
- Issue Display:
- Volume 144, Issue 2016 (2016)
- Year:
- 2016
- Volume:
- 144
- Issue:
- 2016
- Issue Sort Value:
- 2016-0144-2016-0000
- Page Start:
- 1624
- Page End:
- 1630
- Publication Date:
- 2016-02
- Subjects:
- Persistence -- Half-life -- Sediment -- PBT -- In silico -- Read across
Pollution -- Periodicals
Pollution -- Physiological effect -- Periodicals
Environmental sciences -- Periodicals
Atmospheric chemistry -- Periodicals
551.511 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00456535/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.chemosphere.2015.10.054 ↗
- Languages:
- English
- ISSNs:
- 0045-6535
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3172.280000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 5056.xml