Similar, or dissimilar, that is the question. How different are methods for comparison of compounds similarity?. (October 2020)
- Record Type:
- Journal Article
- Title:
- Similar, or dissimilar, that is the question. How different are methods for comparison of compounds similarity?. (October 2020)
- Main Title:
- Similar, or dissimilar, that is the question. How different are methods for comparison of compounds similarity?
- Authors:
- Rajda, Krzysztof
Podlewska, Sabina - Abstract:
- Graphical abstract: Highlights: Compound similarity assessment constitutes basis for a number of virtual screening protocols. The results of similarity assessment vary depending on the compound representation and metric. Settings for similarity searching tasks should be carefully selected before the running the comparison. Abstract: Comparison of compounds similarity is one of the main strategies of virtual screening protocols. Both similarity and dissimilarity concepts are of great importance during the search for new active compounds. Similarity is important due to the assumption that underlies the process of searching for new drug candidates: structurally similar compounds should induce similar biological response. On the other hand, we are also interested in dissimilarity, as we usually aim to find structurally novel ligands. In the study, we compared several approaches of evaluating compound similarity. Various representations and metrics were applied and we indicated the rate of variation of the results that can occur when shifting from one strategy to another. We compared both general similarity of datasets using different approaches, as well as examined the changes in the set of nearest neighbors when changing one compound representation into another, and the influence of representation/metric settings on the clustering outcome. We hope that the study will be of great help during the preparation of virtual screening experiments, stressing the need for carefulGraphical abstract: Highlights: Compound similarity assessment constitutes basis for a number of virtual screening protocols. The results of similarity assessment vary depending on the compound representation and metric. Settings for similarity searching tasks should be carefully selected before the running the comparison. Abstract: Comparison of compounds similarity is one of the main strategies of virtual screening protocols. Both similarity and dissimilarity concepts are of great importance during the search for new active compounds. Similarity is important due to the assumption that underlies the process of searching for new drug candidates: structurally similar compounds should induce similar biological response. On the other hand, we are also interested in dissimilarity, as we usually aim to find structurally novel ligands. In the study, we compared several approaches of evaluating compound similarity. Various representations and metrics were applied and we indicated the rate of variation of the results that can occur when shifting from one strategy to another. We compared both general similarity of datasets using different approaches, as well as examined the changes in the set of nearest neighbors when changing one compound representation into another, and the influence of representation/metric settings on the clustering outcome. We hope that the study will be of great help during the preparation of virtual screening experiments, stressing the need for careful selection of the way, the compound similarity is assessed. The differences in the results that can be obtained via the application of particular strategy can significantly influence the outcome of comparison studies; therefore, its settings should be carefully selected beforerunning the comparison. … (more)
- Is Part Of:
- Computational biology and chemistry. Volume 88(2020)
- Journal:
- Computational biology and chemistry
- Issue:
- Volume 88(2020)
- Issue Display:
- Volume 88, Issue 2020 (2020)
- Year:
- 2020
- Volume:
- 88
- Issue:
- 2020
- Issue Sort Value:
- 2020-0088-2020-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-10
- Subjects:
- Similarity metric -- Fingerprint -- Clustering -- Virtual screening -- G protein-coupled receptors
Chemistry -- Data processing -- Periodicals
Biology -- Data processing -- Periodicals
Biochemistry -- Data processing
Biology -- Data processing
Molecular biology -- Data processing
Periodicals
Electronic journals
542.85 - Journal URLs:
- http://www.sciencedirect.com/science/journal/14769271 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compbiolchem.2020.107367 ↗
- Languages:
- English
- ISSNs:
- 1476-9271
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3390.576700
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 15501.xml