Machine learning prediction of 3CLpro SARS-CoV-2 docking scores. (June 2022)
- Record Type:
- Journal Article
- Title:
- Machine learning prediction of 3CLpro SARS-CoV-2 docking scores. (June 2022)
- Main Title:
- Machine learning prediction of 3CLpro SARS-CoV-2 docking scores
- Authors:
- Bucinsky, Lukas
Bortňák, Dušan
Gall, Marián
Matúška, Ján
Milata, Viktor
Pitoňák, Michal
Štekláč, Marek
Végh, Daniel
Zajaček, Dávid - Abstract:
- Abstract: Molecular docking results of two training sets containing 866 and 8, 696 compounds were used to train three different machine learning (ML) approaches. Neural network approaches according to Keras and TensorFlow libraries and the gradient boosted decision trees approach of XGBoost were used with DScribe's Smooth Overlap of Atomic Positions molecular descriptors. In addition, neural networks using the SchNetPack library and descriptors were used. The ML performance was tested on three different sets, including compounds for future organic synthesis. The final evaluation of the ML predicted docking scores was based on the ZINC in vivo set, from which 1, 200 compounds were randomly selected with respect to their size. The results obtained showed a consistent ML prediction capability of docking scores, and even though compounds with more than 60 atoms were found slightly overestimated they remain valid for a subsequent evaluation of their drug repurposing suitability. Graphical abstract: ga1 Highlights: AutoDock docking scores of 12, 000 compounds are obtained for 3CLpro (6WQF). Machine learning (ML) is based on TensorFlow, XGBoost and SchNetPack libraries. DScribe and SchNet molecular descriptors use xyz/mol2 file formats. Predictions of docking scores tend to be overestimated for large compounds. Further improvement of ML models is possible for better tuned train sets.
- Is Part Of:
- Computational biology and chemistry. Volume 98(2022)
- Journal:
- Computational biology and chemistry
- Issue:
- Volume 98(2022)
- Issue Display:
- Volume 98, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 98
- Issue:
- 2022
- Issue Sort Value:
- 2022-0098-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-06
- Subjects:
- AutoDock molecular docking -- 3CLpro Mpro 6WQF -- Machine learning -- TensorFlow XGBoost SchNetPack -- COVID19 -- SARS-CoV-2
Chemistry -- Data processing -- Periodicals
Biology -- Data processing -- Periodicals
Biochemistry -- Data processing
Biology -- Data processing
Molecular biology -- Data processing
Periodicals
Electronic journals
542.85 - Journal URLs:
- http://www.sciencedirect.com/science/journal/14769271 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compbiolchem.2022.107656 ↗
- Languages:
- English
- ISSNs:
- 1476-9271
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3390.576700
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 21597.xml