Benchmarking Accuracy and Generalizability of Four Graph Neural Networks Using Large In Vitro ADME Datasets from Different Chemical Spaces. Issue 8 (23rd February 2022)
- Record Type:
- Journal Article
- Title:
- Benchmarking Accuracy and Generalizability of Four Graph Neural Networks Using Large In Vitro ADME Datasets from Different Chemical Spaces. Issue 8 (23rd February 2022)
- Main Title:
- Benchmarking Accuracy and Generalizability of Four Graph Neural Networks Using Large In Vitro ADME Datasets from Different Chemical Spaces
- Authors:
- Broccatelli, Fabio
Trager, Richard
Reutlinger, Michael
Karypis, George
Li, Mufei - Abstract:
- Abstract: In this work, we benchmark a variety of single‐ and multi‐task graph neural network (GNN) models against lower‐bar and higher‐bar traditional machine learning approaches employing human engineered molecular features. We consider four GNN variants – Graph Convolutional Network (GCN), Graph Attention Network (GAT), Message Passing Neural Network (MPNN), and Attentive Fingerprint (AttentiveFP). So far deep learning models have been primarily benchmarked using lower‐bar traditional models solely based on fingerprints, while more realistic benchmarks employing fingerprints, whole‐molecule descriptors and predictions from other related endpoints (e. g., LogD7.4) appear to be scarce for industrial ADME datasets. In addition to time‐split test sets based on Genentech data, this study benefits from the availability of measurements from an external chemical space (Roche data). We identify GAT as a promising approach to implementing deep learning models. While all the deep learning models significantly outperform lower‐bar benchmark traditional models solely based on fingerprints, only GATs seem to offer a small but consistent improvement over higher‐bar benchmark traditional models. Finally, the accuracy of in vitro assays from different laboratories predicting the same experimental endpoints appears to be comparable with the accuracy of GAT single‐task models, suggesting that most of the observed error from the models is a function of the experimental error propagation.Abstract: In this work, we benchmark a variety of single‐ and multi‐task graph neural network (GNN) models against lower‐bar and higher‐bar traditional machine learning approaches employing human engineered molecular features. We consider four GNN variants – Graph Convolutional Network (GCN), Graph Attention Network (GAT), Message Passing Neural Network (MPNN), and Attentive Fingerprint (AttentiveFP). So far deep learning models have been primarily benchmarked using lower‐bar traditional models solely based on fingerprints, while more realistic benchmarks employing fingerprints, whole‐molecule descriptors and predictions from other related endpoints (e. g., LogD7.4) appear to be scarce for industrial ADME datasets. In addition to time‐split test sets based on Genentech data, this study benefits from the availability of measurements from an external chemical space (Roche data). We identify GAT as a promising approach to implementing deep learning models. While all the deep learning models significantly outperform lower‐bar benchmark traditional models solely based on fingerprints, only GATs seem to offer a small but consistent improvement over higher‐bar benchmark traditional models. Finally, the accuracy of in vitro assays from different laboratories predicting the same experimental endpoints appears to be comparable with the accuracy of GAT single‐task models, suggesting that most of the observed error from the models is a function of the experimental error propagation. Abstract : … (more)
- Is Part Of:
- Molecular informatics. Volume 41:Issue 8(2022)
- Journal:
- Molecular informatics
- Issue:
- Volume 41:Issue 8(2022)
- Issue Display:
- Volume 41, Issue 8 (2022)
- Year:
- 2022
- Volume:
- 41
- Issue:
- 8
- Issue Sort Value:
- 2022-0041-0008-0000
- Page Start:
- n/a
- Page End:
- n/a
- Publication Date:
- 2022-02-23
- Subjects:
- ADME -- in vitro assays -- deep learning -- graph neural network -- multi-task learning
Cheminformatics -- Periodicals
QSAR (Biochemistry) -- Periodicals
Structure-activity relationships (Biochemistry) -- Periodicals
Drugs -- Structure-activity relationships -- Periodicals
615.19 - Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)1868-1751 ↗
http://www3.interscience.wiley.com/journal/123236613/home ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1002/minf.202100321 ↗
- Languages:
- English
- ISSNs:
- 1868-1743
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 5900.817750
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 23013.xml