An effective self-supervised framework for learning expressive molecular global representations to drug discovery. Issue 6 (3rd May 2021)
- Record Type:
- Journal Article
- Title:
- An effective self-supervised framework for learning expressive molecular global representations to drug discovery. Issue 6 (3rd May 2021)
- Main Title:
- An effective self-supervised framework for learning expressive molecular global representations to drug discovery
- Authors:
- Li, Pengyong
Wang, Jun
Qiao, Yixuan
Chen, Hao
Yu, Yihuan
Yao, Xiaojun
Gao, Peng
Xie, Guotong
Song, Sen - Abstract:
- Abstract: How to produce expressive molecular representations is a fundamental challenge in artificial intelligence-driven drug discovery. Graph neural network (GNN) has emerged as a powerful technique for modeling molecular data. However, previous supervised approaches usually suffer from the scarcity of labeled data and poor generalization capability. Here, we propose a novel molecular pre-training graph-based deep learning framework, named MPG, that learns molecular representations from large-scale unlabeled molecules. In MPG, we proposed a powerful GNN for modelling molecular graph named MolGNet, and designed an effective self-supervised strategy for pre-training the model at both the node and graph-level. After pre-training on 11 million unlabeled molecules, we revealed that MolGNet can capture valuable chemical insights to produce interpretable representation. The pre-trained MolGNet can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of drug discovery tasks, including molecular properties prediction, drug-drug interaction and drug-target interaction, on 14 benchmark datasets. The pre-trained MolGNet in MPG has the potential to become an advanced molecular encoder in the drug discovery pipeline.
- Is Part Of:
- Briefings in bioinformatics. Volume 22:Issue 6(2021)
- Journal:
- Briefings in bioinformatics
- Issue:
- Volume 22:Issue 6(2021)
- Issue Display:
- Volume 22, Issue 6 (2021)
- Year:
- 2021
- Volume:
- 22
- Issue:
- 6
- Issue Sort Value:
- 2021-0022-0006-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-05-03
- Subjects:
- molecular representation -- deep learning -- graph neural network -- self-supervised learning
Genetics -- Data processing -- Periodicals
Molecular biology -- Data processing -- Periodicals
Genomes -- Data processing -- Periodicals
572.80285 - Journal URLs:
- http://bib.oxfordjournals.org ↗
http://www.oxfordjournals.org/content?genre=journal&issn=1477-4054 ↗
http://ukcatalogue.oup.com/ ↗
http://firstsearch.oclc.org ↗ - DOI:
- 10.1093/bib/bbab109 ↗
- Languages:
- English
- ISSNs:
- 1467-5463
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2283.958363
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 19693.xml