BETA: a comprehensive benchmark for computational drug–target prediction. Issue 4 (2nd June 2022)
- Record Type:
- Journal Article
- Title:
- BETA: a comprehensive benchmark for computational drug–target prediction. Issue 4 (2nd June 2022)
- Main Title:
- BETA: a comprehensive benchmark for computational drug–target prediction
- Authors:
- Zong, Nansu
Li, Ning
Wen, Andrew
Ngo, Victoria
Yu, Yue
Huang, Ming
Chowdhury, Shaika
Jiang, Chao
Fu, Sunyang
Weinshilboum, Richard
Jiang, Guoqian
Hunter, Lawrence
Liu, Hongfang - Abstract:
- Abstract: Internal validation is the most popular evaluation strategy used for drug–target predictive models. The simple random shuffling in the cross-validation, however, is not always ideal to handle large, diverse and copious datasets as it could potentially introduce bias. Hence, these predictive models cannot be comprehensively evaluated to provide insight into their general performance on a variety of use-cases (e.g. permutations of different levels of connectiveness and categories in drug and target space, as well as validations based on different data sources). In this work, we introduce a benchmark, BETA, that aims to address this gap by (i) providing an extensive multipartite network consisting of 0.97 million biomedical concepts and 8.5 million associations, in addition to 62 million drug–drug and protein–protein similarities and (ii) presenting evaluation strategies that reflect seven cases (i.e. general, screening with different connectivity, target and drug screening based on categories, searching for specific drugs and targets and drug repurposing for specific diseases), a total of seven Tests (consisting of 344 Tasks in total) across multiple sampling and validation strategies. Six state-of-the-art methods covering two broad input data types (chemical structure- and gene sequence-based and network-based) were tested across all the developed Tasks . The best-worst performing cases have been analyzed to demonstrate the ability of the proposed benchmark toAbstract: Internal validation is the most popular evaluation strategy used for drug–target predictive models. The simple random shuffling in the cross-validation, however, is not always ideal to handle large, diverse and copious datasets as it could potentially introduce bias. Hence, these predictive models cannot be comprehensively evaluated to provide insight into their general performance on a variety of use-cases (e.g. permutations of different levels of connectiveness and categories in drug and target space, as well as validations based on different data sources). In this work, we introduce a benchmark, BETA, that aims to address this gap by (i) providing an extensive multipartite network consisting of 0.97 million biomedical concepts and 8.5 million associations, in addition to 62 million drug–drug and protein–protein similarities and (ii) presenting evaluation strategies that reflect seven cases (i.e. general, screening with different connectivity, target and drug screening based on categories, searching for specific drugs and targets and drug repurposing for specific diseases), a total of seven Tests (consisting of 344 Tasks in total) across multiple sampling and validation strategies. Six state-of-the-art methods covering two broad input data types (chemical structure- and gene sequence-based and network-based) were tested across all the developed Tasks . The best-worst performing cases have been analyzed to demonstrate the ability of the proposed benchmark to identify limitations of the tested methods for running over the benchmark tasks. The results highlight BETA as a benchmark in the selection of computational strategies for drug repurposing and target discovery. … (more)
- Is Part Of:
- Briefings in bioinformatics. Volume 23:Issue 4(2022)
- Journal:
- Briefings in bioinformatics
- Issue:
- Volume 23:Issue 4(2022)
- Issue Display:
- Volume 23, Issue 4 (2022)
- Year:
- 2022
- Volume:
- 23
- Issue:
- 4
- Issue Sort Value:
- 2022-0023-0004-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-06-02
- Subjects:
- computational cenchmark -- drug target prediction -- computational drug development -- deep learning
Genetics -- Data processing -- Periodicals
Molecular biology -- Data processing -- Periodicals
Genomes -- Data processing -- Periodicals
572.80285 - Journal URLs:
- http://bib.oxfordjournals.org ↗
http://www.oxfordjournals.org/content?genre=journal&issn=1477-4054 ↗
http://ukcatalogue.oup.com/ ↗
http://firstsearch.oclc.org ↗ - DOI:
- 10.1093/bib/bbac199 ↗
- Languages:
- English
- ISSNs:
- 1467-5463
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2283.958363
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 22545.xml