Benchmarks in antimicrobial peptide prediction are biased due to the selection of negative data. Issue 5 (21st August 2022)
- Record Type:
- Journal Article
- Title:
- Benchmarks in antimicrobial peptide prediction are biased due to the selection of negative data. Issue 5 (21st August 2022)
- Main Title:
- Benchmarks in antimicrobial peptide prediction are biased due to the selection of negative data
- Authors:
- Sidorczuk, Katarzyna
Gagat, Przemysław
Pietluch, Filip
Kała, Jakub
Rafacz, Dominik
Bąkała, Laura
Słowik, Jadwiga
Kolenda, Rafał
Rödiger, Stefan
Fingerhut, Legana C H W
Cooke, Ira R
Mackiewicz, Paweł
Burdukiewicz, Michał
- Abstract:
- Antimicrobial peptides (AMPs) are a heterogeneous group of short polypeptides that target not only microorganisms but also viruses and cancer cells. Due to their lower selection for resistance compared with traditional antibiotics, AMPs have been attracting ever-growing attention from researchers, including bioinformaticians. Machine learning represents the most cost-effective method for novel AMP discovery, and consequently many computational tools for AMP prediction have recently been developed. In this article, we investigate the impact of negative data sampling on model performance and benchmarking. We generated 660 predictive models using 12 machine learning architectures, a single positive data set and 11 negative data sampling methods; the architectures and methods were defined on the basis of published AMP prediction software. Our results clearly indicate that similar training and benchmark data sets, i.e. those produced by the same or a similar negative data sampling method, positively affect model performance. Consequently, all the benchmark analyses that have been performed for AMP prediction models are significantly biased and, moreover, we do not know which model is the most accurate. To provide researchers with reliable information about the performance of AMP predictors, we also created a web server, AMPBenchmark, for fair model benchmarking. AMPBenchmark is available at http://BioGenies.info/AMPBenchmark.
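The experimental design described in the abstract can be sketched as a simple grid. This is a minimal illustration, not the authors' code: the counts 12, 11 and 660 come from the abstract, while the replicate count is only inferred from the arithmetic 660 / (12 × 11), and the function names `experiment_grid` and `benchmark_pairs` are hypothetical.

```python
from itertools import product

# Counts stated in the abstract.
N_ARCHITECTURES = 12
N_SAMPLING_METHODS = 11
N_MODELS = 660

# Assumption: the abstract does not state a replicate count.
# It is inferred here purely from 660 / (12 * 11).
replicates, remainder = divmod(N_MODELS, N_ARCHITECTURES * N_SAMPLING_METHODS)


def experiment_grid(n_arch, n_methods, n_reps):
    """Enumerate (architecture, training sampling method, replicate) triples."""
    return list(product(range(n_arch), range(n_methods), range(n_reps)))


def benchmark_pairs(n_methods):
    """Pair every training sampling method with every benchmark sampling method.

    The third element flags pairs where both data sets come from the same
    method, the situation the abstract identifies as favouring the model.
    """
    return [
        (train, bench, train == bench)
        for train, bench in product(range(n_methods), repeat=2)
    ]


grid = experiment_grid(N_ARCHITECTURES, N_SAMPLING_METHODS, replicates)
pairs = benchmark_pairs(N_SAMPLING_METHODS)
matched = sum(1 for _, _, same in pairs if same)

print(len(grid), len(pairs), matched)
```

Under these assumptions, each trained model would be scored against benchmark sets built with every sampling method, so that matched and mismatched train/benchmark pairs can be compared separately.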
- Is Part Of:
- Briefings in bioinformatics. Volume 23:Issue 5(2022)
- Journal:
- Briefings in bioinformatics
- Issue:
- Volume 23:Issue 5(2022)
- Issue Display:
- Volume 23, Issue 5 (2022)
- Year:
- 2022
- Volume:
- 23
- Issue:
- 5
- Issue Sort Value:
- 2022-0023-0005-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-08-21
- Subjects:
- antimicrobial peptides -- benchmarks -- machine learning -- negative sampling -- prediction -- reproducibility
Genetics -- Data processing -- Periodicals
Molecular biology -- Data processing -- Periodicals
Genomes -- Data processing -- Periodicals
572.80285
- Journal URLs:
- http://bib.oxfordjournals.org
http://www.oxfordjournals.org/content?genre=journal&issn=1477-4054
http://ukcatalogue.oup.com/
http://firstsearch.oclc.org
- DOI:
- 10.1093/bib/bbac343
- Languages:
- English
- ISSNs:
- 1467-5463
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 2283.958363
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 23923.xml