Enabling full‐length evolutionary profiles based deep convolutional neural network for predicting DNA‐binding proteins from sequence. Issue 1 (8th July 2019)
- Record Type:
- Journal Article
- Title:
- Enabling full‐length evolutionary profiles based deep convolutional neural network for predicting DNA‐binding proteins from sequence. Issue 1 (8th July 2019)
- Main Title:
- Enabling full‐length evolutionary profiles based deep convolutional neural network for predicting DNA‐binding proteins from sequence
- Authors:
- Chauhan, Sucheta
Ahmad, Shandar - Abstract:
- Abstract: Sequence based DNA‐binding protein (DBP) prediction is a widely studied biological problem. Sliding windows on position specific substitution matrices (PSSMs) rows predict DNA‐binding residues well on known DBPs but the same models cannot be applied to unequally sized protein sequences. PSSM summaries representing column averages and their amino‐acid wise versions have been effectively used for the task, but it remains unclear if these features carry all the PSSM's predictive power, traditionally harnessed for binding site predictions. Here we evaluate if PSSMs scaled up to a fixed size by zero‐vector padding (pPSSM) could perform better than the summary based features on similar models. Using multilayer perceptron (MLP) and deep convolutional neural network (CNN), we found that (a) Summary features work well for single‐genome (human‐only) data but are outperformed by pPSSM for diverse PDB‐derived data sets, suggesting greater summary‐level redundancy in the former, (b) even when summary features work comparably well with pPSSM, a consensus on the two outperforms both of them (c) CNN models comprehensively outperform their corresponding MLP models and (d) actual predicted scores from different models depend on the choice of input feature sets used whereas overall performance levels are model‐dependent in which CNN leads the accuracy.
- Is Part Of:
- Proteins. Volume 88:Issue 1(2020)
- Journal:
- Proteins
- Issue:
- Volume 88:Issue 1(2020)
- Issue Display:
- Volume 88, Issue 1 (2020)
- Year:
- 2020
- Volume:
- 88
- Issue:
- 1
- Issue Sort Value:
- 2020-0088-0001-0000
- Page Start:
- 15
- Page End:
- 30
- Publication Date:
- 2019-07-08
- Subjects:
- convolutional neural networks -- DNA‐binding proteins -- evolutionary profiles -- functional annotations -- PSSM -- sequence‐based predictions
Proteins -- Periodicals
Proteins -- Periodicals
572.6 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/prot.25763 ↗
- Languages:
- English
- ISSNs:
- 0887-3585
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 6936.164000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 15272.xml