DeepFunc: A Deep Learning Framework for Accurate Prediction of Protein Functions from Protein Sequences and Interactions. Issue 12 (27th May 2019)
- Record Type:
- Journal Article
- Title:
- DeepFunc: A Deep Learning Framework for Accurate Prediction of Protein Functions from Protein Sequences and Interactions. Issue 12 (27th May 2019)
- Main Title:
- DeepFunc: A Deep Learning Framework for Accurate Prediction of Protein Functions from Protein Sequences and Interactions
- Authors:
- Zhang, Fuhao
Song, Hong
Zeng, Min
Li, Yaohang
Kurgan, Lukasz
Li, Min - Abstract:
- Abstract: Annotation of protein functions plays an important role in understanding life at the molecular level. High‐throughput sequencing produces massive numbers of raw proteins sequences and only about 1% of them have been manually annotated with functions. Experimental annotations of functions are expensive, time‐consuming and do not keep up with the rapid growth of the sequence numbers. This motivates the development of computational approaches that predict protein functions. A novel deep learning framework, DeepFunc, is proposed which accurately predicts protein functions from protein sequence‐ and network‐derived information. More precisely, DeepFunc uses a long and sparse binary vector to encode information concerning domains, families, and motifs collected from the InterPro tool that is associated with the input protein sequence. This vector is processed with two neural layers to obtain a low‐dimensional vector which is combined with topological information extracted from protein–protein interactions (PPIs) and functional linkages. The combined information is processed by a deep neural network that predicts protein functions. DeepFunc is empirically and comparatively tested on a benchmark testing dataset and the Critical Assessment of protein Function Annotation algorithms (CAFA) 3 dataset. The experimental results demonstrate that DeepFunc outperforms current methods on the testing dataset and that it secures the highest Fmax = 0.54 and AUC = 0.94 on the CAFA3Abstract: Annotation of protein functions plays an important role in understanding life at the molecular level. High‐throughput sequencing produces massive numbers of raw proteins sequences and only about 1% of them have been manually annotated with functions. Experimental annotations of functions are expensive, time‐consuming and do not keep up with the rapid growth of the sequence numbers. This motivates the development of computational approaches that predict protein functions. A novel deep learning framework, DeepFunc, is proposed which accurately predicts protein functions from protein sequence‐ and network‐derived information. More precisely, DeepFunc uses a long and sparse binary vector to encode information concerning domains, families, and motifs collected from the InterPro tool that is associated with the input protein sequence. This vector is processed with two neural layers to obtain a low‐dimensional vector which is combined with topological information extracted from protein–protein interactions (PPIs) and functional linkages. The combined information is processed by a deep neural network that predicts protein functions. DeepFunc is empirically and comparatively tested on a benchmark testing dataset and the Critical Assessment of protein Function Annotation algorithms (CAFA) 3 dataset. The experimental results demonstrate that DeepFunc outperforms current methods on the testing dataset and that it secures the highest Fmax = 0.54 and AUC = 0.94 on the CAFA3 dataset. … (more)
- Is Part Of:
- Proteomics. Volume 19:Issue 12(2019)
- Journal:
- Proteomics
- Issue:
- Volume 19:Issue 12(2019)
- Issue Display:
- Volume 19, Issue 12 (2019)
- Year:
- 2019
- Volume:
- 19
- Issue:
- 12
- Issue Sort Value:
- 2019-0019-0012-0000
- Page Start:
- n/a
- Page End:
- n/a
- Publication Date:
- 2019-05-27
- Subjects:
- deep learning -- functional linkages -- protein domains -- protein functions -- protein–protein interactions -- protein sequences
Proteins -- Separation -- Periodicals
Bioinformatics -- Periodicals
Proteomics -- Periodicals
Genomes -- Periodicals
Molecular genetics -- Periodicals
572.605 - Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)1615-9861 ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1002/pmic.201900019 ↗
- Languages:
- English
- ISSNs:
- 1615-9853
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 6936.178000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 10892.xml