Large‐scale analysis of intrinsic disorder flavors and associated functions in the protein sequence universe. (25th October 2016)
- Record Type:
- Journal Article
- Title:
- Large‐scale analysis of intrinsic disorder flavors and associated functions in the protein sequence universe. (25th October 2016)
- Main Title:
- Large‐scale analysis of intrinsic disorder flavors and associated functions in the protein sequence universe
- Authors:
- Necci, Marco
Piovesan, Damiano
Tosatto, Silvio C. E. - Abstract:
- Abstract: Intrinsic disorder (ID) in proteins has been extensively described for the last decade; a large‐scale classification of ID in proteins is mostly missing. Here, we provide an extensive analysis of ID in the protein universe on the UniProt database derived from sequence‐based predictions in MobiDB. Almost half the sequences contain an ID region of at least five residues. About 9% of proteins have a long ID region of over 20 residues which are more abundant in Eukaryotic organisms and most frequently cover less than 20% of the sequence. A small subset of about 67, 000 (out of over 80 million) proteins is fully disordered and mostly found in Viruses. Most proteins have only one ID, with short ID evenly distributed along the sequence and long ID overrepresented in the center. The charged residue composition of Das and Pappu was used to classify ID proteins by structural propensities and corresponding functional enrichment. Swollen Coils seem to be used mainly as structural components and in biosynthesis in both Prokaryotes and Eukaryotes. In Bacteria, they are confined in the nucleoid and in Viruses provide DNA binding function. Coils & Hairpins seem to be specialized in ribosome binding and methylation activities. Globules & Tadpoles bind antigens in Eukaryotes but are involved in killing other organisms and cytolysis in Bacteria. The Undefined class is used by Bacteria to bind toxic substances and mediate transport and movement between and within organisms in Viruses.Abstract: Intrinsic disorder (ID) in proteins has been extensively described for the last decade; a large‐scale classification of ID in proteins is mostly missing. Here, we provide an extensive analysis of ID in the protein universe on the UniProt database derived from sequence‐based predictions in MobiDB. Almost half the sequences contain an ID region of at least five residues. About 9% of proteins have a long ID region of over 20 residues which are more abundant in Eukaryotic organisms and most frequently cover less than 20% of the sequence. A small subset of about 67, 000 (out of over 80 million) proteins is fully disordered and mostly found in Viruses. Most proteins have only one ID, with short ID evenly distributed along the sequence and long ID overrepresented in the center. The charged residue composition of Das and Pappu was used to classify ID proteins by structural propensities and corresponding functional enrichment. Swollen Coils seem to be used mainly as structural components and in biosynthesis in both Prokaryotes and Eukaryotes. In Bacteria, they are confined in the nucleoid and in Viruses provide DNA binding function. Coils & Hairpins seem to be specialized in ribosome binding and methylation activities. Globules & Tadpoles bind antigens in Eukaryotes but are involved in killing other organisms and cytolysis in Bacteria. The Undefined class is used by Bacteria to bind toxic substances and mediate transport and movement between and within organisms in Viruses. Fully disordered proteins behave similarly, but are enriched for glycine residues and extracellular structures. … (more)
- Is Part Of:
- Protein science. Volume 25:Number 12(2016:Dec.)
- Journal:
- Protein science
- Issue:
- Volume 25:Number 12(2016:Dec.)
- Issue Display:
- Volume 25, Issue 12 (2016)
- Year:
- 2016
- Volume:
- 25
- Issue:
- 12
- Issue Sort Value:
- 2016-0025-0012-0000
- Page Start:
- 2164
- Page End:
- 2174
- Publication Date:
- 2016-10-25
- Subjects:
- protein sequence -- intrinsic disorder -- protein structure -- MobiDB -- UniProt -- classification
Proteins -- Periodicals
572.6 - Journal URLs:
- http://www.proteinscience.org/ ↗
http://www3.interscience.wiley.com/journal/121502357/ ↗
http://onlinelibrary.wiley.com/ ↗
http://firstsearch.oclc.org ↗ - DOI:
- 10.1002/pro.3041 ↗
- Languages:
- English
- ISSNs:
- 0961-8368
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 6936.105500
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 11195.xml