The effect of three novel feature extraction methods on the prediction of the subcellular localization of multi-site virus proteins. Issue 1 (1st January 2018)
- Record Type:
- Journal Article
- Title:
- The effect of three novel feature extraction methods on the prediction of the subcellular localization of multi-site virus proteins. Issue 1 (1st January 2018)
- Main Title:
- The effect of three novel feature extraction methods on the prediction of the subcellular localization of multi-site virus proteins
- Authors:
- Wang, Lei
Zhao, Yaou
Chen, Yuehui
Wang, Dong - Abstract:
- ABSTRACT: Experimental methods play a crucial role in identifying the subcellular localization of proteins and building high-quality databases. However, more efficient, automated computational methods are required to predict the subcellular localization of proteins on a large scale. Various efficient feature extraction methods have been proposed to predict subcellular localization, but challenges remain. In this paper, three novel feature extraction methods are established to improve multi-site prediction. The first novel feature extraction method utilizes repetitive information via moving windows based on a dipeptide pseudo amino acid composition method (R-Dipeptide). The second novel feature extraction method utilizes the impact of each amino acid residue on its following residues based on pseudo amino acids (I-PseAAC). The third novel feature extraction method provides local information about protein sequences that reflects the strength of the physicochemical properties of residues (PseAAC2). The multi-label k-nearest neighbor algorithm (MLKNN) is used to predict the subcellular localization of multi-site virus proteins. The best overall accuracy values of R-Dipeptide, I-PseAAC, and PseAAC2 when applied to dataset S from Virus-mPloc are 59.92%, 59.13%, and 57.94% respectively.
- Is Part Of:
- Bioengineered. Volume 9:Issue 1(2018)
- Journal:
- Bioengineered
- Issue:
- Volume 9:Issue 1(2018)
- Issue Display:
- Volume 9, Issue 1 (2018)
- Year:
- 2018
- Volume:
- 9
- Issue:
- 1
- Issue Sort Value:
- 2018-0009-0001-0000
- Page Start:
- 196
- Page End:
- 202
- Publication Date:
- 2018-01-01
- Subjects:
- feature extraction -- I-PseAAC -- PseAAC2 -- R-Dipeptide -- subcellular localization
Biomedical engineering -- Periodicals
Biotechnology -- Periodicals
Microbiology -- Periodicals
660.6 - Journal URLs:
- http://www.tandfonline.com/toc/kbie20/current ↗
http://www.landesbioscience.com/journals/bioe/ ↗
http://www.tandfonline.com/ ↗ - DOI:
- 10.1080/21655979.2017.1373536 ↗
- Languages:
- English
- ISSNs:
- 2165-5987
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 12329.xml