An efficient approach for the prediction of ion channels and their subfamilies. (October 2015)
- Record Type:
- Journal Article
- Title:
- An efficient approach for the prediction of ion channels and their subfamilies. (October 2015)
- Main Title:
- An efficient approach for the prediction of ion channels and their subfamilies
- Authors:
- Tiwari, Arvind Kumar
Srivastava, Rajeev - Abstract:
- Graphical abstract: Highlights: In this paper, random forest based approach is proposed to predict ion channels families and their subfamilies by using sequence derived features. Here, amino acid composition, dipeptide composition, correlation features, composition, transition and distribution and pseudo amino acid composition are used to represent the protein sequence. The minimum redundancy and maximum relevance feature selection is used to find the optimal number of features for improving the prediction performance. Obtained high accuracies, MCC and ROC area values. Abstract: Ion channels are integral membrane proteins that are responsible for controlling the flow of ions across the cell. There are various biological functions that are performed by different types of ion channels. Therefore for new drug discovery it is necessary to develop a novel computational intelligence techniques based approach for the reliable prediction of ion channels families and their subfamilies. In this paper random forest based approach is proposed to predict ion channels families and their subfamilies by using sequence derived features. Here, seven feature vectors are used to represent the protein sample, including amino acid composition, dipeptide composition, correlation features, composition, transition and distribution and pseudo amino acid composition. The minimum redundancy and maximum relevance feature selection is used to find the optimal number of features for improving theGraphical abstract: Highlights: In this paper, random forest based approach is proposed to predict ion channels families and their subfamilies by using sequence derived features. Here, amino acid composition, dipeptide composition, correlation features, composition, transition and distribution and pseudo amino acid composition are used to represent the protein sequence. The minimum redundancy and maximum relevance feature selection is used to find the optimal number of features for improving the prediction performance. Obtained high accuracies, MCC and ROC area values. Abstract: Ion channels are integral membrane proteins that are responsible for controlling the flow of ions across the cell. There are various biological functions that are performed by different types of ion channels. Therefore for new drug discovery it is necessary to develop a novel computational intelligence techniques based approach for the reliable prediction of ion channels families and their subfamilies. In this paper random forest based approach is proposed to predict ion channels families and their subfamilies by using sequence derived features. Here, seven feature vectors are used to represent the protein sample, including amino acid composition, dipeptide composition, correlation features, composition, transition and distribution and pseudo amino acid composition. The minimum redundancy and maximum relevance feature selection is used to find the optimal number of features for improving the prediction performance. The proposed method achieved an overall accuracy of 100%, 98.01%, 91.5%, 93.0%, 92.2%, 78.6%, 95.5%, 84.9%, MCC values of 1.00, 0.92, 0.88, 0.88, 0.90, 0.79, 0.91, 0.81 and ROC area values of 1.00, 0.99, 0.99, 0.99, 0.99, 0.95, 0.99 and 0.96 using 10-fold cross validation to predict the ion channels and non-ion channels, voltage gated ion channels and ligand gated ion channels, four subfamilies (calcium, potassium, sodium and chloride) of voltage gated ion channels, and four subfamilies of ligand gated ion channels and predict subfamilies of voltage gated calcium, potassium, sodium and chloride ion channels respectively. … (more)
- Is Part Of:
- Computational biology and chemistry. Volume 58(2015)
- Journal:
- Computational biology and chemistry
- Issue:
- Volume 58(2015)
- Issue Display:
- Volume 58, Issue 2015 (2015)
- Year:
- 2015
- Volume:
- 58
- Issue:
- 2015
- Issue Sort Value:
- 2015-0058-2015-0000
- Page Start:
- 205
- Page End:
- 221
- Publication Date:
- 2015-10
- Subjects:
- Ion channels -- Voltage gated ion channels -- Ligand gated ion channels -- Random forest -- Minimum redundancy maximum relevance -- Sodium -- Potassium -- Calcium -- Chloride
Chemistry -- Data processing -- Periodicals
Biology -- Data processing -- Periodicals
Biochemistry -- Data processing
Biology -- Data processing
Molecular biology -- Data processing
Periodicals
Electronic journals
542.85 - Journal URLs:
- http://www.sciencedirect.com/science/journal/14769271 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compbiolchem.2015.07.002 ↗
- Languages:
- English
- ISSNs:
- 1476-9271
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3390.576700
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 9091.xml