A new feature selection algorithm based on relevance, redundancy and complementarity. (April 2020)
- Record Type:
- Journal Article
- Title:
- A new feature selection algorithm based on relevance, redundancy and complementarity. (April 2020)
- Main Title:
- A new feature selection algorithm based on relevance, redundancy and complementarity
- Authors:
- Li, Chao
Luo, Xiao
Qi, Yanpeng
Gao, Zhenbo
Lin, Xiaohui - Abstract:
- Abstract: Defining important information from biological data is critical for the study of disease diagnosis, drug efficacy and individualized treatment. Hence, the feature selection technique is widely applied. Many feature selection methods measure features based on relevance, redundancy and complementarity. Feature complementarity means that two features' cooperation can provide more information than the simple summation of their individual information. In this paper, we studied the feature selection technique and proposed a new feature selection algorithm based on relevance, redundancy and complementarity (FS-RRC). On selecting the feature subset, FS-RRC not only evaluates the feature relevance with the class label and the redundancy among the features but also evaluates the feature complementarity. If complementary features exist for a selected relevant feature, FS-RRC retains the feature with the largest complementarity to the selected feature subset. To show the performance of FS-RRC, it was compared with eleven efficient feature selection methods, MIFS, mRMR, CMIM, ReliefF, FCBF, PGVNS, MCRMCR, MCRMICR, RCDFS, SAFE and SVM-RFE on two synthetic datasets and fifteen public biological datasets. The experimental results showed the superiority of FS-RRC in accuracy, sensitivity, specificity, stability and time complexity. Hence, integrating feature individual discriminative ability, redundancy and complementarity can define more powerful feature subset for biological dataAbstract: Defining important information from biological data is critical for the study of disease diagnosis, drug efficacy and individualized treatment. Hence, the feature selection technique is widely applied. Many feature selection methods measure features based on relevance, redundancy and complementarity. Feature complementarity means that two features' cooperation can provide more information than the simple summation of their individual information. In this paper, we studied the feature selection technique and proposed a new feature selection algorithm based on relevance, redundancy and complementarity (FS-RRC). On selecting the feature subset, FS-RRC not only evaluates the feature relevance with the class label and the redundancy among the features but also evaluates the feature complementarity. If complementary features exist for a selected relevant feature, FS-RRC retains the feature with the largest complementarity to the selected feature subset. To show the performance of FS-RRC, it was compared with eleven efficient feature selection methods, MIFS, mRMR, CMIM, ReliefF, FCBF, PGVNS, MCRMCR, MCRMICR, RCDFS, SAFE and SVM-RFE on two synthetic datasets and fifteen public biological datasets. The experimental results showed the superiority of FS-RRC in accuracy, sensitivity, specificity, stability and time complexity. Hence, integrating feature individual discriminative ability, redundancy and complementarity can define more powerful feature subset for biological data analysis, and feature complementarity can help to study the biomedical phenomena more accurately. Highlights: A novel feature selection method based on relevance, redundancy and complementarity (FS - RRC) is proposed. Multi - information is applied to examine the complementarity between features. Experimental results shows that FS - RRC could measure features more accurately and stably than other methods. … (more)
- Is Part Of:
- Computers in biology and medicine. Volume 119(2020)
- Journal:
- Computers in biology and medicine
- Issue:
- Volume 119(2020)
- Issue Display:
- Volume 119, Issue 2020 (2020)
- Year:
- 2020
- Volume:
- 119
- Issue:
- 2020
- Issue Sort Value:
- 2020-0119-2020-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-04
- Subjects:
- Biological data analysis -- Feature selection -- Feature relevance -- Feature redundancy -- Feature complementarity
Medicine -- Data processing -- Periodicals
Biology -- Data processing -- Periodicals
610.285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00104825/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compbiomed.2020.103667 ↗
- Languages:
- English
- ISSNs:
- 0010-4825
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.880000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 13507.xml