A new feature selection method based on symmetrical uncertainty and interaction gain. (December 2019)
- Record Type:
- Journal Article
- Title:
- A new feature selection method based on symmetrical uncertainty and interaction gain. (December 2019)
- Main Title:
- A new feature selection method based on symmetrical uncertainty and interaction gain
- Authors:
- Lin, Xiaohui
Li, Chao
Ren, Weijie
Luo, Xiao
Qi, Yanpeng - Abstract:
- Graphical abstract: Highlights: Proposing a novel feature evaluation criterion taking feature relevance and feature interaction into consideration. Employing Interaction Gain to examine the interaction among features. Combining proposed feature evaluation criterion with Recursive Feature Elimination (RFE) technique to define the informative feature subset. Experimental results on eleven public data sets showed that proposed method could measure features more accurately and stably than other methods. Abstract: Defining important information from complex biological data is of great significance in biological study. It is known that the physiological and pathological changes in an organism are usually influenced by molecule interactions. Analyzing biological data by fusing the evaluation of the individual molecules and molecule interactions could induce a more accurate and comprehensive understanding of the organism. This study proposes an Interaction Gain - Recursive Feature Elimination (IG-RFE) method which evaluates the feature importance by combining the relevance between feature and class label and the interaction among features. Symmetrical uncertainty is adopted to measure the relevance between feature and the class label. The average normalized interaction gain of feature f, every other features and the class label is calculated to reflect the interaction of feature f with other features in the feature set F . Based on the combination of symmetrical uncertainty andGraphical abstract: Highlights: Proposing a novel feature evaluation criterion taking feature relevance and feature interaction into consideration. Employing Interaction Gain to examine the interaction among features. Combining proposed feature evaluation criterion with Recursive Feature Elimination (RFE) technique to define the informative feature subset. Experimental results on eleven public data sets showed that proposed method could measure features more accurately and stably than other methods. Abstract: Defining important information from complex biological data is of great significance in biological study. It is known that the physiological and pathological changes in an organism are usually influenced by molecule interactions. Analyzing biological data by fusing the evaluation of the individual molecules and molecule interactions could induce a more accurate and comprehensive understanding of the organism. This study proposes an Interaction Gain - Recursive Feature Elimination (IG-RFE) method which evaluates the feature importance by combining the relevance between feature and class label and the interaction among features. Symmetrical uncertainty is adopted to measure the relevance between feature and the class label. The average normalized interaction gain of feature f, every other features and the class label is calculated to reflect the interaction of feature f with other features in the feature set F . Based on the combination of symmetrical uncertainty and normalized interaction gain, less important features are removed iteratively. To show the performance of IG-RFE, it was compared with seven efficient feature selection methods, MIFS, mRMR, CMIM, ReliefF, FCBF, PGVNS and SVM-RFE, on eleven public datasets. The experiment results showed the superiority of IG-RFE in accuracy, sensitivity, specificity and stability. Hence, integrating feature individual discriminative ability and the interaction among features could better evaluate feature importance in biological data analysis. … (more)
- Is Part Of:
- Computational biology and chemistry. Volume 83(2019)
- Journal:
- Computational biology and chemistry
- Issue:
- Volume 83(2019)
- Issue Display:
- Volume 83, Issue 2019 (2019)
- Year:
- 2019
- Volume:
- 83
- Issue:
- 2019
- Issue Sort Value:
- 2019-0083-2019-0000
- Page Start:
- Page End:
- Publication Date:
- 2019-12
- Subjects:
- Feature selection -- Biological data analysis -- Interaction gain
Chemistry -- Data processing -- Periodicals
Biology -- Data processing -- Periodicals
Biochemistry -- Data processing
Biology -- Data processing
Molecular biology -- Data processing
Periodicals
Electronic journals
542.85 - Journal URLs:
- http://www.sciencedirect.com/science/journal/14769271 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compbiolchem.2019.107149 ↗
- Languages:
- English
- ISSNs:
- 1476-9271
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3390.576700
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 17912.xml