Feature selection using Joint Mutual Information Maximisation. Issue 22 (1st December 2015)
- Record Type:
- Journal Article
- Title:
- Feature selection using Joint Mutual Information Maximisation. Issue 22 (1st December 2015)
- Main Title:
- Feature selection using Joint Mutual Information Maximisation
- Authors:
- Bennasar, Mohamed
Hicks, Yulia
Setchi, Rossitza - Abstract:
- Highlights: Two new feature selection methods are proposed based on joint mutual information. The methods use joint mutual information with maximum of the minimum criterion. The methods address the problem of selection of redundant and irrelevant features. The methods are evaluated using eleven public data sets and five competing methods. The proposed JMIM method outperforms five competing methods in terms of accuracy. Abstract: Feature selection is used in many application areas relevant to expert and intelligent systems, such as data mining and machine learning, image processing, anomaly detection, bioinformatics and natural language processing. Feature selection based on information theory is a popular approach due its computational efficiency, scalability in terms of the dataset dimensionality, and independence from the classifier. Common drawbacks of this approach are the lack of information about the interaction between the features and the classifier, and the selection of redundant and irrelevant features. The latter is due to the limitations of the employed goal functions leading to overestimation of the feature significance. To address this problem, this article introduces two new nonlinear feature selection methods, namely Joint Mutual Information Maximisation (JMIM) and Normalised Joint Mutual Information Maximisation (NJMIM); both these methods use mutual information and the 'maximum of the minimum' criterion, which alleviates the problem of overestimation of theHighlights: Two new feature selection methods are proposed based on joint mutual information. The methods use joint mutual information with maximum of the minimum criterion. The methods address the problem of selection of redundant and irrelevant features. The methods are evaluated using eleven public data sets and five competing methods. The proposed JMIM method outperforms five competing methods in terms of accuracy. Abstract: Feature selection is used in many application areas relevant to expert and intelligent systems, such as data mining and machine learning, image processing, anomaly detection, bioinformatics and natural language processing. Feature selection based on information theory is a popular approach due its computational efficiency, scalability in terms of the dataset dimensionality, and independence from the classifier. Common drawbacks of this approach are the lack of information about the interaction between the features and the classifier, and the selection of redundant and irrelevant features. The latter is due to the limitations of the employed goal functions leading to overestimation of the feature significance. To address this problem, this article introduces two new nonlinear feature selection methods, namely Joint Mutual Information Maximisation (JMIM) and Normalised Joint Mutual Information Maximisation (NJMIM); both these methods use mutual information and the 'maximum of the minimum' criterion, which alleviates the problem of overestimation of the feature significance as demonstrated both theoretically and experimentally. The proposed methods are compared using eleven publically available datasets with five competing methods. The results demonstrate that the JMIM method outperforms the other methods on most tested public datasets, reducing the relative average classification error by almost 6% in comparison to the next best performing method. The statistical significance of the results is confirmed by the ANOVA test. Moreover, this method produces the best trade-off between accuracy and stability. … (more)
- Is Part Of:
- Expert systems with applications. Volume 42:Issue 22(2015)
- Journal:
- Expert systems with applications
- Issue:
- Volume 42:Issue 22(2015)
- Issue Display:
- Volume 42, Issue 22 (2015)
- Year:
- 2015
- Volume:
- 42
- Issue:
- 22
- Issue Sort Value:
- 2015-0042-0022-0000
- Page Start:
- 8520
- Page End:
- 8532
- Publication Date:
- 2015-12-01
- Subjects:
- Feature selection -- Mutual information -- Joint mutual information -- Conditional mutual information -- Subset feature selection -- Classification -- Dimensionality reduction -- Feature selection stability
Expert systems (Computer science) -- Periodicals
Systèmes experts (Informatique) -- Périodiques
Electronic journals
006.33 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09574174 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.eswa.2015.07.007 ↗
- Languages:
- English
- ISSNs:
- 0957-4174
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3842.004220
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 8773.xml