Simultaneous feature selection and discretization based on mutual information. (July 2019)
- Record Type:
- Journal Article
- Title:
- Simultaneous feature selection and discretization based on mutual information. (July 2019)
- Main Title:
- Simultaneous feature selection and discretization based on mutual information
- Authors:
- Sharmin, Sadia
Shoyaib, Mohammad
Ali, Amin Ahsan
Khan, Muhammad Asif Hossain
Chae, Oksam - Abstract:
- Highlights: We address discretization and feature selection jointly with a single criteria. The proposed discretization method is dynamic and independent of classification algorithms. The amount of errors introduced for Relevancy, Redundancy and Complementary Information are derived analytically. It is also analytically shown that Relevancy, Redundancy and Complementary follows χ 2 -distribution. A χ 2 -based search is introduced to select a small set of features and to discretize them with small number of intervals. Abstract: Recently mutual information based feature selection criteria have gained popularity for their superior performances in different applications of pattern recognition and machine learning areas. However, these methods do not consider the correction while computing mutual information for finite samples. Again, finding appropriate discretization of features is often a necessary step prior to feature selection. However, existing researches rarely discuss both discretization and feature selection simultaneously. To solve these issues, Joint Bias corrected Mutual Information (JBMI) is firstly proposed in this paper for feature selection. Secondly, a framework namely modified discretization and feature selection based on mutual information is proposed that incorporates JBMI based feature selection and dynamic discretization, both of which use a χ 2 based searching method. Experimental results on thirty benchmark datasets show that in most of the cases, theHighlights: We address discretization and feature selection jointly with a single criteria. The proposed discretization method is dynamic and independent of classification algorithms. The amount of errors introduced for Relevancy, Redundancy and Complementary Information are derived analytically. It is also analytically shown that Relevancy, Redundancy and Complementary follows χ 2 -distribution. A χ 2 -based search is introduced to select a small set of features and to discretize them with small number of intervals. Abstract: Recently mutual information based feature selection criteria have gained popularity for their superior performances in different applications of pattern recognition and machine learning areas. However, these methods do not consider the correction while computing mutual information for finite samples. Again, finding appropriate discretization of features is often a necessary step prior to feature selection. However, existing researches rarely discuss both discretization and feature selection simultaneously. To solve these issues, Joint Bias corrected Mutual Information (JBMI) is firstly proposed in this paper for feature selection. Secondly, a framework namely modified discretization and feature selection based on mutual information is proposed that incorporates JBMI based feature selection and dynamic discretization, both of which use a χ 2 based searching method. Experimental results on thirty benchmark datasets show that in most of the cases, the proposed methods outperform the state-of-the-art methods. … (more)
- Is Part Of:
- Pattern recognition. Volume 91(2019:Jul.)
- Journal:
- Pattern recognition
- Issue:
- Volume 91(2019:Jul.)
- Issue Display:
- Volume 91 (2019)
- Year:
- 2019
- Volume:
- 91
- Issue Sort Value:
- 2019-0091-0000-0000
- Page Start:
- 162
- Page End:
- 174
- Publication Date:
- 2019-07
- Subjects:
- Feature selection -- Mutual information -- Bias -- Dynamic discretization
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2019.02.016 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 9741.xml