Feature selection for regression problems based on the Morisita estimator of intrinsic dimension. (October 2017)
- Record Type:
- Journal Article
- Title:
- Feature selection for regression problems based on the Morisita estimator of intrinsic dimension. (October 2017)
- Main Title:
- Feature selection for regression problems based on the Morisita estimator of intrinsic dimension
- Authors:
- Golay, Jean
Leuenberger, Michael
Kanevski, Mikhail - Abstract:
- Highlights: A new supervised filter for regression problems is proposed. The filter uses the newly introduced Morisita estimator of intrinsic dimension. The filter distinguishes between relevant, irrelevant and redundant features. The filter is comprehensively validated using real and simulated datasets. A generic methodology for validating and comparing filters is suggested. Abstract: Data acquisition, storage and management have been improved, while the key factors of many phenomena are not well known. Consequently, irrelevant and redundant features artificially increase the size of datasets, which complicates learning tasks, such as regression. To address this problem, feature selection methods have been proposed. This paper introduces a new supervised filter based on the Morisita estimator of intrinsic dimension. It can identify relevant features and distinguish between redundant and irrelevant information. Besides, it offers a clear graphical representation of the results, and it can be easily implemented in different programming languages. Comprehensive numerical experiments are conducted using simulated datasets characterized by different levels of complexity, sample size and noise. The suggested algorithm is also successfully tested on a selection of real world applications and compared with RReliefF using extreme learning machine. In addition, a new measure of feature relevance is presented and discussed.
- Is Part Of:
- Pattern recognition. Volume 70(2017:Oct.)
- Journal:
- Pattern recognition
- Issue:
- Volume 70(2017:Oct.)
- Issue Display:
- Volume 70 (2017)
- Year:
- 2017
- Volume:
- 70
- Issue Sort Value:
- 2017-0070-0000-0000
- Page Start:
- 126
- Page End:
- 138
- Publication Date:
- 2017-10
- Subjects:
- Feature selection -- Intrinsic dimension -- Morisita index -- Measure of relevance -- Data mining
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2017.05.008 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 1043.xml