Online multi-label streaming feature selection based on neighborhood rough set. (December 2018)
- Record Type:
- Journal Article
- Title:
- Online multi-label streaming feature selection based on neighborhood rough set. (December 2018)
- Main Title:
- Online multi-label streaming feature selection based on neighborhood rough set
- Authors:
- Liu, Jinghua
Lin, Yaojin
Li, Yuwen
Weng, Wei
Wu, Shunxiang - Abstract:
- Highlights: A new neighborhood relation is proposed to effectively solve the problem of granularity selection in neighborhood rough set. We generalize classical neighborhood rough set model to fit multi-label learning and present a novel measure to compute positive region. We propose a new feature selection framework, which solves online streaming feature selection and multi-label feature selection simultaneously. The experiment on ten benchmark datasets with different application scenarios shows a competitive performance of our proposed method against the state-of-the-art multi-label feature selection algorithms. Abstract: Multi-label feature selection has grabbed intensive attention in many big data applications. However, traditional multi-label feature selection methods generally ignore a real-world scenario, i.e., the features constantly flow into the model one by one over time. To address this problem, we develop a novel online multi-label streaming feature selection method based on neighborhood rough set to select a feature subset which contains strongly relevant and non-redundant features. The main motivation is that data mining based on neighborhood rough set does not require any priori knowledge of the feature space structure. Moreover, neighborhood rough set deals with mixed data without breaking the neighborhood and order structure of data. In this paper, we first introduce the maximum-nearest-neighbor of instance to granulate all instances which can solve theHighlights: A new neighborhood relation is proposed to effectively solve the problem of granularity selection in neighborhood rough set. We generalize classical neighborhood rough set model to fit multi-label learning and present a novel measure to compute positive region. We propose a new feature selection framework, which solves online streaming feature selection and multi-label feature selection simultaneously. The experiment on ten benchmark datasets with different application scenarios shows a competitive performance of our proposed method against the state-of-the-art multi-label feature selection algorithms. Abstract: Multi-label feature selection has grabbed intensive attention in many big data applications. However, traditional multi-label feature selection methods generally ignore a real-world scenario, i.e., the features constantly flow into the model one by one over time. To address this problem, we develop a novel online multi-label streaming feature selection method based on neighborhood rough set to select a feature subset which contains strongly relevant and non-redundant features. The main motivation is that data mining based on neighborhood rough set does not require any priori knowledge of the feature space structure. Moreover, neighborhood rough set deals with mixed data without breaking the neighborhood and order structure of data. In this paper, we first introduce the maximum-nearest-neighbor of instance to granulate all instances which can solve the problem of granularity selection in neighborhood rough set, and then generalize neighborhood rough set in single-label to fit multi-label learning. Meanwhile, an online multi-label streaming feature selection framework, which includes online importance selection and online redundancy update, is presented. Under this framework, we propose a criterion to select the important features relative to the currently selected features, and design a bound on pairwise correlations between features under label set to filter out redundant features. An empirical study using a series of benchmark datasets demonstrates that the proposed method outperforms other state-of-the-art multi-label feature selection methods. … (more)
- Is Part Of:
- Pattern recognition. Volume 84(2018:Dec.)
- Journal:
- Pattern recognition
- Issue:
- Volume 84(2018:Dec.)
- Issue Display:
- Volume 84 (2018)
- Year:
- 2018
- Volume:
- 84
- Issue Sort Value:
- 2018-0084-0000-0000
- Page Start:
- 273
- Page End:
- 287
- Publication Date:
- 2018-12
- Subjects:
- Online feature selection -- Multi-label learning -- Neighborhood rough set -- Granularity
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2018.07.021 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 16664.xml