Vertical federated learning-based feature selection with non-overlapping sample utilization. (1st December 2022)
- Record Type:
- Journal Article
- Title:
- Vertical federated learning-based feature selection with non-overlapping sample utilization. (1st December 2022)
- Main Title:
- Vertical federated learning-based feature selection with non-overlapping sample utilization
- Authors:
- Feng, Siwei
- Abstract:
- Abstract: Vertical federated learning (VFL) is a privacy preserving collaborative machine learning technique designed for distributed learning scenarios in which data from different parties have overlap in the sample space. In this paper, a VFL method for feature selection, which is an effective dimensionality reduction technique that selects a subset of informative features from high-dimensional data by eliminating irrelevant and redundant features, is proposed. Because of the potential insufficiency of useful information for learning informative features and the difficulty in sharing raw data among parties due to the increasing awareness of data privacy protection, it is desirable to exploit information from multiple parties without raw data sharing. In this paper, we propose a VFL-based feature selection method that leverages deep learning models as well as complementary information from features in the same samples at multiple parties without data disclosure. In order to further improve feature selection performance, information of samples that do not have features appearing in all parties are also utilized. Promising results in extensive experiments show the effectiveness of the proposed approach in terms of collaborative feature selection without data sharing. Highlights: In this paper, we bridge this gap by proposing a novel VFL-based feature selection method—Vertical Federated Learning-based Feature Selection (VFLFS). To the best of our knowledge, this is the firstAbstract: Vertical federated learning (VFL) is a privacy preserving collaborative machine learning technique designed for distributed learning scenarios in which data from different parties have overlap in the sample space. In this paper, a VFL method for feature selection, which is an effective dimensionality reduction technique that selects a subset of informative features from high-dimensional data by eliminating irrelevant and redundant features, is proposed. Because of the potential insufficiency of useful information for learning informative features and the difficulty in sharing raw data among parties due to the increasing awareness of data privacy protection, it is desirable to exploit information from multiple parties without raw data sharing. In this paper, we propose a VFL-based feature selection method that leverages deep learning models as well as complementary information from features in the same samples at multiple parties without data disclosure. In order to further improve feature selection performance, information of samples that do not have features appearing in all parties are also utilized. Promising results in extensive experiments show the effectiveness of the proposed approach in terms of collaborative feature selection without data sharing. Highlights: In this paper, we bridge this gap by proposing a novel VFL-based feature selection method—Vertical Federated Learning-based Feature Selection (VFLFS). To the best of our knowledge, this is the first deep learning-based vertical federated learning approach with feature selection. A strategy to make use of non-overlapping samples is also proposed to improve feature selection effectiveness. The proposed VFLFS approach has been evaluated extensively based on real-world datasets. The results show that VFLFS can significantly improve model performance under VFL settings compared to four state of the art baselines, especially in conditions where a large proportion of the data samples do not overlap across data owners. … (more)
- Is Part Of:
- Expert systems with applications. Volume 208(2022)
- Journal:
- Expert systems with applications
- Issue:
- Volume 208(2022)
- Issue Display:
- Volume 208, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 208
- Issue:
- 2022
- Issue Sort Value:
- 2022-0208-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-12-01
- Subjects:
- Feature selection -- Vertical federated learning -- Privacy protection -- Deep learning
Expert systems (Computer science) -- Periodicals
Systèmes experts (Informatique) -- Périodiques
Electronic journals
006.33 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09574174 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.eswa.2022.118097 ↗
- Languages:
- English
- ISSNs:
- 0957-4174
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3842.004220
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 23331.xml