Unified distributed robust regression and variable selection framework for massive data. (30th December 2021)
- Record Type:
- Journal Article
- Title:
- Unified distributed robust regression and variable selection framework for massive data. (30th December 2021)
- Main Title:
- Unified distributed robust regression and variable selection framework for massive data
- Authors:
- Wang, Kangning
- Abstract:
- Abstract: This paper proposes a unified distributed robust regression framework for distributed massive data, which can include many robust regressions in one setting. Specifically, we first transfer different types of robust regressions into an asymptotically equivalent least-squares problem. Then the resulting estimator can be calculated as a weighted average of robust local estimators, and the communication cost is reduced, since it involves only one round of communication. In addition, since the local data information is incorporated sufficiently, it is adaptive to the heterogeneity. The new estimator is proven to be equivalent with the corresponding global robust regression estimator. Furthermore, we conduct variable selection based on the unified robust regression framework and adaptive LASSO, and the path of solution can also be conveniently obtained by LARS algorithm. It is theoretically shown that the new variable selection method can select true relevant variables consistently by using a new distributed BIC-type tuning parameter selector. The simulation results confirm the effectiveness of the new methods and the correctness of the theoretical results. Highlights: Unified distributed robust regression framework for massive data is proposed. The new method can be easily implemented on the master machine. The communication cost is significantly reduced. Theoretical properties are established under mild conditions.
- Is Part Of:
- Expert systems with applications. Volume 186(2021)
- Journal:
- Expert systems with applications
- Issue:
- Volume 186(2021)
- Issue Display:
- Volume 186, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 186
- Issue:
- 2021
- Issue Sort Value:
- 2021-0186-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-12-30
- Subjects:
- Distributed massive data -- Robust regression -- Communication efficiency -- Variable selection
Expert systems (Computer science) -- Periodicals
Systèmes experts (Informatique) -- Périodiques
Electronic journals
006.33 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09574174 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.eswa.2021.115701 ↗
- Languages:
- English
- ISSNs:
- 0957-4174
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3842.004220
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 19627.xml