A comparative evaluation of aggregation methods for machine learning over vertically partitioned data. (15th August 2020)

Record Type:: Journal Article
Title:: A comparative evaluation of aggregation methods for machine learning over vertically partitioned data. (15th August 2020)
Main Title:: A comparative evaluation of aggregation methods for machine learning over vertically partitioned data
Authors:: Trevizan, Bernardo
Chamby-Diaz, Jorge
Bazzan, Ana L.C.
Recamonde-Mendoza, Mariana
Abstract:: Highlights: We compare aggregation methods for vertically partitioned data in several scenarios. Impact of datasets characteristics over aggregators' performance is investigated. Silhouette and imbalance coefficient are the most influential characteristics. Characteristics impact varies according to the specific scenario. Decision and regression trees are trained to guide the aggregator choice. Abstract: It is increasingly common applications where data are naturally generated in a distributed fashion, especially after the emergence of technologies like the Internet of Things (IoT). In sensor networks, in collaborative health or genomic projects, in credit risk analysis, among other domains, distinct features are collected from multiple sources, including the use of social media and mobile applications, and due to privacy concerns or communication costs, may not be shared among sites. This scenario of vertical data partitioning poses challenges to traditional machine learning (ML) approaches, as classical algorithms are designed to learn from the complete set of features. A common strategy is to combine predictions from local models trained at each site into a global model, and for this purpose, several aggregation methods have been proposed. In this work we tackle a gap within the related literature, performing a comparative evaluation of elementary and meta-learning-based aggregation methods to reveal their strengths and weakness for 46 datasets with varied … (more)
Is Part Of:: Expert systems with applications. Volume 152(2020)
Journal:: Expert systems with applications
Issue:: Volume 152(2020)
Issue Display:: Volume 152, Issue 2020 (2020)
Year:: 2020
Volume:: 152
Issue:: 2020
Issue Sort Value:: 2020-0152-2020-0000
Page Start:
Page End:
Publication Date:: 2020-08-15
Subjects:: Vertical data partitioning -- Distributed machine learning -- Classification -- Predictions aggregation -- Attribute-partitioned data
Expert systems (Computer science) -- Periodicals
Systèmes experts (Informatique) -- Périodiques
Electronic journals
006.33
Journal URLs:: http://www.sciencedirect.com/science/journal/09574174 ↗
http://www.elsevier.com/journals ↗
DOI:: 10.1016/j.eswa.2020.113406 ↗
Languages:: English
ISSNs:: 0957-4174
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 3842.004220
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 13613.xml