An empirical evaluation of exact set similarity join techniques using GPUs. (March 2020)
- Record Type:
- Journal Article
- Title:
- An empirical evaluation of exact set similarity join techniques using GPUs. (March 2020)
- Main Title:
- An empirical evaluation of exact set similarity join techniques using GPUs
- Authors:
- Bellas, Christos
Gounaris, Anastasios - Abstract:
- Abstract: Exact set similarity join is a notoriously expensive operation, for which several solutions have been proposed. Recently, there have been studies that present a comparative analysis using MapReduce or a non-parallel setting. Our contribution is that we complement these works through conducting a thorough evaluation of the state-of-the-art GPU-enabled techniques. These techniques are highly diverse in their key features and our experiments manage to reveal the key strengths of each one. As we explain, in real-life applications there is no dominant solution. Depending on specific dataset and query characteristics, each solution, even not using the GPU at all, has its own sweet spot. All our work is repeatable and extensible. Highlights: A thorough evaluation showing the sweet spot of each different technique for exact set similarity joins using a GPU. In large threshold values the sequential CPU techniques are competitive. In lower threshold values, employing parallel GPU techniques seems beneficial. Overall, GPU techniques may perform worse due to the imposed quadratic space overhead. A CPU-GPU co-process scheme performs better in some cases due to efficient workload balance.
- Is Part Of:
- Information systems. Volume 89(2020)
- Journal:
- Information systems
- Issue:
- Volume 89(2020)
- Issue Display:
- Volume 89, Issue 2020 (2020)
- Year:
- 2020
- Volume:
- 89
- Issue:
- 2020
- Issue Sort Value:
- 2020-0089-2020-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-03
- Subjects:
- Set-similarity join -- GPU computing -- CUDA
Database management -- Periodicals
Electronic data processing -- Periodicals
Bases de données -- Gestion -- Périodiques
Informatique -- Périodiques
Database management
Electronic data processing
Periodicals
005.7 - Journal URLs:
- http://www.sciencedirect.com/science/journal/03064379 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.is.2019.101485 ↗
- Languages:
- English
- ISSNs:
- 0306-4379
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4496.367300
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 12668.xml