The similarity-aware relational database set operators. (July 2016)
- Record Type:
- Journal Article
- Title:
- The similarity-aware relational database set operators. (July 2016)
- Main Title:
- The similarity-aware relational database set operators
- Authors:
- Al Marri, Wadha J.
Malluhi, Qutaibah
Ouzzani, Mourad
Tang, Mingjie
Aref, Walid G. - Abstract:
- Abstract: Identifying similarities in large datasets is an essential operation in several applications such as bioinformatics, pattern recognition, and data integration. To make a relational database management system similarity-aware, the core relational operators have to be extended. While similarity-awareness has been introduced in database engines for relational operators such as joins and group-by, little has been achieved for relational set operators, namely Intersection, Difference, and Union . In this paper, we propose to extend the semantics of relational set operators to take into account the similarity of values. We develop efficient query processing algorithms for evaluating them, and implement these operators inside an open-source database system, namely PostgreSQL. By extending several queries from the TPC-H benchmark to include predicates that involve similarity-based set operators, we perform extensive experiments that demonstrate up to three orders of magnitude speedup in performance over equivalent queries that only employ regular operators.
- Is Part Of:
- Information systems. Volume 59(2016)
- Journal:
- Information systems
- Issue:
- Volume 59(2016)
- Issue Display:
- Volume 59, Issue 2016 (2016)
- Year:
- 2016
- Volume:
- 59
- Issue:
- 2016
- Issue Sort Value:
- 2016-0059-2016-0000
- Page Start:
- 79
- Page End:
- 93
- Publication Date:
- 2016-07
- Subjects:
- Similarity query processing -- Relational databases -- Set operators
Database management -- Periodicals
Electronic data processing -- Periodicals
Bases de données -- Gestion -- Périodiques
Informatique -- Périodiques
Database management
Electronic data processing
Periodicals
005.7 - Journal URLs:
- http://www.sciencedirect.com/science/journal/03064379 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.is.2015.10.008 ↗
- Languages:
- English
- ISSNs:
- 0306-4379
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4496.367300
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 366.xml