Approximate OLAP of document-oriented databases: A variety-aware approach. (November 2019)
- Record Type:
- Journal Article
- Title:
- Approximate OLAP of document-oriented databases: A variety-aware approach. (November 2019)
- Main Title:
- Approximate OLAP of document-oriented databases: A variety-aware approach
- Authors:
- Gallinucci, Enrico
Golfarelli, Matteo
Rizzi, Stefano - Abstract:
- Abstract: Schemaless databases, and document-oriented databases in particular, are preferred to relational ones for storing heterogeneous data with variable schemas and structural forms. However, the absence of a unique schema adds complexity to analytical applications, in which a single analysis often involves large sets of data with different schemas. In this paper we propose an original approach to OLAP on collections stored in document-oriented databases. The basic idea is to stop fighting against schema variety and welcome it as an inherent source of information wealth in schemaless sources. Our approach builds on four stages: schema extraction, schema integration, FD enrichment, and querying; these stages are discussed in detail in the paper. To make users aware of the impact of schema variety, we propose a set of indicators inspired by the definition of attribute density. Finally, we experimentally evaluate our approach in terms of efficiency and effectiveness. Highlights: The inherent variety of documents hinders proper OLAP analyses. We propose an approximated OLAP approach that captures and exploits schema variety. A multidimensional view is given by detecting approximated functional dependencies. We propose indicators to predict and evaluate the quality of OLAP queries. We show that the approach improves the coverage and precision of OLAP queries.
- Is Part Of:
- Information systems. Volume 85(2019)
- Journal:
- Information systems
- Issue:
- Volume 85(2019)
- Issue Display:
- Volume 85, Issue 2019 (2019)
- Year:
- 2019
- Volume:
- 85
- Issue:
- 2019
- Issue Sort Value:
- 2019-0085-2019-0000
- Page Start:
- 114
- Page End:
- 130
- Publication Date:
- 2019-11
- Subjects:
- NoSQL -- Document-oriented databases -- Multidimensional modeling -- OLAP
Database management -- Periodicals
Electronic data processing -- Periodicals
Bases de données -- Gestion -- Périodiques
Informatique -- Périodiques
Database management
Electronic data processing
Periodicals
005.7 - Journal URLs:
- http://www.sciencedirect.com/science/journal/03064379 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.is.2019.02.004 ↗
- Languages:
- English
- ISSNs:
- 0306-4379
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4496.367300
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 11052.xml