A foundation for spatio-textual-temporal cube analytics. Issue 108 (September 2022)
- Record Type:
- Journal Article
- Title:
- A foundation for spatio-textual-temporal cube analytics. Issue 108 (September 2022)
- Main Title:
- A foundation for spatio-textual-temporal cube analytics
- Authors:
- Iqbal, Mohsin
Lissandrini, Matteo
Pedersen, Torben Bach - Abstract:
- Abstract: Large amounts of spatial, textual, and temporal (STT) data are being produced daily. This is data containing an unstructured component (text), a spatial component (geographic position), and a time component (timestamp). Therefore, there is a need for a powerful and general way of analyzing STT data together . In this paper, we define and formalize the Spatio-Textual-Temporal Cube (STTCube) structure to enable combined effective and efficient analytical queries over STT data . Our novel data model over STT objects enables novel joint and integrated STT insights that are hard to obtain using existing methods. Furthermore, our proposed STTCube Incremental Maintenance ( I M s t t ) method maintains the already constructed STTCube efficiently when new data arrives. Moreover, we introduce the new concept of STT measures with associated novel STT-OLAP operators. To allow for efficient large-scale analytics, we present a pre-aggregation framework for exact and approximate computation of STT measures . Our comprehensive experimental evaluation on a real-world Twitter dataset confirms that our proposed methods reduce query response time by 1–5 orders of magnitude compared to the No Materialization baseline and decrease storage cost between 97% and 99.9% compared to the Full Materialization baseline while adding only a negligible overhead in the STTCube construction time. Moreover, approximate computation achieves an accuracy between 90% and 100% while reducing query responseAbstract: Large amounts of spatial, textual, and temporal (STT) data are being produced daily. This is data containing an unstructured component (text), a spatial component (geographic position), and a time component (timestamp). Therefore, there is a need for a powerful and general way of analyzing STT data together . In this paper, we define and formalize the Spatio-Textual-Temporal Cube (STTCube) structure to enable combined effective and efficient analytical queries over STT data . Our novel data model over STT objects enables novel joint and integrated STT insights that are hard to obtain using existing methods. Furthermore, our proposed STTCube Incremental Maintenance ( I M s t t ) method maintains the already constructed STTCube efficiently when new data arrives. Moreover, we introduce the new concept of STT measures with associated novel STT-OLAP operators. To allow for efficient large-scale analytics, we present a pre-aggregation framework for exact and approximate computation of STT measures . Our comprehensive experimental evaluation on a real-world Twitter dataset confirms that our proposed methods reduce query response time by 1–5 orders of magnitude compared to the No Materialization baseline and decrease storage cost between 97% and 99.9% compared to the Full Materialization baseline while adding only a negligible overhead in the STTCube construction time. Moreover, approximate computation achieves an accuracy between 90% and 100% while reducing query response time by 3–5 orders of magnitude compared to No Materialization and I M s t t achieves an order of magnitude improvement in maintenance time compared to the baseline maintenance method. Highlights: We extend the standard cube model to add support for Spatial-Textual-Temporal (STT) data We also propose STT measures and a set of analytical operators (STTOLAP) over STT data We propose a pre-aggregation framework for the efficient computation of STT measures We compare STTCube's query response time, storage cost, and accuracy with baseline methods Our comprehensive experimental evaluation shows that STTCube outperforms all the baseline methods … (more)
- Is Part Of:
- Information systems. Issue 108(2022)
- Journal:
- Information systems
- Issue:
- Issue 108(2022)
- Issue Display:
- Volume 108, Issue 108 (2022)
- Year:
- 2022
- Volume:
- 108
- Issue:
- 108
- Issue Sort Value:
- 2022-0108-0108-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-09
- Subjects:
- Data cube -- OLAP -- Spatial analytics -- Textual analytics -- Spatio-textual-temporal data -- Spatial-textual-temporal measures
Database management -- Periodicals
Electronic data processing -- Periodicals
Bases de données -- Gestion -- Périodiques
Informatique -- Périodiques
Database management
Electronic data processing
Periodicals
005.7 - Journal URLs:
- http://www.sciencedirect.com/science/journal/03064379 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.is.2022.102009 ↗
- Languages:
- English
- ISSNs:
- 0306-4379
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4496.367300
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 21565.xml