EQUALITY: Quality-aware intensive analytics on the edge. Issue 105 (March 2022)
- Record Type:
- Journal Article
- Title:
- EQUALITY: Quality-aware intensive analytics on the edge. Issue 105 (March 2022)
- Main Title:
- EQUALITY: Quality-aware intensive analytics on the edge
- Authors:
- Michailidou, Anna-Valentini
Gounaris, Anastasios
Symeonides, Moysis
Trihinas, Demetris - Abstract:
- Abstract: Our work is motivated by the fact that there is an increasing need to perform complex analytics jobs over streaming data as close to the edge devices as possible and, in parallel, it is important that data quality is considered as an optimization objective along with performance metrics. In this work, we develop a solution that trades latency for an increased fraction of incoming data, for which data quality-related measurements and operations are performed, in jobs running over geo-distributed heterogeneous and constrained resources. Our solution is hybrid: on the one hand, we perform search heuristics over locally optimal partial solutions to yield an enhanced global solution regarding task allocations; on the other hand, we employ a spring relaxation algorithm to avoid unnecessarily increased degree of partitioned parallelism. Through thorough experiments, we show that we can improve upon state-of-the-art solutions in terms of our objective function that combines latency and extent of quality checks by up to 2.56X. Moreover, we implement our solution within Apache Storm, and we perform experiments in an emulated setting. The results show that we can reduce the latency in 86.9% of the cases examined, while latency is up to 8 times lower compared to the built-in Storm scheduler, with the average latency reduction being 52.5%. Highlights: Quality-aware task allocation over heterogeneous geo-distributed devices. Real prototype developed and tested. SignificantAbstract: Our work is motivated by the fact that there is an increasing need to perform complex analytics jobs over streaming data as close to the edge devices as possible and, in parallel, it is important that data quality is considered as an optimization objective along with performance metrics. In this work, we develop a solution that trades latency for an increased fraction of incoming data, for which data quality-related measurements and operations are performed, in jobs running over geo-distributed heterogeneous and constrained resources. Our solution is hybrid: on the one hand, we perform search heuristics over locally optimal partial solutions to yield an enhanced global solution regarding task allocations; on the other hand, we employ a spring relaxation algorithm to avoid unnecessarily increased degree of partitioned parallelism. Through thorough experiments, we show that we can improve upon state-of-the-art solutions in terms of our objective function that combines latency and extent of quality checks by up to 2.56X. Moreover, we implement our solution within Apache Storm, and we perform experiments in an emulated setting. The results show that we can reduce the latency in 86.9% of the cases examined, while latency is up to 8 times lower compared to the built-in Storm scheduler, with the average latency reduction being 52.5%. Highlights: Quality-aware task allocation over heterogeneous geo-distributed devices. Real prototype developed and tested. Significant improvements over the state-of-the-art techniques. … (more)
- Is Part Of:
- Information systems. Issue 105(2022)
- Journal:
- Information systems
- Issue:
- Issue 105(2022)
- Issue Display:
- Volume 105, Issue 105 (2022)
- Year:
- 2022
- Volume:
- 105
- Issue:
- 105
- Issue Sort Value:
- 2022-0105-0105-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-03
- Subjects:
- Fog computing -- Optimization -- Sensors -- Data quality
Database management -- Periodicals
Electronic data processing -- Periodicals
Bases de données -- Gestion -- Périodiques
Informatique -- Périodiques
Database management
Electronic data processing
Periodicals
005.7 - Journal URLs:
- http://www.sciencedirect.com/science/journal/03064379 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.is.2021.101953 ↗
- Languages:
- English
- ISSNs:
- 0306-4379
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4496.367300
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 20307.xml