SPMgr: Dynamic workflow manager for sampling and filtering data streams over Apache Storm. (July 2019)
- Record Type:
- Journal Article
- Title:
- SPMgr: Dynamic workflow manager for sampling and filtering data streams over Apache Storm. (July 2019)
- Main Title:
- SPMgr: Dynamic workflow manager for sampling and filtering data streams over Apache Storm
- Authors:
- Kim, Youngkuk
Son, Siwoon
Moon, Yang-Sae - Abstract:
- In this article, we address dynamic workflow management for sampling and filtering data streams in Apache Storm. As many sensors generate data streams continuously, we often use sampling to choose some representative data or filtering to remove unnecessary data. Apache Storm is a real-time distributed processing platform suitable for handling large data streams. Storm, however, must stop the entire work when it changes the input data structure or processing algorithm as it needs to modify, redistribute, and restart the programs. In addition, for effective data processing, we often use Storm with Kafka and databases, but it is difficult to use these platforms in an integrated manner. In this article, we derive the problems when applying sampling and filtering algorithms to Storm and propose a dynamic workflow management model that solves these problems. First, we present the concept of a plan consisting of input, processing, and output modules of a data stream. Second, we propose Storm Plan Manager, which can operate Storm, Kafka, and database as a single integrated system. Storm Plan Manager is an integrated workflow manager that dynamically controls sampling and filtering of data streams through plans. Third, as a key feature, Storm Plan Manager provides a Web client interface to visually create, execute, and monitor plans. In this article, we show the usefulness of the proposed Storm Plan Manager by presenting its design, implementation, and experimental results in order.
- Is Part Of:
- International journal of distributed sensor networks. Volume 15:Number 7(2019)
- Journal:
- International journal of distributed sensor networks
- Issue:
- Volume 15:Number 7(2019)
- Issue Display:
- Volume 15, Issue 7 (2019)
- Year:
- 2019
- Volume:
- 15
- Issue:
- 7
- Issue Sort Value:
- 2019-0015-0007-0000
- Page Start:
- Page End:
- Publication Date:
- 2019-07
- Subjects:
- Data stream -- Apache Storm -- data sampling -- data filtering -- distributed processing -- workflow management
Sensor networks -- Periodicals
Intelligent agents (Computer software) -- Periodicals
Multisensor data fusion -- Periodicals
681.2 - Journal URLs:
- http://www.informaworld.com/smpp/title~content=t714578688~db=all ↗
http://www.metapress.com/openurl.asp?genre=journal&issn=1550-1329 ↗
http://dsn.sagepub.com/ ↗
http://www.tandfonline.com/ ↗ - DOI:
- 10.1177/1550147719862206 ↗
- Languages:
- English
- ISSNs:
- 1550-1329
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4542.186400
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 11053.xml