A template framework for environmental timeseries data acquisition. (July 2019)
- Record Type:
- Journal Article
- Title:
- A template framework for environmental timeseries data acquisition. (July 2019)
- Main Title:
- A template framework for environmental timeseries data acquisition
- Authors:
- Samourkasidis, Argyrios
Papoutsoglou, Evangelia
Athanasiadis, Ioannis N. - Abstract:
- Abstract: Environmental timeseries data variety is exploding in the Internet of Things era, making data reuse a very demanding task. Data acquisition and integration remains a laborious step of the environmental data lifecycle. Environmental data heterogeneity is a persistent issue, as data are becoming available through different protocols and stored under diverse, custom formats. In this work, we deal with syntactic heterogeneity in environmental timeseries data. Our approach is based on describing different dataset syntaxes using abstract representations, called templates. We designed and implemented EDAM (Environmental Data Acquisition Module), a template framework that facilitates timeseries data acquisition and integration. EDAM templates are written using programming language-agnostic semantics, and can be reused both for input and output, thus enabling data reuse via transformations across different formats. We demonstrate EDAM generality in seven case studies, which involve scraping online data, extracting observations from a relational database, or aggregating historical timeseries stored in local files. Case studies span different environmental sciences domains, including meteorology, agriculture, urban air quality and hydrology. We also demonstrate EDAM for data dissemination, as instructed by output templates. We identified several syntactic interoperability challenges though the case studies, that include managing with differences in formatting observables,Abstract: Environmental timeseries data variety is exploding in the Internet of Things era, making data reuse a very demanding task. Data acquisition and integration remains a laborious step of the environmental data lifecycle. Environmental data heterogeneity is a persistent issue, as data are becoming available through different protocols and stored under diverse, custom formats. In this work, we deal with syntactic heterogeneity in environmental timeseries data. Our approach is based on describing different dataset syntaxes using abstract representations, called templates. We designed and implemented EDAM (Environmental Data Acquisition Module), a template framework that facilitates timeseries data acquisition and integration. EDAM templates are written using programming language-agnostic semantics, and can be reused both for input and output, thus enabling data reuse via transformations across different formats. We demonstrate EDAM generality in seven case studies, which involve scraping online data, extracting observations from a relational database, or aggregating historical timeseries stored in local files. Case studies span different environmental sciences domains, including meteorology, agriculture, urban air quality and hydrology. We also demonstrate EDAM for data dissemination, as instructed by output templates. We identified several syntactic interoperability challenges though the case studies, that include managing with differences in formatting observables, temporal and spatial references, and metadata documentation, and addressed them with EDAM. EDAM implementation has been released under an open-source license. … (more)
- Is Part Of:
- Environmental modelling & software. Volume 117(2019)
- Journal:
- Environmental modelling & software
- Issue:
- Volume 117(2019)
- Issue Display:
- Volume 117, Issue 2019 (2019)
- Year:
- 2019
- Volume:
- 117
- Issue:
- 2019
- Issue Sort Value:
- 2019-0117-2019-0000
- Page Start:
- 237
- Page End:
- 249
- Publication Date:
- 2019-07
- Subjects:
- Environmental timeseries -- Internet of things -- Syntactic interoperability -- Data acquisition -- Templates -- Big data
Environmental monitoring -- Computer programs -- Periodicals
Ecology -- Computer simulation -- Periodicals
Digital computer simulation -- Periodicals
Computer software -- Periodicals
Environmental Monitoring -- Periodicals
Computer Simulation -- Periodicals
Environnement -- Surveillance -- Logiciels -- Périodiques
Écologie -- Simulation, Méthodes de -- Périodiques
Simulation par ordinateur -- Périodiques
Logiciels -- Périodiques
Computer software
Digital computer simulation
Ecology -- Computer simulation
Environmental monitoring -- Computer programs
Periodicals
Electronic journals
363.70015118 - Journal URLs:
- http://www.sciencedirect.com/science/journal/13648152 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.envsoft.2018.10.009 ↗
- Languages:
- English
- ISSNs:
- 1364-8152
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3791.522800
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 10070.xml