Harmonise and integrate heterogeneous areal data with the R package arealDB. (November 2020)
- Record Type:
- Journal Article
- Title:
- Harmonise and integrate heterogeneous areal data with the R package arealDB. (November 2020)
- Main Title:
- Harmonise and integrate heterogeneous areal data with the R package arealDB
- Authors:
- Ehrmann, Steffen
Seppelt, Ralf
Meyer, Carsten
- Abstract:
- Many relevant applications in the environmental and socioeconomic sciences use areal data, such as biodiversity checklists, agricultural statistics, or socioeconomic surveys. For applications that surpass the spatial, temporal or thematic scope of any single data source, data must be integrated from several heterogeneous sources. Inconsistent concepts, definitions, or messy data tables make this a tedious and error-prone process. To date, a dedicated tool to address these challenges is still lacking. Here, we introduce the R package arealDB that integrates heterogeneous areal data and associated geometries into a consistent database, in an easy-to-use workflow. It is useful for harmonising language and semantics of variables, relating data to geometries, and documenting metadata and provenance. We illustrate the functionality by integrating two disparate datasets (Brazil, USA) on the harvested area of soybean. arealDB promises quality-improvements to downstream scientific, monitoring, and management applications but also substantial time-savings to database collation efforts.
Highlights: Areal data are census, checklist, indicator, or other data linked to polygon units. arealDB harmonises and integrates disparate areal data into a consistent database. arealDB considers semantic differences in source data and documents data provenance. arealDB combines various complex expert tools into one intuitive, easy-to-use tool. arealDB promises time-savings and quality improvements to downstream applications.
- Is Part Of:
- Environmental modelling & software. Volume 133(2020)
- Journal:
- Environmental modelling & software
- Issue:
- Volume 133(2020)
- Issue Display:
- Volume 133, Issue 2020 (2020)
- Year:
- 2020
- Volume:
- 133
- Issue:
- 2020
- Issue Sort Value:
- 2020-0133-2020-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-11
- Subjects:
- Interoperability -- Census data -- Indicator data -- Polygon data -- Data warehouse -- Provenance documentation
Environmental monitoring -- Computer programs -- Periodicals
Ecology -- Computer simulation -- Periodicals
Digital computer simulation -- Periodicals
Computer software -- Periodicals
Environmental Monitoring -- Periodicals
Computer Simulation -- Periodicals
Environnement -- Surveillance -- Logiciels -- Périodiques
Écologie -- Simulation, Méthodes de -- Périodiques
Simulation par ordinateur -- Périodiques
Logiciels -- Périodiques
Computer software
Digital computer simulation
Ecology -- Computer simulation
Environmental monitoring -- Computer programs
Periodicals
Electronic journals
363.70015118
- Journal URLs:
- http://www.sciencedirect.com/science/journal/13648152
http://www.elsevier.com/journals
- DOI:
- 10.1016/j.envsoft.2020.104799
- Languages:
- English
- ISSNs:
- 1364-8152
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 3791.522800
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 14543.xml