A data mining approach to evaluate suitability of dissolved oxygen sensor observations for lake metabolism analysis. (4th October 2018)
- Record Type:
- Journal Article
- Title:
- A data mining approach to evaluate suitability of dissolved oxygen sensor observations for lake metabolism analysis. (4th October 2018)
- Main Title:
- A data mining approach to evaluate suitability of dissolved oxygen sensor observations for lake metabolism analysis
- Authors:
- Muraoka, Kohji
Hanson, Paul
Frank, Eibe
Jiang, Meilan
Chiu, Kenneth
Hamilton, David - Abstract:
- Abstract: Despite rapid growth in continuous monitoring of dissolved oxygen for lake metabolism studies, the current best practice still relies on visual assessment and manual data filtering of sensor observations by experienced scientists in order to achieve meaningful results. This time consuming approach is fraught with potential for inconsistency and individual subjectivity. An automated method to assure the quality of data for the purpose of metabolism modeling is clearly needed to obtain consistent results representative of collective expertise. We used a hybrid approach of expert panel and data mining for data filtration. Symbolic Aggregate approXimation (SAX) treats discretized numerical timeseries segments as symbolic indications, creating a series of strings which are literally comparable to human words and sentences. This conversion allows established text mining techniques, such as classification methods to be applied to timeseries data. Half‐hourly frequency surface dissolved oxygen data from 18 global lakes were used to create day‐long segments of the original time series data. Three hundred sets of 1‐d measurements were provided to a group of seven anonymous experts, experienced in manual filtering of oxygen data for metabolism modeling studies. The collective results were treated as expert panel decisions, and were used to rank the data by confidence level for use in metabolism calculations. While considerable variation occurred in the way the expertsAbstract: Despite rapid growth in continuous monitoring of dissolved oxygen for lake metabolism studies, the current best practice still relies on visual assessment and manual data filtering of sensor observations by experienced scientists in order to achieve meaningful results. This time consuming approach is fraught with potential for inconsistency and individual subjectivity. An automated method to assure the quality of data for the purpose of metabolism modeling is clearly needed to obtain consistent results representative of collective expertise. We used a hybrid approach of expert panel and data mining for data filtration. Symbolic Aggregate approXimation (SAX) treats discretized numerical timeseries segments as symbolic indications, creating a series of strings which are literally comparable to human words and sentences. This conversion allows established text mining techniques, such as classification methods to be applied to timeseries data. Half‐hourly frequency surface dissolved oxygen data from 18 global lakes were used to create day‐long segments of the original time series data. Three hundred sets of 1‐d measurements were provided to a group of seven anonymous experts, experienced in manual filtering of oxygen data for metabolism modeling studies. The collective results were treated as expert panel decisions, and were used to rank the data by confidence level for use in metabolism calculations. While considerable variation occurred in the way the experts perceived the quality of the data, the model provides an objective and quantitative assessment method. The program output will assist the decision making process in determining whether data should be used for metabolism calculations. An R version of the program is available for download. … (more)
- Is Part Of:
- Limnology and oceanography, methods. Volume 16:Number 11(2018:Nov.)
- Journal:
- Limnology and oceanography, methods
- Issue:
- Volume 16:Number 11(2018:Nov.)
- Issue Display:
- Volume 16, Issue 11 (2018)
- Year:
- 2018
- Volume:
- 16
- Issue:
- 11
- Issue Sort Value:
- 2018-0016-0011-0000
- Page Start:
- 787
- Page End:
- 801
- Publication Date:
- 2018-10-04
- Subjects:
- Limnology -- Methodology -- Periodicals
Oceanography -- Methodology -- Periodicals
551.48 - Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)1541-5856 ↗
http://www.aslo.org/lomethods ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1002/lom3.10283 ↗
- Languages:
- English
- ISSNs:
- 1541-5856
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 8500.xml