Semi-supervised time series classification on positive and unlabeled problems using cross-recurrence quantification analysis. (August 2018)
- Record Type:
- Journal Article
- Title:
- Semi-supervised time series classification on positive and unlabeled problems using cross-recurrence quantification analysis. (August 2018)
- Main Title:
- Semi-supervised time series classification on positive and unlabeled problems using cross-recurrence quantification analysis
- Authors:
- de Carvalho Pagliosa, Lucas
de Mello, Rodrigo Fernandes - Abstract:
- Highlights: We show time-domain similarity measurements lead to inconsistent classification due to the noise and local differences; We use CRQA to compare time series recurrences on Positive and Unlabeled scenarios; Our approach has achieved better classification performances while classifying time series from natural phenomena. Abstract: When dealing with semi-supervised scenarios, the Positive and Unlabeled (PU) problem is a special case in which few labeled examples from a single class of interest are received to proceed with the classification of unseen instances, according to their similarities with the known class. In the scope of time series, most of the current studies propose to address this subject using a self-training approach based on the 1-Nearest Neighbor algorithm. In order to compute the most similar instance, they compare features along the time domain using the Euclidean Distance and the Dynamic Time Warping-Delta. Despite time-domain measurements permit the analysis of local series shapes, they disconsider temporal recurrences commonly found in natural phenomena (e.g. population growth, climate studies) and are more sensitive to local noise and fluctuations, leading to poor classification performances as confirmed in this paper. This drawback motivated us to propose the use of the Maximum Diagonal Line of the Cross-Recurrence Quantification Analysis (MDL-CRQA), applied on the time series phase space, as similarity measurement. The phase space is obtainedHighlights: We show time-domain similarity measurements lead to inconsistent classification due to the noise and local differences; We use CRQA to compare time series recurrences on Positive and Unlabeled scenarios; Our approach has achieved better classification performances while classifying time series from natural phenomena. Abstract: When dealing with semi-supervised scenarios, the Positive and Unlabeled (PU) problem is a special case in which few labeled examples from a single class of interest are received to proceed with the classification of unseen instances, according to their similarities with the known class. In the scope of time series, most of the current studies propose to address this subject using a self-training approach based on the 1-Nearest Neighbor algorithm. In order to compute the most similar instance, they compare features along the time domain using the Euclidean Distance and the Dynamic Time Warping-Delta. Despite time-domain measurements permit the analysis of local series shapes, they disconsider temporal recurrences commonly found in natural phenomena (e.g. population growth, climate studies) and are more sensitive to local noise and fluctuations, leading to poor classification performances as confirmed in this paper. This drawback motivated us to propose the use of the Maximum Diagonal Line of the Cross-Recurrence Quantification Analysis (MDL-CRQA), applied on the time series phase space, as similarity measurement. The phase space is obtained after applying Takens embedding theorem on the series, unfolding temporal relationships and dependencies among data observations. As consequence, by comparing phase spaces rather than the series themselves, we can assess how their trajectories evolve along time, including their periodicities and temporal cycles, as well as decreasing noise influences. Experimental results confirm MDL-CRQA improves classification results for PU time series when compared against the mostly used time-domain similarity measurements. … (more)
- Is Part Of:
- Pattern recognition. Volume 80(2018:Aug.)
- Journal:
- Pattern recognition
- Issue:
- Volume 80(2018:Aug.)
- Issue Display:
- Volume 80 (2018)
- Year:
- 2018
- Volume:
- 80
- Issue Sort Value:
- 2018-0080-0000-0000
- Page Start:
- 53
- Page End:
- 63
- Publication Date:
- 2018-08
- Subjects:
- Time series -- Semi-supervised classification -- Positive and unlabeled -- Self-training -- Phase space -- Cross-recurrence quantification analysis
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2018.02.030 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 6399.xml