From knowledge-driven to data-driven inter-case feature encoding in predictive process monitoring. (September 2019)
- Record Type:
- Journal Article
- Title:
- From knowledge-driven to data-driven inter-case feature encoding in predictive process monitoring. (September 2019)
- Main Title:
- From knowledge-driven to data-driven inter-case feature encoding in predictive process monitoring
- Authors:
- Senderovich, Arik
Francescomarino, Chiara Di
Maggi, Fabrizio Maria
- Abstract:
- Predictive process monitoring (PPM) is a research area that focuses on predicting measures of interest (e.g., the completion time) for running cases based on event logs. State-of-the-art PPM techniques only consider intra-case information that comes from the case whose measures of interest one wishes to predict. However, in many systems, the outcome of a running case depends on the interplay of all cases that are being executed concurrently, or can be derived from the characteristics of cases that are executed in the same period of time. For example, in many situations, running cases compete over scarce resources, and the completion time of a running case can be derived from the number of similar cases running concurrently. In this work, we present a general framework for feature encoding that relies on a bi-dimensional state space representation. The first dimension corresponds to intra-case dependencies and utilizes existing feature encoding techniques. The second dimension encodes inter-case features using two approaches: (1) a knowledge-driven encoding (KDE), which assumes prior knowledge on case types, and (2) a data-driven encoding (DDE), which automatically identifies case types from data using case proximity metrics. Both approaches partition the event log into sets of cases that share common characteristics, and derive features according to these commonalities. We demonstrate the usefulness of the proposed framework with an empirical evaluation carried out against two real-life datasets coming from an outpatient hospital process and a manufacturing process. Highlights: A data-driven approach for inter-case feature encoding. A k-nearest-neighbors-like approach for measuring distances between cases. An experimental evaluation with real-life event logs.
- Is Part Of:
- Information systems. Volume 84(2019)
- Journal:
- Information systems
- Issue:
- Volume 84(2019)
- Issue Display:
- Volume 84, Issue 2019 (2019)
- Year:
- 2019
- Volume:
- 84
- Issue:
- 2019
- Issue Sort Value:
- 2019-0084-2019-0000
- Page Start:
- 255
- Page End:
- 264
- Publication Date:
- 2019-09
- Subjects:
- Predictive process monitoring -- Inter-case predictions -- Knowledge-driven encoding -- Data-driven encoding
Database management -- Periodicals
Electronic data processing -- Periodicals
Bases de données -- Gestion -- Périodiques
Informatique -- Périodiques
Database management
Electronic data processing
Periodicals
005.7
- Journal URLs:
- http://www.sciencedirect.com/science/journal/03064379
http://www.elsevier.com/journals
- DOI:
- 10.1016/j.is.2019.01.007
- Languages:
- English
- ISSNs:
- 0306-4379
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 4496.367300
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 10968.xml