Linguistic summarization of event logs – A practical approach. (July 2017)
- Record Type:
- Journal Article
- Title:
- Linguistic summarization of event logs – A practical approach. (July 2017)
- Main Title:
- Linguistic summarization of event logs – A practical approach
- Authors:
- Dijkman, Remco
Wilbik, Anna
- Abstract:
- Highlights: Automatically create diagnostic statements about a business process. Efficiently explore a search space of 10^19 possible statements. Summaries are condensed 80–97% in size compared to previous work.
Abstract: The amount of data that is generated during the execution of a business process is growing. As a consequence it is increasingly hard to extract useful information from the large amount of data that is produced. Linguistic summarization helps to point business analysts in the direction of useful information, by verbalizing interesting patterns that exist in the data. In previous work we showed how linguistic summarization can be used to automatically generate diagnostic statements about event logs, such as 'for most cases that contained the sequence ABC, the throughput time was long'. However, we also showed that our technique produced too many of these statements to be useful in a practical setting. Therefore this paper presents a novel technique for linguistic summarization of event logs, which generates linguistic summaries that are concise enough to be used in a practical setting, while at the same time enriching the summaries that are produced by also enabling conjunctive statements. The improved technique is based on pruning and clustering of linguistic summaries. We show that it can be used to reduce the number of summary statements 80–100% compared to previous work. In a survey among 51 practitioners, we found that practitioners consider linguistic summarization useful and easy to use and intend to use it if it were commercially available.
- Is Part Of:
- Information systems. Volume 67(2017)
- Journal:
- Information systems
- Issue:
- Volume 67(2017)
- Issue Display:
- Volume 67, Issue 2017 (2017)
- Year:
- 2017
- Volume:
- 67
- Issue:
- 2017
- Issue Sort Value:
- 2017-0067-2017-0000
- Page Start:
- 114
- Page End:
- 125
- Publication Date:
- 2017-07
- Subjects:
- Linguistic summarization -- Business process -- Event log -- Data mining -- Similarity clustering
Database management -- Periodicals
Electronic data processing -- Periodicals
Bases de données -- Gestion -- Périodiques
Informatique -- Périodiques
Database management
Electronic data processing
Periodicals
005.7
- Journal URLs:
- http://www.sciencedirect.com/science/journal/03064379
http://www.elsevier.com/journals
- DOI:
- 10.1016/j.is.2017.03.009
- Languages:
- English
- ISSNs:
- 0306-4379
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 4496.367300
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 165.xml