Multiple topic identification in human/human conversations. (November 2015)
- Record Type:
- Journal Article
- Title:
- Multiple topic identification in human/human conversations. (November 2015)
- Main Title:
- Multiple topic identification in human/human conversations
- Authors:
- Bost, X.
Senay, G.
El-Bèze, M.
De Mori, R. - Abstract:
- Abstract : Highlights: A multiple classification methods for multiple theme hypothesization is proposed. Four methods, one of which is new, are initially used and separately evaluated. A new sequential decision strategy for multiple theme hypothesization is introduced. A new hypothesis refinancing component is presented, based on ASR word lattice. Results show that the strategy makes it possible to obtain reliable service surveys. Abstract: The paper deals with the automatic analysis of real-life telephone conversations between an agent and a customer of a customer care service (ccs ). The application domain is the public transportation system in Paris and the purpose is to collect statistics about customer problems in order to monitor the service and decide priorities on the intervention for improving user satisfaction. Of primary importance for the analysis is the detection of themes that are the object of customer problems. Themes are defined in the application requirements and are part of the application ontology that is implicit in theccs documentation. Due to variety of customer population, the structure of conversations with an agent is unpredictable. A conversation may be about one or more themes. Theme mentions can be interleaved with mentions of facts that are irrelevant for the application purpose. Furthermore, in certain conversations theme mentions are localized in specific conversation segments while in other conversations mentions cannot be localized. As aAbstract : Highlights: A multiple classification methods for multiple theme hypothesization is proposed. Four methods, one of which is new, are initially used and separately evaluated. A new sequential decision strategy for multiple theme hypothesization is introduced. A new hypothesis refinancing component is presented, based on ASR word lattice. Results show that the strategy makes it possible to obtain reliable service surveys. Abstract: The paper deals with the automatic analysis of real-life telephone conversations between an agent and a customer of a customer care service (ccs ). The application domain is the public transportation system in Paris and the purpose is to collect statistics about customer problems in order to monitor the service and decide priorities on the intervention for improving user satisfaction. Of primary importance for the analysis is the detection of themes that are the object of customer problems. Themes are defined in the application requirements and are part of the application ontology that is implicit in theccs documentation. Due to variety of customer population, the structure of conversations with an agent is unpredictable. A conversation may be about one or more themes. Theme mentions can be interleaved with mentions of facts that are irrelevant for the application purpose. Furthermore, in certain conversations theme mentions are localized in specific conversation segments while in other conversations mentions cannot be localized. As a consequence, approaches to feature extraction with and without mention localization are considered. Application domain relevant themes identified by an automatic procedure are expressed by specific sentences whose words are hypothesized by an automatic speech recognition (asr ) system. Theasr system is error prone. The word error rates can be very high for many reasons. Among them it is worth mentioning unpredictable background noise, speaker accent, and various types of speech disfluencies. As the application task requires the composition of proportions of theme mentions, a sequential decision strategy is introduced in this paper for performing a survey of the large amount of conversations made available in a given time period. The strategy has to sample the conversations to form a survey containing enough data analyzed with high accuracy so that proportions can be estimated with sufficient accuracy. Due to the unpredictable type of theme mentions, it is appropriate to consider methods for theme hypothesization based on global as well as local feature extraction. Two systems based on each type of feature extraction will be considered by the strategy. One of the four methods is novel. It is based on a new definition of density of theme mentions and on the localization of high density zones whose boundaries do not need to be precisely detected. The sequential decision strategy starts by grouping theme hypotheses into sets of different expected accuracy and coverage levels. For those sets for which accuracy can be improved with a consequent increase of coverage a new system with new features is introduced. Its execution is triggered only when specific preconditions are met on the hypotheses generated by the basic four systems. Experimental results are provided on a corpus collected in the call center of the Paris transportation system known asratp . The results show that surveys with high accuracy and coverage can be composed with the proposed strategy and systems. This makes it possible to apply a previously published proportion estimation approach that takes into account hypothesization errors. … (more)
- Is Part Of:
- Computer speech & language. Volume 34(2015)
- Journal:
- Computer speech & language
- Issue:
- Volume 34(2015)
- Issue Display:
- Volume 34, Issue 2015 (2015)
- Year:
- 2015
- Volume:
- 34
- Issue:
- 2015
- Issue Sort Value:
- 2015-0034-2015-0000
- Page Start:
- 18
- Page End:
- 42
- Publication Date:
- 2015-11
- Subjects:
- Human/human conversation analysis -- Multi-topic identification -- Spoken language understanding -- Interpretation strategies
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2015.03.006 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 6446.xml