Semantic language models with deep neural networks. (November 2016)
- Record Type:
- Journal Article
- Title:
- Semantic language models with deep neural networks. (November 2016)
- Main Title:
- Semantic language models with deep neural networks
- Authors:
- Bayer, Ali Orkan
Riccardi, Giuseppe
- Abstract:
- Highlights: We incorporate semantic features into LMs via theory of frame semantics. We use deep autoencoders for noise robust encoding of semantic context. Deep semantic encodings on target words are more noise robust. Semantic LMs (SELMs) have a better recognition performance on target words. SELMs have a better understanding performance on the frame identification task.
Abstract: In this paper we explore the use of semantics in training language models for automatic speech recognition and spoken language understanding. Traditional language models (LMs) do not consider the semantic constraints and train models based on fixed-sized word histories. The theory of frame semantics analyzes word meanings and their constructs by using "semantic frames". Semantic frames represent a linguistic scene with its relevant participants and their relations. They are triggered by target words and include slots which are filled by frame elements. We present semantic LMs (SELMs), which use recurrent neural network architectures and the linguistic scene of frame semantics as context. SELMs incorporate semantic features which are extracted from semantic frames and target words. In this way, long-range and "latent" dependencies, i.e. the implicit semantic dependencies between words, are incorporated into LMs. This is crucial especially when the main aim of spoken language systems is understanding what the user means. Semantic features consist of low-level features, where frame and target information is directly used; and deep semantic encodings, where deep autoencoders are used to extract semantic features. We evaluate the performance of SELMs on publicly available corpora: the Wall Street Journal read-speech corpus and the LUNA human–human conversational corpus. The encoding of semantic frames into SELMs improves the word recognition performance and especially the recognition performance of the target words, the meaning bearing elements of semantic frames. We assess the performance of SELMs for the understanding tasks and we show that SELMs yield better semantic frame identification performance compared to recurrent neural network LMs.
- Is Part Of:
- Computer speech & language. Volume 40(2016)
- Journal:
- Computer speech & language
- Issue:
- Volume 40(2016)
- Issue Display:
- Volume 40, Issue 2016 (2016)
- Year:
- 2016
- Volume:
- 40
- Issue:
- 2016
- Issue Sort Value:
- 2016-0040-2016-0000
- Page Start:
- 1
- Page End:
- 22
- Publication Date:
- 2016-11
- Subjects:
- Language modeling -- Recurrent neural networks -- Frame semantics -- Semantic language models -- Deep autoencoders
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454
- Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/
http://www.elsevier.com/journals
- DOI:
- 10.1016/j.csl.2016.04.001
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 7560.xml