Extraction of prosody for automatic speaker, language, emotion and speech recognition. (2019)
- Record Type:
- Book
- Title:
- Extraction of prosody for automatic speaker, language, emotion and speech recognition. (2019)
- Main Title:
- Extraction of prosody for automatic speaker, language, emotion and speech recognition
- Further Information:
- Note: Leena Mary.
- Authors:
- Mary, Leena
- Contents:
- Intro; Preface; Contents; 1 Significance of Prosody for Speaker, Language, Emotion, and Speech Recognition; 1.1 Introduction; 1.2 What is Prosody?; 1.2.1 Intonation; 1.2.2 Stress; 1.2.3 Rhythm; 1.3 Probabilistic Formulation of Recognition; 1.4 Significance of Prosody for Robust Recognition; 1.5 Automatic Speaker Recognition; 1.5.1 Speaker Recognition by Humans; 1.5.2 Speaker-Specific Aspect of Speech; 1.5.3 Significance of Prosody for Automatic Speaker Recognition; 1.6 Automatic Language Recognition; 1.6.1 Language Recognition by Humans; 1.6.2 Language-Specific Aspect of Speech 1.6.3 Significance of Prosody for Automatic Language Recognition1.6.3.1 Intonation; 1.6.3.2 Rhythm; 1.6.3.3 Stress; 1.7 Automatic Emotion Recognition; 1.7.1 Emotion Recognition by Humans; 1.7.2 Emotion-Specific Aspect of Speech; 1.7.3 Significance of Prosody for Automatic Emotion Recognition; 1.8 Automatic Speech Recognition; 1.8.1 Speech Recognition by Humans; 1.8.2 Significance of Prosody for Automatic Speech Recognition; 1.9 Summary; 2 Extraction and Representation of Prosody for Speaker, Language, Emotion, and Speech Recognition; 2.1 Introduction 2.2 ASR-Free Approaches for Automatic Segmentation and Representation of Prosody2.2.1 Syllable-Like Segmentation Using Location of Vowel Onset Points; 2.2.1.1 Detection of Vowel Onset Points; 2.2.1.2 Feature Parameterization; 2.2.1.3 Representation of Prosodic Features for Speaker Recognition; 2.2.1.4 Representation of Prosodic Features for LanguageIntro; Preface; Contents; 1 Significance of Prosody for Speaker, Language, Emotion, and Speech Recognition; 1.1 Introduction; 1.2 What is Prosody?; 1.2.1 Intonation; 1.2.2 Stress; 1.2.3 Rhythm; 1.3 Probabilistic Formulation of Recognition; 1.4 Significance of Prosody for Robust Recognition; 1.5 Automatic Speaker Recognition; 1.5.1 Speaker Recognition by Humans; 1.5.2 Speaker-Specific Aspect of Speech; 1.5.3 Significance of Prosody for Automatic Speaker Recognition; 1.6 Automatic Language Recognition; 1.6.1 Language Recognition by Humans; 1.6.2 Language-Specific Aspect of Speech 1.6.3 Significance of Prosody for Automatic Language Recognition1.6.3.1 Intonation; 1.6.3.2 Rhythm; 1.6.3.3 Stress; 1.7 Automatic Emotion Recognition; 1.7.1 Emotion Recognition by Humans; 1.7.2 Emotion-Specific Aspect of Speech; 1.7.3 Significance of Prosody for Automatic Emotion Recognition; 1.8 Automatic Speech Recognition; 1.8.1 Speech Recognition by Humans; 1.8.2 Significance of Prosody for Automatic Speech Recognition; 1.9 Summary; 2 Extraction and Representation of Prosody for Speaker, Language, Emotion, and Speech Recognition; 2.1 Introduction 2.2 ASR-Free Approaches for Automatic Segmentation and Representation of Prosody2.2.1 Syllable-Like Segmentation Using Location of Vowel Onset Points; 2.2.1.1 Detection of Vowel Onset Points; 2.2.1.2 Feature Parameterization; 2.2.1.3 Representation of Prosodic Features for Speaker Recognition; 2.2.1.4 Representation of Prosodic Features for Language Recognition; 2.2.2 Syllable-Like Segmentation Using Information from F0 and Energy Contour; 2.2.2.1 Feature Representation for Speaker, Language, and Emotion Recognition; 2.2.3 Syllable-Like Segmentation Using Detection of Vowel Region 2.2.3.1 Feature Representation for Language Recognition2.2.4 Segmentation Using Inflections or Start/End of Voicing; 2.2.5 Segmentation as Pseudo Syllables; 2.2.6 Segmentation at Predefined Intervals; 2.2.6.1 Feature Representation for Emotion Recognition; 2.2.7 Suprasegmental Parameterization; 2.2.8 Segmentation at Sentence/Phrase and Syllable Level; 2.2.8.1 Prosodic Features for Emotion Recognition; 2.3 ASR-Based Approaches for Extraction and Representation of Prosody; 2.3.1 Segmentation into Nonuniform Extraction Regions; 2.3.2 Segmentation into Pseudo Syllables; 2.4 Summary 3 Modeling and Fusion of Prosody for Speaker, Language, Emotion, and Speech Recognition3.1 Introduction; 3.2 Modeling of Prosody; 3.3 Speaker Recognition Systems Based on Prosody; 3.4 Language Recognition Systems Based on Prosody; 3.5 Emotion Recognition Systems Based on Prosody; 3.6 Speech Recognition Systems Based on Prosody; 3.7 Fusion of Prosodic Evidence into the Conventional Recognition Applications; 3.8 Summary; References … (more)
- Edition:
- Second edition
- Publisher Details:
- Cham, Switzerland : Springer
- Publication Date:
- 2019
- Extent:
- 1 online resource
- Subjects:
- 006.4/54
Engineering
Speech processing systems
Versification
Natural language processing (Computer science)
COMPUTERS / General
Natural language processing (Computer science)
Speech processing systems
Versification
Language Arts & Disciplines -- Linguistics -- General
Computers -- Speech & Audio Processing
Computational linguistics
Natural language & machine translation
Computational linguistics
Natural language processing (Computer science)
Technology & Engineering -- Electronics -- General
Imaging systems & technology
Electronic books - Languages:
- English
- ISBNs:
- 9783319911717
3319911716 - Related ISBNs:
- 9783319911700
3319911708 - Notes:
- Note: Includes bibliographical references.
Note: Online resource; title from PDF title page (Library Ideas, viewed Aug. 10, 2018). - Access Rights:
- Legal Deposit; Only available on premises controlled by the deposit library and to one user at any one time; The Legal Deposit Libraries (Non-Print Works) Regulations (UK).
- Access Usage:
- Restricted: Printing from this resource is governed by The Legal Deposit Libraries (Non-Print Works) Regulations (UK) and UK copyright law currently in force.
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library HMNTS - ELD.DS.367695
- Ingest File:
- 01_344.xml