Attentional triplet neural networks for text-dependent speaker verification. (August 2020)
- Record Type:
- Journal Article
- Title:
- Attentional triplet neural networks for text-dependent speaker verification. (August 2020)
- Main Title:
- Attentional triplet neural networks for text-dependent speaker verification
- Authors:
- Wang, Yiyang
Zhang, Wenjun - Abstract:
- Abstract: Deep Neural Networks (DNNs) have been widely used in speech processing and show great performance on a range of tasks, such as speech recognition, machine translation, and speaker verification. In this paper we propose a new type of DNN model for text-dependent speaker verification. The frame-level features, being extracted by DNN, are usually equally weighted and aggregated (or averaged) to compute an utterance-level speaker representation. We combine Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN) to extract frame-level features, then provide every frame with different weights produced by attention mechanism, so that utterance-level speaker representation can be generated by weighted averaging frame-level features. We explore different generating methods on the attention weights. Besides, attention mechanism can also be used to align temporal information between enrollment and evaluation utterance. Triplet loss function is used to optimize our models, requiring inputs group in triplet style. Ultimately, results of experiment on RSR2015 database show that our attention-based model outperforms various baseline models.
- Is Part Of:
- Journal of physics. Volume 1619(2020)
- Journal:
- Journal of physics
- Issue:
- Volume 1619(2020)
- Issue Display:
- Volume 1619, Issue 1 (2020)
- Year:
- 2020
- Volume:
- 1619
- Issue:
- 1
- Issue Sort Value:
- 2020-1619-0001-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-08
- Subjects:
- Physics -- Congresses
530.5 - Journal URLs:
- http://www.iop.org/EJ/journal/1742-6596 ↗
http://ioppublishing.org/ ↗ - DOI:
- 10.1088/1742-6596/1619/1/012015 ↗
- Languages:
- English
- ISSNs:
- 1742-6588
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 5036.223000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 25413.xml