A hybrid noise robust model for multireplay attack detection in Automatic speaker verification systems. (April 2022)
- Record Type:
- Journal Article
- Title:
- A hybrid noise robust model for multireplay attack detection in Automatic speaker verification systems. (April 2022)
- Main Title:
- A hybrid noise robust model for multireplay attack detection in Automatic speaker verification systems
- Authors:
- Dua, Mohit
Sadhu, Ambika
Jindal, Anisha
Mehta, Raman - Abstract:
- Abstract: Biometric Systems are automatic methods of verifying the identity of a person based on some characteristics such as fingerprint, face, speech etc. Speech biometrics involves verifying the identity of a person based on voice. The process is called speaker verification, and such biometric systems are called Automatic Speaker Verification systems. However, these systems are prone to malicious attacks such as replay attacks. Thus, Automatic Speaker Verification systems need to be robust against these attacks. Existing approaches have worked majorly with single-replay attacks. The proposed work in this paper explores three back-end models: Convolutional Neural Network, Long Short-Term Memory and hybrid of these two models, with different input feature formats. It analyses their performance in a diverse multi-replay attack detection scenario, using the Voice Spoofing Detection Corpus. In the frontend of the proposed hybrid system, mel -spectrograms is used as the feature extraction technique for Convolutional Neural Network based model, and Constant-Q Cepstral Coefficient approach is used for extracting features for Long Short-Term Memory based model. A comparison of this hybrid model's performance is done with past approaches in single-replay attack detection as well. The average Equal Error Rate achieved on the test set was 0.036 for single-replayed attack and 0.0296 for two times replayed. The hybrid model consistently outperforms other models with lower Equal ErrorAbstract: Biometric Systems are automatic methods of verifying the identity of a person based on some characteristics such as fingerprint, face, speech etc. Speech biometrics involves verifying the identity of a person based on voice. The process is called speaker verification, and such biometric systems are called Automatic Speaker Verification systems. However, these systems are prone to malicious attacks such as replay attacks. Thus, Automatic Speaker Verification systems need to be robust against these attacks. Existing approaches have worked majorly with single-replay attacks. The proposed work in this paper explores three back-end models: Convolutional Neural Network, Long Short-Term Memory and hybrid of these two models, with different input feature formats. It analyses their performance in a diverse multi-replay attack detection scenario, using the Voice Spoofing Detection Corpus. In the frontend of the proposed hybrid system, mel -spectrograms is used as the feature extraction technique for Convolutional Neural Network based model, and Constant-Q Cepstral Coefficient approach is used for extracting features for Long Short-Term Memory based model. A comparison of this hybrid model's performance is done with past approaches in single-replay attack detection as well. The average Equal Error Rate achieved on the test set was 0.036 for single-replayed attack and 0.0296 for two times replayed. The hybrid model consistently outperforms other models with lower Equal Error Rate values, thus showing promising results, paving the way for future research along this line. Further, an analysis on unseen noisy audio files suggests the proposed model's utility in noise-robust systems as well. … (more)
- Is Part Of:
- Biomedical signal processing and control. Volume 74(2022)
- Journal:
- Biomedical signal processing and control
- Issue:
- Volume 74(2022)
- Issue Display:
- Volume 74, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 74
- Issue:
- 2022
- Issue Sort Value:
- 2022-0074-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-04
- Subjects:
- ASV -- Spoof detection -- Cqcc -- Lstm -- MelSpectrogram -- CNN -- VSDC -- Replay attack
Signal processing -- Periodicals
Biomedical engineering -- Periodicals
Signal Processing, Computer-Assisted -- Periodicals
Image Processing, Computer-Assisted -- Periodicals
Biomedical Engineering -- Periodicals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/17468094 ↗
http://www.elsevier.com/journals ↗
http://www.sciencedirect.com/science?_ob=PublicationURL&_tockey=%23TOC%2329675%232006%23999989998%23626449%23FLA%23&_cdi=29675&_pubType=J&_auth=y&_acct=C000045259&_version=1&_urlVersion=0&_userid=836873&md5=664b5cf9a57fc91971a17faf20c32ec1 ↗ - DOI:
- 10.1016/j.bspc.2022.103517 ↗
- Languages:
- English
- ISSNs:
- 1746-8094
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2087.880400
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 21057.xml