X-vector anonymization using autoencoders and adversarial training for preserving speech privacy. (July 2022)

Record Type:: Journal Article
Title:: X-vector anonymization using autoencoders and adversarial training for preserving speech privacy. (July 2022)
Main Title:: X-vector anonymization using autoencoders and adversarial training for preserving speech privacy
Authors:: Perero-Codosero, Juan M.
Espinoza-Cuadros, Fernando M.
Hernández-Gómez, Luis A.
Abstract:: Abstract: The rapid increase in web services and mobile apps, which collect personal data from users, has also increased the risk that their privacy may be severely compromised. In particular, the increasing variety of spoken language interfaces and voice assistants empowered by the vertiginous breakthroughs in deep learning have prompted important concerns in the European Union in terms of preserving the privacy of speech data. For instance, an attacker can record speech from users and impersonate them to obtain access to systems that require voice identification. By extracting speaker, linguistic (e.g., dialect), and paralinguistic features (e.g., age) from a speech signal, the speaker profiles can also be hacked from users through existing technology. To mitigate these weaknesses, in this study, we present a speech anonymization method based on autoencoders and adversarial training. Given an utterance, we first extract an x-vector, which is a powerful utterance-level embedding used in state-of-the-art speaker recognition. This original x-vector is transformed by an autoencoder producing a new x-vector, where speaker, gender, and accent information are suppressed through adversarial training. The anonymized speech is finally generated through a neural speech synthesizer driven by the anonymized x-vector, fundamental frequency, and phoneme information extracted from the input speech. For the evaluation, we followed the VoicePrivacy Challenge framework, where anonymization … (more)
Is Part Of:: Computer speech & language. Volume 74(2022)
Journal:: Computer speech & language
Issue:: Volume 74(2022)
Issue Display:: Volume 74, Issue 2022 (2022)
Year:: 2022
Volume:: 74
Issue:: 2022
Issue Sort Value:: 2022-0074-2022-0000
Page Start:
Page End:
Publication Date:: 2022-07
Subjects:: Speaker anonymization -- Adversarial training -- Autoencoders -- Adversarial neural networks -- Automatic speech recognition -- Automatic speaker verification
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454
Journal URLs:: http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗
DOI:: 10.1016/j.csl.2022.101351 ↗
Languages:: English
ISSNs:: 0885-2308
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 21011.xml