Constant‐Q magnitude–phase coefficients extraction for synthetic speech detection. Issue 5 (22nd July 2020)
- Record Type:
- Journal Article
- Title:
- Constant‐Q magnitude–phase coefficients extraction for synthetic speech detection. Issue 5 (22nd July 2020)
- Main Title:
- Constant‐Q magnitude–phase coefficients extraction for synthetic speech detection
- Authors:
- Yang, Jichen
Lin, Pei
He, Qianhua - Abstract:
- Abstract : Previous works in synthetic speech detection have focused on features based on magnitude or phase spectrum. In this study, to extract useful discriminative information for synthetic speech detection, the authors propose a feature based on magnitude–phase spectrum (MPS), combining magnitude‐ and phase‐spectra information. The proposed feature is termed as constant‐Q magnitude–phase coefficient (CMPC), which is obtained by combining constant‐Q transform (CQT), MPS, uniform resampling, and discrete cosine transform. The CQT used in this study is a long‐term window transform, which can provide the basis for CMPC to capture important artefacts of synthetic speech. Such artefacts are obtained using a unit selection algorithm, which have difficulties when based on the short‐term window transform. Uniform resampling aims to convert MPS from the octave domain into the linear domain. The discrete cosine transform is used when extracting principal components to remove correlations among the feature dimensions. The experimental results on AVspoof and ASVspoof 2015 corpora show that CMPC performs better than some commonly used features based on magnitude or phase spectrum alone. Their system based on CMPC outperforms many known systems.
- Is Part Of:
- IET biometrics. Volume 9:Issue 5(2020)
- Journal:
- IET biometrics
- Issue:
- Volume 9:Issue 5(2020)
- Issue Display:
- Volume 9, Issue 5 (2020)
- Year:
- 2020
- Volume:
- 9
- Issue:
- 5
- Issue Sort Value:
- 2020-0009-0005-0000
- Page Start:
- 216
- Page End:
- 221
- Publication Date:
- 2020-07-22
- Subjects:
- speech synthesis -- principal component analysis -- feature extraction -- discrete cosine transforms
magnitude–phase coefficients extraction -- synthetic speech detection -- useful discriminative information -- magnitude–phase spectrum -- phase‐spectra information -- CMPC -- uniform resampling -- long‐term window -- short‐term window -- feature dimensions -- constant‐Q magnitude–phase coefficients extraction -- CQT -- MPS
Biometric identification -- Periodicals
570.15195 - Journal URLs:
- http://digital-library.theiet.org/IET-BMT ↗
http://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6072579 ↗
http://www.bibliothek.uni-regensburg.de/ezeit/?2659842 ↗
https://ietresearch.onlinelibrary.wiley.com/journal/20474946 ↗
http://ieeexplore.ieee.org/Xplore/home.jsp ↗ - DOI:
- 10.1049/iet-bmt.2018.5100 ↗
- Languages:
- English
- ISSNs:
- 2047-4938
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4363.252100
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 17375.xml