Audiovisual Singing Voice Separation. Issue 1 (25th November 2021)
- Record Type:
- Journal Article
- Title:
- Audiovisual Singing Voice Separation. Issue 1 (25th November 2021)
- Main Title:
- Audiovisual Singing Voice Separation
- Authors:
- Li, Bochen
Wang, Yuxuan
Duan, Zhiyao - Abstract:
- Separating a song into vocal and accompaniment components is an active research topic, and recent years witnessed an increased performance from supervised training using deep learning techniques. We propose to apply the visual information corresponding to the singers' vocal activities to further improve the quality of the separated vocal signals. The video frontend model takes the input of mouth movement and fuses it into the feature embeddings of an audio-based separation framework. To facilitate the network to learn audiovisual correlation of singing activities, we add extra vocal signals irrelevant to the mouth movement to the audio mixture during training. We create two audiovisual singing performance datasets for training and evaluation, respectively, one curated from audition recordings on the Internet, and the other recorded in house. The proposed method outperforms audio-based methods in terms of separation quality on most test recordings. This advantage is especially pronounced when there are backing vocals in the accompaniment, which poses a great challenge for audio-only methods.
- Is Part Of:
- Transactions of the International Society for Music Information Retrieval. Volume 4:Issue 1(2021)
- Journal:
- Transactions of the International Society for Music Information Retrieval
- Issue:
- Volume 4:Issue 1(2021)
- Issue Display:
- Volume 4, Issue 1 (2021)
- Year:
- 2021
- Volume:
- 4
- Issue:
- 1
- Issue Sort Value:
- 2021-0004-0001-0000
- Page Start:
- 195
- Page End:
- 209
- Publication Date:
- 2021-11-25
- Subjects:
- Source separation -- audiovisual analysis -- singing performance
025 - Journal URLs:
- https://transactions.ismir.net/ ↗
- DOI:
- 10.5334/tismir.108 ↗
- Languages:
- English
- ISSNs:
- 2514-3298
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library HMNTS - ELD Digital store
- Ingest File:
- 17945.xml