An Attention Enhanced Cross-Modal Image–Sound Mutual Generation Model for Birds. (1st February 2021)
- Record Type:
- Journal Article
- Title:
- An Attention Enhanced Cross-Modal Image–Sound Mutual Generation Model for Birds. (1st February 2021)
- Main Title:
- An Attention Enhanced Cross-Modal Image–Sound Mutual Generation Model for Birds
- Authors:
- Hao, Wangli
Han, Meng
Li, Shancang
Li, Fuzhong - Abstract:
- Abstract: Cross-modal bird image–audio mutual generation has appealing potential benefits for bird classification. To achieve promising cross-modal bird visual–audio mutual generation, we propose an attention enhanced cross-modal cycle adversarial generation network. Specifically, the attention module endows our model with long-term intra-modality dependency and inter-modality dependency capabilities, which can provide more information during the generation process and further improve the generation performance. Moreover, because there was no dataset concerning bird visual–audio mutual generation, the authors established a novel bird cross-modal generation dataset, called Bird_Crossmodal_Generation (BCG). Based on BCG, our model obtains promising performance and achieves significant improvement under both inception score and Frechet inception distance criteria. The experimental results validate the feasibility of the proposed task and the superiority of our model. Additionally, this investigation provides a basis for more researchers to develop cross-modality methods for bird visual–audio generation.
- Is Part Of:
- Computer journal. Volume 65:Number 2(2022)
- Journal:
- Computer journal
- Issue:
- Volume 65:Number 2(2022)
- Issue Display:
- Volume 65, Issue 2 (2022)
- Year:
- 2022
- Volume:
- 65
- Issue:
- 2
- Issue Sort Value:
- 2022-0065-0002-0000
- Page Start:
- 410
- Page End:
- 422
- Publication Date:
- 2021-02-01
- Subjects:
- attention enhanced -- cross-modal audiovisual -- bird generation -- generative adversarial network
Computers -- Periodicals
005.1 - Journal URLs:
- http://comjnl.oxfordjournals.org/ ↗
http://ukcatalogue.oup.com/ ↗ - DOI:
- 10.1093/comjnl/bxaa188 ↗
- Languages:
- English
- ISSNs:
- 0010-4620
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.060000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 20958.xml