Where you edit is what you get: Text-guided image editing with region-based attention. (July 2023)
- Record Type:
- Journal Article
- Title:
- Where you edit is what you get: Text-guided image editing with region-based attention. (July 2023)
- Main Title:
- Where you edit is what you get: Text-guided image editing with region-based attention
- Authors:
- Xiao, Changming
Yang, Qi
Xu, Xiaoqiang
Zhang, Jianwei
Zhou, Feng
Zhang, Changshui
- Abstract:
- Highlights: A novel framework is proposed which enables stable training of multi-text image editing within one model without the need for per-sample or per-prompt optimization. A region-based attention mechanism is adopted to ensure spatially localized editing. With the help of these designs, real-time interaction is enabled and several practical applications such as sequential editing can be achieved with high quality.
Abstract: Leveraging the abundant knowledge learned from pre-trained multi-modal models like CLIP has recently proved to be effective for text-guided image editing. Though convincing results have been achieved by combining the image generator StyleGAN with CLIP, most methods need to train separate models for different prompts, and irrelevant regions are often changed after editing due to the lack of spatial disentanglement. We propose a novel framework that can edit different images according to different prompts in one model. In addition, an innovative region-based spatial attention mechanism is adopted to explicitly guarantee the locality of editing. Experiments, mainly in the face domain, verify the feasibility of our framework and show that, once multi-text editing and local editing are achievable, our method can support practical applications like sequential editing and regional style transfer.
- Is Part Of:
- Pattern recognition. Volume 139(2023)
- Journal:
- Pattern recognition
- Issue:
- Volume 139(2023)
- Issue Display:
- Volume 139, Issue 2023 (2023)
- Year:
- 2023
- Volume:
- 139
- Issue:
- 2023
- Issue Sort Value:
- 2023-0139-2023-0000
- Page Start:
- Page End:
- Publication Date:
- 2023-07
- Subjects:
- Generative adversarial networks -- Text-guided image editing -- Spatial disentanglement
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4
- Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203
http://www.sciencedirect.com/
- DOI:
- 10.1016/j.patcog.2023.109458
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 26855.xml