Abstract
We present a novel method for exemplar-based image translation, called matching interleaved diffusion models (MIDMs). Most existing methods for this task are formulated as a GAN-based matching-then-generation framework. In this framework, however, matching errors caused by the difficulty of semantic matching across domains, e.g., sketch and photo, easily propagate to the generation step and in turn lead to degenerate results. Motivated by the recent success of diffusion models in overcoming the shortcomings of GANs, we incorporate diffusion models to overcome these limitations. Specifically, we formulate a diffusion-based matching-and-generation framework that interleaves cross-domain matching and diffusion steps in the latent space by iteratively feeding the intermediate warp into the noising process and denoising it to generate a translated image. In addition, to improve the reliability of the diffusion process, we design a confidence-aware process that uses cycle-consistency to consider only confident regions during translation. Experimental results show that our MIDMs generate more plausible images than state-of-the-art methods.
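The confidence-aware idea mentioned in the abstract can be illustrated with a toy forward-backward (cycle) check on feature matches. The following is only a minimal sketch under stated assumptions, not the paper's formulation: the function names `cycle_consistent_warp` and `masked_blend` are hypothetical, matching is simplified to a hard argmax over cosine similarity, and the confidence is a binary mask over a flat list of feature vectors.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm > 0 else 0.0


def cycle_consistent_warp(src_feats, ex_feats):
    """Hypothetical helper: warp exemplar features to source positions and
    flag which matches survive a forward-backward (cycle) check."""
    n_src, n_ex = len(src_feats), len(ex_feats)
    # Correlation between every source position i and exemplar position j.
    corr = [[cosine(s, e) for e in ex_feats] for s in src_feats]
    # Forward match: best exemplar position for each source position.
    fwd = [max(range(n_ex), key=lambda j: corr[i][j]) for i in range(n_src)]
    # Backward match: best source position for each exemplar position.
    bwd = [max(range(n_src), key=lambda i: corr[i][j]) for j in range(n_ex)]
    # A match is confident only if going there and back returns to the start.
    confident = [bwd[fwd[i]] == i for i in range(n_src)]
    warped = [ex_feats[fwd[i]] for i in range(n_src)]
    return warped, confident


def masked_blend(warped, fallback, confident):
    """Use the warped feature where confident, otherwise keep the fallback."""
    return [w if c else f for w, f, c in zip(warped, fallback, confident)]


# Toy data (assumed for illustration): three source positions, two exemplar positions.
src = [[1.0, 0.0], [0.0, 1.0], [0.9, 0.1]]
ex = [[1.0, 0.0], [0.0, 1.0]]
warped, confident = cycle_consistent_warp(src, ex)
blended = masked_blend(warped, src, confident)
```

In this toy case the third source position matches the first exemplar position, but that exemplar position matches back to the first source position, so the third match fails the cycle check and its own feature is kept. A soft confidence score could be used in place of the binary mask; that choice is likewise an assumption here.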
| Original language | English |
|---|---|
| Title of host publication | AAAI-23 Technical Tracks 2 |
| Editors | Brian Williams, Yiling Chen, Jennifer Neville |
| Publisher | AAAI Press |
| Pages | 2191-2199 |
| Number of pages | 9 |
| ISBN (Electronic) | 9781577358800 |
| DOIs | |
| State | Published - 27 Jun 2023 |
| Event | 37th AAAI Conference on Artificial Intelligence, AAAI 2023 - Washington, United States. Duration: 7 Feb 2023 → 14 Feb 2023 |
Publication series
| Name | Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023 |
|---|---|
| Volume | 37 |
Conference
| Conference | 37th AAAI Conference on Artificial Intelligence, AAAI 2023 |
|---|---|
| Country/Territory | United States |
| City | Washington |
| Period | 7/02/23 → 14/02/23 |
Bibliographical note
Publisher Copyright: Copyright © 2023, Association for the Advancement of Artificial Intelligence (www.aaai.org).
Title: MIDMs: Matching Interleaved Diffusion Models for Exemplar-Based Image Translation