DreamWalk: Style Space Exploration using Diffusion Guidance

Text-conditioned diffusion models can generate impressive images, but they fall short when it comes to fine-grained control. Unlike direct-editing tools such as Photoshop, text-conditioned models require the artist to perform “prompt engineering,” constructing special text sentences to control the style or amount of a particular subject present in the output image. Our goal is to provide fine-grained control over the style and substance specified by the prompt, for example, to adjust the intensity of styles in different regions of the image. Our approach is to decompose the text prompt into conceptual elements and apply a separate guidance term for each element in a single diffusion process. We introduce guidance scale functions to control when in the diffusion process and where in the image to intervene. Since the method is based solely on adjusting diffusion guidance, it does not require fine-tuning or manipulating the internal layers of the diffusion model's neural network, and it can be used in conjunction with LoRA- or DreamBooth-trained models.
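To make the idea of per-element guidance terms concrete, here is a minimal NumPy sketch. It assumes the familiar classifier-free guidance form, extended to one term per prompt element, with each scale allowed to be a scalar (uniform over the image) or an `(H, W)` map (spatially varying). The exact formulation used by DreamWalk is not stated on this page, so treat the formula as an assumption. The names `combined_guidance`, `time_scale`, and `left_to_right_ramp` are hypothetical, and the noise predictions are random stand-ins for the outputs of a real diffusion model.

```python
import numpy as np


def combined_guidance(eps_uncond, eps_conds, scales):
    """Combine one guidance term per prompt element (assumed form).

    eps_uncond: unconditional noise prediction, shape (C, H, W).
    eps_conds:  list of conditional predictions, one per element, each (C, H, W).
    scales:     list of guidance scales, each a scalar or an (H, W) array.
    """
    eps = eps_uncond.copy()
    for eps_c, scale in zip(eps_conds, scales):
        # An (H, W) scale broadcasts across the channel axis.
        eps = eps + np.asarray(scale) * (eps_c - eps_uncond)
    return eps


def time_scale(t, t_start, t_end, strength):
    """Hypothetical 'when' control: apply `strength` only while the
    normalized timestep t (in [0, 1]) lies inside [t_start, t_end]."""
    return strength if t_start <= t <= t_end else 0.0


def left_to_right_ramp(height, width, low, high):
    """Hypothetical 'where' control: scale rises linearly from the left
    edge of the image (low) to the right edge (high)."""
    return np.tile(np.linspace(low, high, width), (height, 1))


# Stand-in predictions; a real pipeline would query the denoiser three times
# (no prompt, content prompt, style prompt) at each sampling step.
rng = np.random.default_rng(0)
C, H, W = 4, 8, 8
eps_uncond = rng.standard_normal((C, H, W))
eps_content = rng.standard_normal((C, H, W))
eps_style = rng.standard_normal((C, H, W))

t = 0.6  # normalized timestep of the current sampling step
content_scale = 7.5
# Style is applied only for t in [0.3, 1.0], and more strongly on the right.
style_scale = time_scale(t, 0.3, 1.0, 1.0) * left_to_right_ramp(H, W, 0.0, 5.0)

eps = combined_guidance(eps_uncond, [eps_content, eps_style],
                        [content_scale, style_scale])
print(eps.shape)
```

Because everything happens on the predicted noise, a sketch like this leaves the model weights and internal layers untouched, which is consistent with the claim that the method works alongside LoRA- or DreamBooth-trained models.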


Abstract

Video Presentation

Single Style Application

Pixel Art

Rene Magritte

van Gogh

Picasso

Qi Baishi

Rene Magritte

Hokusai

Pixel Art

Qi Baishi

Monet

Pixel Art

Qi Baishi

Watercolor

Picasso

Interpolation between Styles

(SD1.5) A group of children flying kites on a breezy summer day at the park in the style of {Monet, Magritte}. First Row: CLIP embedding baseline. Second Row: Ours.

(SD1.5) A Horse galloping freely across vast open field in the style of {water color, pixel art}. First Row: CLIP embedding baseline. Second Row: Ours.

(SD1.5) Artist painting vivid sunset on beach canvas in the style of {Picasso, Hokusai}. First Row: CLIP embedding baseline. Second Row: Ours.

"Campsite with a fire at night (SDXL: Monet -> Picasso)"

"A dog running on a beach (SDXL: Monet -> Hokusai)"

Spatially Varying Guidance Function

"Fish swimming down a stream (SDXL: Picasso)"

"Dog running on a beach (SDXL: Monet)"

"Bird sitting on a tree branch (SDXL: Hokusai)"

"Campsite with a fire at night (SDXL: Hokusai)"

Controllable Subject vs. Prompt Emphasis

DreamWalk on Real Images

in Spring

in Winter

Monet

Hokusai

Rene Magritte

Picasso

BibTeX