Skip to content

AutoPipeline

Diffusers provides many pipelines for basic tasks like generating images, videos, audio, and inpainting. On top of these, there are specialized pipelines for adapters and features like upscaling, super-resolution, and more. Different pipeline classes can even use the same checkpoint because they share the same pretrained model! With so many different pipelines, it can be overwhelming to know which pipeline class to use.

The AutoPipeline class is designed to simplify the variety of pipelines in Diffusers. It is a generic task-first pipeline that lets you focus on a task (AutoPipelineForText2Image, AutoPipelineForImage2Image, and AutoPipelineForInpainting) without needing to know the specific pipeline class. The AutoPipeline automatically detects the correct pipeline class to use.

For example, let's use the dreamlike-art/dreamlike-photoreal-2.0 checkpoint.

Under the hood, AutoPipeline:

  1. Detects a "stable-diffusion" class from the model_index.json file.
  2. Depending on the task you're interested in, it loads the StableDiffusionPipeline, StableDiffusionImg2ImgPipeline, or StableDiffusionInpaintPipeline. Any parameter (strength, num_inference_steps, etc.) you would pass to these specific pipelines can also be passed to the AutoPipeline.
from mindone.diffusers import AutoPipelineForText2Image
import mindspore as ms
import numpy as np

pipe_txt2img = AutoPipelineForText2Image.from_pretrained(
    "dreamlike-art/dreamlike-photoreal-2.0", mindspore_dtype=ms.float16, use_safetensors=True
)

prompt = "cinematic photo of Godzilla eating sushi with a cat in a izakaya, 35mm photograph, film, professional, 4k, highly detailed"
generator = np.random.Generator(np.random.PCG64(22))
image = pipe_txt2img(prompt, generator=generator)[0][0]
image

from mindone.diffusers import AutoPipelineForImage2Image
from mindone.diffusers.utils import load_image
import mindspore as ms
import numpy as np

pipe_img2img = AutoPipelineForImage2Image.from_pretrained(
    "dreamlike-art/dreamlike-photoreal-2.0", mindspore_dtype=ms.float16, use_safetensors=True
)

init_image = load_image("https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/autopipeline-text2img.png")

prompt = "cinematic photo of Godzilla eating burgers with a cat in a fast food restaurant, 35mm photograph, film, professional, 4k, highly detailed"
generator = np.random.Generator(np.random.PCG64(7))
image = pipe_img2img(prompt, image=init_image, generator=generator)[0][0]
image

Notice how the dreamlike-art/dreamlike-photoreal-2.0 checkpoint is used for both text-to-image and image-to-image tasks? To save memory and avoid loading the checkpoint twice, use the from_pipe method.

pipe_img2img = AutoPipelineForImage2Image.from_pipe(pipe_txt2img)
image = pipeline(prompt, image=init_image, generator=generator)[0][0]
image

from mindone.diffusers import AutoPipelineForInpainting
from mindone.diffusers.utils import load_image
import mindspore as ms
import numpy as np

pipeline = AutoPipelineForInpainting.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0", mindspore_dtype=ms.float16, use_safetensors=True
)

init_image = load_image("https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/autopipeline-img2img.png")
mask_image = load_image("https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/autopipeline-mask.png")

prompt = "cinematic photo of a owl, 35mm photograph, film, professional, 4k, highly detailed"
generator = np.random.Generator(np.random.PCG64(38))
image = pipeline(prompt, image=init_image, mask_image=mask_image, generator=generator, strength=0.4)[0][0]
image

Unsupported checkpoints

The AutoPipeline supports Stable Diffusion, Stable Diffusion XL, ControlNet, Kandinsky 2.1, Kandinsky 2.2, and DeepFloyd IF checkpoints.

If you try to load an unsupported checkpoint, you'll get an error.

from mindone.diffusers import AutoPipelineForImage2Image
import mindspore as ms

pipeline = AutoPipelineForImage2Image.from_pretrained(
    "openai/shap-e-img2img", mindspore_dtype=ms.float16, use_safetensors=True
)
"ValueError: AutoPipeline can't find a pipeline linked to ShapEImg2ImgPipeline for None"