Chroma Pipeline #11698


Merged: 117 commits into huggingface:main on Jun 14, 2025

Conversation

Ednaordinary
Contributor

What does this PR do?

Fixes #11010

Relevant: #11566

Before submitting

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag members/contributors who may be interested in your PR.

@DN6

@ghunkins
Contributor

ghunkins commented Jun 13, 2025

Amazing work on this @Ednaordinary, huge thanks. I'm getting the error below when adding a LoRA:

 scale_expansion_fn = _SET_ADAPTER_SCALE_FN_MAPPING[self.__class__.__name__]
E       KeyError: 'ChromaTransformer2DModel'

Can we add ChromaTransformer2DModel here: https://github.com/huggingface/diffusers/blob/main/src/diffusers/loaders/peft.py#L45

_SET_ADAPTER_SCALE_FN_MAPPING = {
  ...
  "ChromaTransformer2DModel": lambda model_cls, weights: weights,
  ...
}

I think that will fix it.

@ghunkins
Contributor

[image attachment]

Looks great 🔥

@DN6 DN6 merged commit 8adc600 into huggingface:main Jun 14, 2025
30 checks passed
@DN6
Collaborator

DN6 commented Jun 14, 2025

Great work @Ednaordinary @hameerabbasi and @iddl! 🚀

@nitinmukesh

Awesome. Thank you all. 👍

@tin2tin

tin2tin commented Jun 14, 2025

Thank you for this great commit!

A question: is bitsandbytes quantization not supported for this model? (As it is, it needs 18.5 GB of VRAM, which is a bit too heavy for a lot of consumer cards.)

import torch
from diffusers import ChromaTransformer2DModel, ChromaPipeline, BitsAndBytesConfig
from transformers import T5EncoderModel, T5Tokenizer

bfl_repo = "black-forest-labs/FLUX.1-dev"
dtype = torch.bfloat16

nf4_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

transformer = ChromaTransformer2DModel.from_single_file("https://huggingface.co/lodestones/Chroma/blob/main/chroma-unlocked-v35.safetensors", torch_dtype=dtype, quantization_config=nf4_config) 

text_encoder = T5EncoderModel.from_pretrained(bfl_repo, subfolder="text_encoder_2", torch_dtype=dtype)
tokenizer = T5Tokenizer.from_pretrained(bfl_repo, subfolder="tokenizer_2", torch_dtype=dtype)

pipe = ChromaPipeline.from_pretrained(bfl_repo, transformer=transformer, text_encoder=text_encoder, tokenizer=tokenizer, torch_dtype=dtype)

pipe.enable_model_cpu_offload()

Gives me this error:

 File ".\python\Lib\site-packages\huggingface_hub\utils\_validators.py", line 114, in _inner_fn
    return fn(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^
  File ".\Python\Python311\site-packages\diffusers\loaders\single_file_model.py", line 415, in from_single_file
    load_model_dict_into_meta(
  File ".\Python\Python311\site-packages\diffusers\models\model_loading_utils.py", line 298, in load_model_dict_into_meta
    hf_quantizer.create_quantized_param(
  File ".\Python\Python311\site-packages\diffusers\quantizers\bitsandbytes\bnb_quantizer.py", line 182, in create_quantized_param
    raise ValueError(
ValueError: Supplied state dict for distilled_guidance_layer.in_proj.weight does not contain `bitsandbytes__*` and possibly other `quantized_stats` components.

@Ednaordinary
Contributor Author

Ednaordinary commented Jun 14, 2025

@tin2tin bitsandbytes is supported! Just save the diffusers version first with .save_pretrained() and reload it with the quantization config via .from_pretrained(). My diffusers weights may also work, but I'm not sure how out of date they are with the current code: https://huggingface.co/imnotednamode/Chroma-v36-dc-diffusers

Actually, my weights still load fine; they just print some unnecessary attribute warnings. I'll fix that when I get around to it.
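
A minimal sketch of that save-then-reload flow (the local folder name is an arbitrary choice, and the checkpoint URL is the one from the earlier snippet in this thread):

import torch
from diffusers import ChromaTransformer2DModel, BitsAndBytesConfig

dtype = torch.bfloat16

# One-time conversion: load the single-file checkpoint and save it in diffusers format.
transformer = ChromaTransformer2DModel.from_single_file(
    "https://huggingface.co/lodestones/Chroma/blob/main/chroma-unlocked-v35.safetensors",
    torch_dtype=dtype,
)
transformer.save_pretrained("chroma-transformer-diffusers")  # local path, pick anything

# Reload the saved folder with a bitsandbytes NF4 config.
nf4_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=dtype,
)
transformer = ChromaTransformer2DModel.from_pretrained(
    "chroma-transformer-diffusers",
    quantization_config=nf4_config,
    torch_dtype=dtype,
)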

@tin2tin

tin2tin commented Jun 14, 2025

@Ednaordinary Oh, that's super cool! How do you load your diffusers version? Do you load just the transformer and quantize that? Can you share a snippet that shows how you use it? Like this?

import torch
from diffusers import ChromaTransformer2DModel, ChromaPipeline, BitsAndBytesConfig
from transformers import T5EncoderModel, T5Tokenizer

bfl_repo = "black-forest-labs/FLUX.1-dev"
dtype = torch.bfloat16

nf4_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
transformer = ChromaTransformer2DModel.from_pretrained(
    "imnotednamode/Chroma-v36-dc-diffusers",
    subfolder="transformer",
    quantization_config=nf4_config,
    torch_dtype=torch.bfloat16,
)

text_encoder = T5EncoderModel.from_pretrained(bfl_repo, subfolder="text_encoder_2", torch_dtype=dtype)
tokenizer = T5Tokenizer.from_pretrained(bfl_repo, subfolder="tokenizer_2", torch_dtype=dtype)

pipe = ChromaPipeline.from_pretrained(bfl_repo, transformer=transformer, text_encoder=text_encoder, tokenizer=tokenizer, torch_dtype=dtype)

pipe.enable_model_cpu_offload()

I'm getting this notice:

The config attributes {'approximator_in_factor': 16} were passed to ChromaTransformer2DModel, but are not expected and will be ignored. Please verify your config.json configuration file.

@Ednaordinary
Contributor Author

Ednaordinary commented Jun 14, 2025

You can safely ignore the config notice; it appears because changes have been made to the diffusers code since I generated that checkpoint. Also, be sure to add llm_int8_skip_modules=["distilled_guidance_layer"] as noted in #11698 (comment) for the best quality.

@tin2tin

tin2tin commented Jun 14, 2025

@Ednaordinary Where should I add llm_int8_skip_modules=["distilled_guidance_layer"]? Could you help me out with a quantized example code snippet?

@nitinmukesh

nitinmukesh commented Jun 14, 2025

@tin2tin

Here is the code for a different model:
bitsandbytes-foundation/bitsandbytes#1611

model = AutoModelForCausalLM.from_pretrained(
    model_name, device_map='cuda:0', attn_implementation='flash_attention_2',
    torch_dtype=torch.bfloat16,
    quantization_config=BitsAndBytesConfig(
        load_in_4bit=True,

        llm_int8_skip_modules=["lm_head"],

        bnb_4bit_compute_dtype=torch.bfloat16,
        bnb_4bit_quant_type="nf4",
        bnb_4bit_use_double_quant=True
    )
)

Also, instead of applying quantization to text_encoder_2 and transformer separately, both modules can be specified in the quantization_config. If I remember correctly, @sayakpaul posted an example somewhere, but I can't find it.

It was something like

pipe=FluxPipeline (
        quantization_config=BitsAndBytesConfig(
            load_in_4bit=True,
            llm_int8_skip_modules=["lm_head"],
            bnb_4bit_compute_dtype=torch.bfloat16,
            bnb_4bit_quant_type="nf4",
            bnb_4bit_use_double_quant=True,
            ??=[text_encoder_2, transformer]
        )
)

@nitinmukesh

OK, I found it: #11648 (comment)

components_to_quantize=["transformer", "text_encoder_2"]

@Ednaordinary
Contributor Author

Ednaordinary commented Jun 14, 2025

Another pipeline-level quant example is here: #11698 (comment)

Also, yes, the parameter is passed to the BitsAndBytesConfig:

BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
    bnb_4bit_quant_type="nf4",
    llm_int8_skip_modules=["distilled_guidance_layer"],
)
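
For context, a fuller sketch of where that config would go when loading just the transformer; the repo name and subfolder are carried over from the earlier snippet in this thread, not verified here:

import torch
from diffusers import ChromaTransformer2DModel, BitsAndBytesConfig

# NF4 config with the distilled guidance layer kept in full precision.
nf4_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
    bnb_4bit_quant_type="nf4",
    llm_int8_skip_modules=["distilled_guidance_layer"],
)

# The config is passed to from_pretrained when loading the transformer.
transformer = ChromaTransformer2DModel.from_pretrained(
    "imnotednamode/Chroma-v36-dc-diffusers",  # repo from the earlier comment
    subfolder="transformer",
    quantization_config=nf4_config,
    torch_dtype=torch.bfloat16,
)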

@nitinmukesh

nitinmukesh commented Jun 14, 2025

> Another pipeline-level quant example is here #11698 (comment)

Ah, this is what I was referring to (the sample from @sayakpaul). Thanks.
#11698 (comment)

@asomoza
Member

asomoza commented Jun 14, 2025

If you want ready-to-use code, this one works with the main branch:

import torch

from diffusers import ChromaPipeline
from diffusers.quantizers import PipelineQuantizationConfig


dtype = torch.bfloat16

repo_id = "imnotednamode/Chroma-v36-dc-diffusers"

pipeline_quant_config = PipelineQuantizationConfig(
    quant_backend="bitsandbytes_4bit",
    quant_kwargs={
        "load_in_4bit": True,
        "bnb_4bit_quant_type": "nf4",
        "bnb_4bit_compute_dtype": dtype,
        "llm_int8_skip_modules": ["distilled_guidance_layer"],
    },
    components_to_quantize=["transformer", "text_encoder"],
)

pipe = ChromaPipeline.from_pretrained(
    "imnotednamode/Chroma-v36-dc-diffusers",
    quantization_config=pipeline_quant_config,
    torch_dtype=dtype,
)
pipe.enable_model_cpu_offload()

prompt = 'Ultra-realistic, high-quality photo of an anthropomorphic capybara with a tough, streetwise attitude, wearing a worn black leather jacket, dark sunglasses, and ripped jeans. The capybara is leaning casually against a gritty urban wall covered in vibrant graffiti. Behind it, in bold, dripping yellow spray paint, the word "HuggingFace" is scrawled in large street-art style letters. The scene is set in a dimly lit alleyway with moody lighting, scattered trash, and an edgy, rebellious vibe — like a character straight out of an underground comic book.'
negative = "low quality, bad anatomy, extra digits, missing digits, extra limbs, missing limbs"

image = pipe(
    prompt=prompt,
    negative_prompt=negative,
    num_inference_steps=30,
    guidance_scale=4.0,
    width=1024,
    height=1024,
    generator=torch.Generator().manual_seed(42),
).images[0]

image.save("chroma.png")


Successfully merging this pull request may close these issues.

Support Chroma - Flux based model with architecture changes