You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Single-file GGUF weights for Ideogram4 are now published
(stduhpf/ideogram-4-gguf), so add the model to the gallery. Ideogram4 is a
text-to-image model with strong, accurate in-image text rendering, driven by
a Qwen3-VL-8B text encoder and real classifier-free guidance from a separate
unconditional diffusion model (the uncond_diffusion_model_path support added
in the preceding commit).
Two index entries, both built on gallery/virtual.yaml with the full config
inlined in overrides (same pattern as the other models, no dedicated template
file):
- ideogram-4-iq4nl-ggml (4-bit, ~11.6GB diffusion)
- ideogram-4-q8_0-ggml (8-bit, ~20GB diffusion)
Each bundles the diffusion + unconditional GGUF (stduhpf), the
Qwen3-VL-8B-Instruct text encoder (unsloth), and the FLUX.2 VAE (Comfy-Org
mirror, non-gated). cfg_scale is 7 to match the upstream Ideogram4 default,
since it performs real CFG unlike the guidance-distilled Flux/Z-Image models.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
Assisted-by: Claude:claude-opus-4-8 [Claude Code]
Ideogram 4 is a text-to-image diffusion model known for state-of-the-art prompt adherence and exceptional, accurate text rendering inside images. It is driven by a Qwen3-VL-8B text encoder and performs real classifier-free guidance from a separate unconditional diffusion model.
26175
+
26176
+
This is the iQ4_NL (4-bit) quantization, a good balance of quality and footprint (~5.8GB diffusion + ~5.8GB unconditional). The bundle also pulls the Qwen3-VL-8B-Instruct text encoder and the FLUX.2 VAE. Quantized GGUF weights by stduhpf for use with stable-diffusion.cpp.
26177
+
license: ideogram-non-commercial-model-agreement
26178
+
tags:
26179
+
- ideogram
26180
+
- ideogram4
26181
+
- text-to-image
26182
+
- image-generation
26183
+
- gguf
26184
+
- quantized
26185
+
- 8b
26186
+
- diffusion
26187
+
last_checked: "2026-06-06"
26188
+
overrides:
26189
+
backend: stablediffusion-ggml
26190
+
step: 25
26191
+
# Ideogram4 runs real classifier-free guidance from a separate
26192
+
# unconditional diffusion model, so it needs a CFG scale > 1 (unlike the
26193
+
# guidance-distilled Flux / Z-Image models). 7 matches the upstream
26194
+
# stable-diffusion.cpp default used in the Ideogram4 example.
Ideogram 4 is a text-to-image diffusion model known for state-of-the-art prompt adherence and exceptional, accurate text rendering inside images. It is driven by a Qwen3-VL-8B text encoder and performs real classifier-free guidance from a separate unconditional diffusion model.
26225
+
26226
+
This is the Q8_0 (8-bit) quantization for highest quality (~10.1GB diffusion + ~10.1GB unconditional). The bundle also pulls the Qwen3-VL-8B-Instruct text encoder and the FLUX.2 VAE. Quantized GGUF weights by stduhpf for use with stable-diffusion.cpp.
26227
+
license: ideogram-non-commercial-model-agreement
26228
+
tags:
26229
+
- ideogram
26230
+
- ideogram4
26231
+
- text-to-image
26232
+
- image-generation
26233
+
- gguf
26234
+
- quantized
26235
+
- 8b
26236
+
- diffusion
26237
+
last_checked: "2026-06-06"
26238
+
overrides:
26239
+
backend: stablediffusion-ggml
26240
+
step: 25
26241
+
# Ideogram4 runs real classifier-free guidance from a separate
26242
+
# unconditional diffusion model, so it needs a CFG scale > 1 (unlike the
26243
+
# guidance-distilled Flux / Z-Image models). 7 matches the upstream
26244
+
# stable-diffusion.cpp default used in the Ideogram4 example.
0 commit comments