Stable Diffusion 3.5 Large (Stability)
Stability AI's 8B-parameter flagship SD3.5 model. Strong prompt adherence and aesthetic quality.
Stable Diffusion 3.5 Large (Stability) is an image generation AI model from Custom, priced at €0.000 per 1M input tokens with an unknown context window.
API Integration
Use our OpenAI-compatible API to integrate Stable Diffusion 3.5 Large (Stability) into your application.
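If the endpoint follows the OpenAI image-generation convention, a request can presumably also be made with plain `fetch`, without the SDK. The sketch below only builds the request. The base URL `https://api.example.com/v1` is a placeholder, and the `/images/generations` path and body fields are assumptions borrowed from OpenAI's API; neither is confirmed by this page. The helper name `buildImageRequest` is hypothetical.

```javascript
// Sketch: build an OpenAI-style image generation request.
// ASSUMPTIONS: BASE_URL is a placeholder, and the /images/generations path
// mirrors OpenAI's API; check the provider's documentation for real values.
const BASE_URL = "https://api.example.com/v1"; // hypothetical

function buildImageRequest(apiKey, prompt, { size = "1024x1024", n = 1 } = {}) {
  if (!apiKey) throw new Error("apiKey is required");
  if (!/^\d+x\d+$/.test(size)) throw new Error(`invalid size: ${size}`);
  return {
    url: `${BASE_URL}/images/generations`,
    init: {
      method: "POST",
      headers: {
        Authorization: `Bearer ${apiKey}`,
        "Content-Type": "application/json",
      },
      body: JSON.stringify({ model: "sd-3-5-large-stability", prompt, n, size }),
    },
  };
}

// Usage (performs a network call, so it is left commented out):
// const { url, init } = buildImageRequest("YOUR_API_KEY", "A cat in space");
// const res = await fetch(url, init);
// console.log((await res.json()).data[0].url);
```

Separating request construction from the network call keeps the assumed parts (URL and path) in one place, so they are easy to swap once the real values are known. The SDK example that follows is the shorter route.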
npm install railwail

import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");
const images = await rw.run("sd-3-5-large-stability", "A beautiful sunset over Tokyo");
console.log(images[0].url);

// Or use the image() method for full control
const res = await rw.image("sd-3-5-large-stability", "A cat in space", {
  size: "1024x1024",
  n: 1,
});
console.log(res.data[0].url);

Deep dive: Stability AI's Stable Diffusion 3.5 Large (Stability)
Stability AI was founded in 2019 by Emad Mostaque (CEO until March 2024, succeeded by Prem Akkaraju) and is headquartered in London. The company sponsored and released Stable Diffusion 1.x and 2.x (2022) in partnership with CompVis (LMU Munich, Rombach et al.) and Runway, followed by SDXL (2023), SD3 (2024) and SD 3.5 (October 2024). After members of the original Stable Diffusion team left in 2024 to found Black Forest Labs, Stability AI was restructured and recapitalised by a new investor group led by Sean Parker. The SD 3.5 family (Large at 8.1B parameters, Large Turbo, and Medium at 2.5B) was released in October 2024 under the Stability AI Community License, which allows free commercial use up to $1M in annual revenue.
Stable Diffusion 3.5 Large is the flagship model of Stability AI's October 2024 SD 3.5 release. The architecture is a Multimodal Diffusion Transformer (MMDiT), introduced in the SD3 paper 'Scaling Rectified Flow Transformers for High-Resolution Image Synthesis' (Esser et al. 2024). MMDiT processes text and image tokens with separate parameter streams that interact only at the attention layers, which improves text fidelity and prompt adherence compared with classical cross-attention. The 8.1B-parameter backbone is trained with the rectified-flow / flow-matching objective on a curated image-text dataset, and classifier-free guidance is applied at sampling time. SD 3.5 Large uses three text encoders in parallel (CLIP-L, CLIP-G and T5-XXL), so prompts of up to roughly 256 tokens can be encoded. Inference runs at 1024x1024 or higher with 28 to 50 sampling steps. SD 3.5 Large improves on the original SD3 Medium (2B) with better aesthetics, stronger prompt adherence and substantially better photorealism, addressing the widely criticised anatomy issues of that model.
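To make the role of the sampling step count concrete, here is a toy sketch of Euler sampling under a rectified-flow formulation. It assumes the interpolation x_t = (1 - t) * x0 + t * noise, so the ideal velocity along a straight path is noise - x0. The velocity functions are hypothetical stand-ins for the trained network, and the name `eulerSample` is illustrative, not part of any Stability AI or SDK API.

```javascript
// Toy rectified-flow sampler on plain arrays (no model, no GPU).
// ASSUMPTION: x_t = (1 - t) * x0 + t * noise, so dx/dt = noise - x0.

// Integrate from t = 1 (pure noise) to t = 0 (image) with `steps` Euler steps.
// `velocityFn(x, t)` is a hypothetical stand-in for the trained network.
function eulerSample(noise, velocityFn, steps = 28) {
  let x = noise.slice();
  const dt = 1 / steps;
  for (let i = 0; i < steps; i++) {
    const t = 1 - i * dt;
    const v = velocityFn(x, t);
    x = x.map((xi, j) => xi - dt * v[j]); // move toward t = 0
  }
  return x;
}

// Case 1: a perfectly straight path. The oracle velocity is constant, so
// Euler integration lands on the target up to floating-point error.
const target = [0.2, -0.5, 0.9];
const noise = [1.3, 0.4, -0.7];
const straightField = () => noise.map((n, i) => n - target[i]);
const sample = eulerSample(noise, straightField, 28);
console.log("straight path:", sample.map((v) => v.toFixed(3)));

// Case 2: a curved path (dx/dt = x, exact answer x(0) = x(1) / e).
// Here the discretisation error shrinks as the step count grows.
const curvedField = (x) => x.slice();
const curvedError = (steps) =>
  Math.abs(eulerSample([1], curvedField, steps)[0] - Math.exp(-1));
for (const steps of [4, 28, 50]) {
  console.log(`curved path, ${steps} steps: error ${curvedError(steps).toExponential(2)}`);
}
```

The two cases illustrate the trade-off behind a range like 28 to 50 steps: the straighter the learned trajectories, the fewer steps are needed, while any residual curvature is paid for with more steps.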
- Parameters: 8.1B
- Context: 256 tokens
- Open weights with permissive Community License (commercial up to $1M revenue)
- 8.1B-parameter MMDiT, the largest open Stability model
- Three text encoders (CLIP-L, CLIP-G, T5-XXL) for strong prompt adherence
- Native 1024x1024 output, scalable to higher resolutions
- Strong photorealism and aesthetic control
- Compatible with ControlNet, LoRA, IP-Adapter ecosystem
- Self-hostable on a single 24GB+ GPU with quantisation
- Best for: open creative pipelines, fine-tuning, ControlNet workflows, research, on-prem deployments.
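The single-GPU claim in the list above can be sanity-checked with back-of-the-envelope arithmetic. This sketch counts only the weights of the 8.1B-parameter diffusion backbone at a given precision; text encoders, the VAE and activations need additional memory, so treat the numbers as a lower bound rather than a measured requirement. The function name `weightMemoryGB` is illustrative.

```javascript
// Rough weight-memory estimate: parameters * bits per parameter.
// ASSUMPTION: only backbone weights are counted (no text encoders, VAE,
// activations or framework overhead), and 1 GB = 1e9 bytes.
function weightMemoryGB(paramCount, bitsPerParam) {
  return (paramCount * bitsPerParam) / 8 / 1e9;
}

const SD35_LARGE_PARAMS = 8.1e9; // parameter count from the model description

for (const [label, bits] of [["fp16/bf16", 16], ["8-bit", 8], ["4-bit", 4]]) {
  const gb = weightMemoryGB(SD35_LARGE_PARAMS, bits);
  console.log(`${label}: ~${gb.toFixed(2)} GB`);
}
```

At 16-bit precision the backbone weights alone come to roughly 16.2 GB, which is why 8-bit or 4-bit quantisation leaves more headroom on a 24 GB card once the text encoders and activations are added.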
Pretrained on a large licensed and publicly available image-text dataset; data was filtered for safety, deduplication and quality.
License: Stability AI Community License, free for non-commercial use and for commercial use up to $1M annual revenue; an enterprise license is required above that threshold.
Known limitations
- Smaller than FLUX.1 (12B); Imagen 4's parameter count is not public
- Hands and complex anatomy still imperfect
- Commercial use above $1M annual revenue requires an enterprise license
- Open weights have no built-in safety classifier
Related Models
Flux 1.1 Pro Ultra
FLUX 1.1 Pro in ultra mode. Up to 4 megapixel images with raw mode for photorealism.
Flux Dev
Black Forest Labs' development model. Fast, high-quality image generation with LoRA support.
Google Imagen 4
Google's Imagen 4. Text-to-image with strong photorealism and improved typography support.
Google Imagen 4 Ultra
Premium Imagen 4 tier. Highest fidelity, prompt adherence and typography quality from Google.
Start using Stable Diffusion 3.5 Large (Stability) today
Get started with free credits. No credit card required. Access Stable Diffusion 3.5 Large (Stability) and 100+ other models through a single API.