FLUX.1 Depth
FLUX structural control via depth maps. Keep 3D scene layout while changing style/content.
FLUX.1 Depth is an image generation AI model from Replicate, priced at €0.000 per 1M input tokens, with an unknown context window.
API Integration
Use our OpenAI-compatible API to integrate FLUX.1 Depth into your application.
npm install railwail

import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");
const images = await rw.run("flux-1-depth", "A beautiful sunset over Tokyo");
console.log(images[0].url);

// Or use the image() method for full control
const res = await rw.image("flux-1-depth", "A cat in space", {
  size: "1024x1024",
  n: 1,
});
console.log(res.data[0].url);

Deep dive: Black Forest Labs's FLUX.1 Depth
Black Forest Labs was founded in August 2024 in Freiburg, Germany by Robin Rombach, Patrick Esser, Andreas Blattmann and Dominik Lorenz, the team that built Latent Diffusion at LMU Munich and Stable Diffusion at Stability AI. The company raised a $31M seed round led by Andreessen Horowitz in 2024 and launched the FLUX.1 family (pro, dev, schnell) plus the FLUX.1 Tools (Fill, Canny, Depth, Redux). Black Forest Labs positions itself as Europe's flagship open foundation-model lab for generative media.
FLUX.1 Depth is part of the FLUX.1 Tools suite released in late 2024. It extends the base FLUX.1 [dev] / [pro] rectified-flow DiT backbone with an additional conditioning branch that consumes a depth map of a reference image, typically produced by MiDaS, Depth Anything v2 or ZoeDepth. Depth features are injected into DiT blocks via ControlNet-style adapters, so the generated image preserves the 3D layout (perspective, occlusions, scale) of the source while colour, materials, lighting and style are dictated by the text prompt. The model uses T5-XXL plus CLIP-L text encoders and a 16-channel VAE, and samples with 28 to 50 flow-matching steps. Black Forest Labs fine-tuned the conditioning branch on a large licensed corpus of depth-image pairs, building on the rectified-flow training recipe from the SD3 paper (Esser et al. 2024).
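To make the "28 to 50 flow-matching steps" concrete, here is a minimal toy sketch of Euler sampling along a rectified flow. It is not the FLUX.1 sampler: `sampleRectifiedFlow`, `makeToyVelocity` and the three-element vectors are hypothetical names and data chosen for illustration, and the closed-form velocity field stands in for the DiT (with its depth conditioning), which would predict the velocity in the real model.

```javascript
// Toy rectified-flow sampler: integrates dx/dt = v(x, t) with Euler steps
// from t = 1 (pure noise) down to t = 0 (data).
// `velocity` is a hypothetical callback standing in for the DiT backbone.
function sampleRectifiedFlow(noise, velocity, steps = 28) {
  let x = Array.from(noise);
  const dt = 1 / steps;
  for (let k = 0; k < steps; k++) {
    const t = 1 - k * dt;            // current time, never 0 inside the loop
    const v = velocity(x, t);        // predicted velocity at (x, t)
    x = x.map((xi, i) => xi - dt * v[i]); // Euler step towards t = 0
  }
  return x;
}

// Assumption for the toy: the data distribution is a single point `target`.
// On the straight path x_t = (1 - t) * target + t * noise, the velocity is
// noise - target, which can be rewritten as (x_t - target) / t.
function makeToyVelocity(target) {
  return (x, t) => x.map((xi, i) => (xi - target[i]) / t);
}

const target = [0.2, -0.5, 0.9];
const noise = [1.3, 0.4, -2.1];
const sample = sampleRectifiedFlow(noise, makeToyVelocity(target), 28);
console.log(sample); // ends up numerically close to `target`
```

In this toy setting the paths are perfectly straight, so even a handful of steps lands on the target; a learned velocity field is only approximately straight, which is one plausible reason a few dozen steps are used in practice.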
- Parameters: ~12B (FLUX.1 dev backbone) plus depth conditioning branch
- Context: 512 tokens
- Depth-conditioned generation that preserves 3D scene structure
- Works at up to 2 MP output resolution
- Strong photorealism and prompt fidelity inherited from FLUX.1
- Ideal for view-consistent restyling of indoor/outdoor scenes
- Compatible with depth maps from MiDaS, Depth Anything v2, ZoeDepth
- Pairs well with Canny and Fill for compound conditioning workflows
- Best for: interior design visualisation, automotive renders, architectural restyling, world-building.
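The 2 MP ceiling listed above can be turned into concrete dimensions for a given aspect ratio. The sketch below is a hypothetical helper (`fitResolution` is not part of any SDK) and rests on two assumptions: that "2 MP" means 2,000,000 pixels, and that dimensions should be multiples of 16. Whether the hosted endpoint accepts arbitrary sizes is not stated in the source, so the result may need to be checked against the supported values.

```javascript
// Largest width/height for an aspect ratio that stays within a pixel budget.
// Assumptions: budget of 2,000,000 pixels and dimensions snapped down to a
// multiple of 16; both are illustrative defaults, not documented limits.
function fitResolution(aspectW, aspectH, maxPixels = 2000000, multiple = 16) {
  const ratio = aspectW / aspectH;
  const idealHeight = Math.sqrt(maxPixels / ratio);
  const idealWidth = idealHeight * ratio;
  // Round down so the snapped size never exceeds the budget.
  const snap = (v) => Math.max(multiple, Math.floor(v / multiple) * multiple);
  return { width: snap(idealWidth), height: snap(idealHeight) };
}

const { width, height } = fitResolution(16, 9);
// Same "WIDTHxHEIGHT" string format as the size option in the API example.
console.log(`${width}x${height}`);
```

Rounding down rather than to the nearest multiple keeps the product under the budget at the cost of a slightly smaller image.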
Fine-tuned from FLUX.1 base weights on pairs of images and estimated depth maps from a curated licensed corpus. Exact composition is not disclosed.
License: FLUX.1 [dev] non-commercial license for dev weights; FLUX.1 [pro] commercial only via API.
Known limitations
- Requires a precomputed depth map at inference
- Output quality depends on depth-map accuracy
- Dev weights non-commercial only
- Less effective when source depth is very noisy or low-resolution
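Because a depth map has to be prepared before inference, a small preprocessing step is usually needed to turn an estimator's raw output into a grayscale image. The sketch below shows min-max normalisation to 8-bit values. `normalizeDepth` is a hypothetical helper, and the `invert` option is an assumption to cover estimators that emit inverse depth; which brightness convention (near bright or near dark) the model expects is not stated in the source.

```javascript
// Min-max normalise raw depth values to 8-bit grayscale (0..255).
// `invert` flips the map, e.g. for estimators that output inverse depth.
function normalizeDepth(raw, { invert = false } = {}) {
  let min = Infinity;
  let max = -Infinity;
  for (const d of raw) {
    if (!Number.isFinite(d)) continue; // ignore NaN / Infinity holes
    if (d < min) min = d;
    if (d > max) max = d;
  }
  if (min === Infinity) throw new Error("depth map has no finite values");

  const out = new Uint8Array(raw.length);
  const range = max - min;
  if (range === 0) return out; // flat map: no structure to encode, all zeros

  for (let i = 0; i < raw.length; i++) {
    const d = Number.isFinite(raw[i]) ? raw[i] : min; // fill holes with min
    let u = (d - min) / range;                        // scale to 0..1
    if (invert) u = 1 - u;
    out[i] = Math.round(u * 255);
  }
  return out;
}

console.log(normalizeDepth([0.5, 2.0, 3.5]));
console.log(normalizeDepth([0.5, 2.0, 3.5], { invert: true }));
```

A flat or nearly flat result from this step is a cheap warning sign that the source depth is too noisy or too coarse to constrain the layout.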
Related Models
Flux 1.1 Pro Ultra
FLUX 1.1 Pro in ultra mode. Up to 4 megapixel images with raw mode for photorealism.
Flux Dev
Black Forest Labs' development model. Fast, high-quality image generation with LoRA support.
Google Imagen 4
Google's Imagen 4. Text-to-image with strong photorealism and improved typography support.
Google Imagen 4 Ultra
Premium Imagen 4 tier. Highest fidelity, prompt adherence and typography quality from Google.