Image Generation

Create stunning visuals with state-of-the-art AI models

Image generation models for product, marketing and design

Image models turn a text prompt, and optionally a reference image or a mask, into a finished raster image. The category covers everything from photorealistic product shots to vector illustration, including controllable inpainting and outpainting. Teams use them to produce on-brand visuals at scale, bypass a designer's queue, or ship a generative feature inside their own product.

55 models available

Flux 1.1 Pro Ultra

Image · Black Forest Labs
Popular

FLUX 1.1 Pro in ultra mode. Up to 4 megapixel images with raw mode for photorealism.

€0.60 · 15.0s
high-quality, photorealistic

Flux Dev

Image · Black Forest Labs
Popular

Black Forest Labs' development model. Fast, high-quality image generation with LoRA support.

€0.50 · 10.0s
popular, fast, lora

Google Imagen 4

Image · Google DeepMind
Popular

Google's Imagen 4. Text-to-image with strong photorealism and improved typography support.

€0.04
google, imagen, text-to-image

Google Imagen 4 Ultra

Image · Google DeepMind
Popular

Premium Imagen 4 tier. Highest fidelity, prompt adherence and typography quality from Google.

€0.06
google, imagen, text-to-image

Ideogram 3.0

Image · Ideogram
Popular

Ideogram's flagship text-to-image model with industry-leading text rendering and prompt adherence.

€0.09 · 15.0s
ideogram, text-to-image, typography

Midjourney V7

Image · Replicate
New · Popular

The latest Midjourney model. Industry-leading aesthetic quality and prompt adherence for image generation.

€3.00 · 30.0s
high-quality, aesthetic, popular

AuraFlow v0.3

Image · Fal.ai

fal.ai's fully open-source 6.8B flow-based text-to-image model. Up to 1536x1536 resolution.

Free
auraflow, text-to-image, open-weights

BRIA RMBG-1.4

Image · Replicate

BRIA's first commercial-safe background-removal model. Trained on fully-licensed data, suitable for production e-commerce and design pipelines.

€0.03
replicate, background-removal, bria

BRIA RMBG-2.0

Image · Replicate

BRIA's professional background-removal model trained on fully-licensed data. Commercial-safe.

€0.04
bria, image-edit, background-removal

CCSR (Content-Consistent SR)

Image · Replicate

Content-Consistent Super-Resolution model. Reduces hallucination compared to typical diffusion-based upscalers while keeping perceptual quality high.

€0.04
replicate, upscaling, image-restore

Clarity Upscaler

Image · Community

High-resolution image upscaler with creative detail re-imagination via SD-based hallucination. Strong for photography and product shots.

€0.04
replicate, upscaling, creative

CodeFormer

Image · Community

Robust face-restoration model using a transformer-based codebook prior. Handles severe degradation, occlusion, and old-photo restoration with an adjustable fidelity-quality trade-off.

€0.002
replicate, face-restore, upscaling

ControlNet Canny

Image · Replicate

ControlNet conditioned on Canny edge maps. Preserves composition and outlines while restyling with Stable Diffusion 1.5 or SDXL backbones.

€0.01
replicate, style-transfer, image-edit

ControlNet Depth

Image · Replicate

ControlNet conditioned on depth maps. Preserves the 3D scene layout while letting the prompt change style, lighting and content.

€0.01
replicate, style-transfer, image-edit

DALL-E 3

Image · OpenAI

OpenAI's latest image generation model. Excellent at following complex prompts with high fidelity.

€4.00 · 15.0s
high-quality, prompt-following

DreamGaussian

Image · Replicate

Generative Gaussian-splatting model for fast image-to-3D synthesis. Produces textured meshes in two minutes via differentiable rasterization.

€0.09
replicate, 3d-generation, image-to-3d

ESRGAN Classic

Image · Replicate

Enhanced Super-Resolution GAN, the original 2018 architecture. Produces sharp 4x upscales with strong perceptual quality on natural images.

€0.001
replicate, upscaling, esrgan

Flux Schnell

Image · Black Forest Labs

The fastest Flux model. Generate images in under 2 seconds. Great for prototyping.

€0.03 · 2.0s
fast, affordable

FLUX.1 [Schnell]

Image · Black Forest Labs

Black Forest Labs' fastest open-weights image model. Apache-2.0 licensed, ~1-4 step inference.

€0.003
flux, black-forest-labs, open-weights

FLUX.1 Canny

Image · Replicate

FLUX structural control via Canny edge maps. Preserve composition while restyling.

€0.05
flux, black-forest-labs, image-edit

FLUX.1 Depth

Image · Replicate

FLUX structural control via depth maps. Keep 3D scene layout while changing style/content.

€0.05
flux, black-forest-labs, image-edit

FLUX.1 Fill

Image · Replicate

Black Forest Labs' inpainting/outpainting model for FLUX. Fill masked regions with prompt-guided content.

€0.05
flux, black-forest-labs, image-edit

FLUX.1 Redux

Image · Replicate

FLUX image-variation adapter. Generate variations and remixes from a reference image.

€0.03
flux, black-forest-labs, image-edit

Get3D (NVIDIA)

Image · Custom

NVIDIA GET3D generative model for textured 3D shapes. Trained on category-specific datasets, it produces meshes with high-quality textures.

Free
nvidia, 3d-generation, open-weights

GFPGAN v1.4

Image · Tencent ARC

Tencent ARC face-restoration GAN. Reconstructs realistic facial detail in low-quality or compressed photos using a pretrained StyleGAN2 prior.

€0.002
replicate, face-restore, upscaling

Hunyuan3D 2.0

Image · Tencent

Tencent's Hunyuan3D 2.0 image-to-3D pipeline. Two-stage shape and texture generation producing high-resolution textured meshes.

€0.21
replicate, 3d-generation, image-to-3d

Hunyuan3D 2.1

Image · Tencent
New

Refreshed Hunyuan3D 2.1 with improved texture fidelity and PBR-material support. Image-to-3D with textured GLB output.

€0.24
replicate, 3d-generation, image-to-3d

Ideogram 2.0 Turbo

Image · Ideogram

Ideogram's fast text-to-image variant. Strong typography and logo rendering at low latency.

€0.05
ideogram, text-to-image, typography

InstantMesh

Image · Replicate

Image-to-3D mesh generator from sparse-view diffusion. Produces textured meshes in under one minute on a single A100.

€0.12
replicate, 3d-generation, image-to-3d

InstructPix2Pix

Image · Replicate

Berkeley InstructPix2Pix. Edits an image from natural-language instructions in a single forward pass. Trained on GPT-3 plus Stable Diffusion synthetic pairs.

€0.01
replicate, style-transfer, image-edit

IP-Adapter FaceID Plus v2

Image · Replicate

Tencent's face-identity conditioning adapter for SD/SDXL. Face embedding + CLIP for ID-consistent generation.

Free
tencent, image-edit, face-id

Janus Pro 7B

Image · Replicate

DeepSeek's unified multimodal model. Decouples vision encoding for both understanding and generation tasks.

Free
deepseek, janus, open-weights

Kuaishou Kolors

Image · Replicate

Kuaishou's bilingual (CN/EN) latent diffusion text-to-image model with strong text rendering.

Free
kuaishou, text-to-image, open-weights

Magnific-Style Upscaler

Image · Replicate

Detail-hallucinating upscaler in the Magnific style. Adds plausible high-frequency texture using a Stable Diffusion refiner conditioned on the low-res input.

€0.06
replicate, upscaling, creative

PhotoMaker

Image · Tencent ARC

Tencent ARC PhotoMaker. Identity-preserving stylized photo generation from a stacked-ID embedding. Realistic re-styling of a subject in seconds.

€0.03
replicate, style-transfer, image-edit

Playground v3 (Design)

Image · Playground AI

Playground's text-to-image model focused on graphic design aesthetics and embedded typography.

Free
playground, text-to-image, design

Point-E

Image · OpenAI

OpenAI Point-E text-to-point-cloud system. Fast 3D point-cloud generation from text, optionally lifted to a mesh via marching cubes.

€0.03
replicate, 3d-generation, openai

Real-ESRGAN 4x

Image · Community

AI upscaler that increases image resolution up to 4x while preserving texture and detail. Trained on synthetic and real data to reduce common ESRGAN artifacts.

€0.001
replicate, upscaling, image-restore

Real-ESRGAN Anime 4x

Image · Replicate

Real-ESRGAN variant fine-tuned for anime, manga, and illustrated artwork. 4x upscaling with cartoon-aware artifact suppression.

€0.001
replicate, upscaling, anime

Recraft V3

Image · Replicate
New

State-of-the-art image generation optimized for design and branding. SVG vector output support.

€0.60 · 12.0s
design, vector, branding

Recraft V3 Realistic

Image · Recraft

Recraft's high-prompt-adherence raster image model. Strong layout control and brand-style consistency.

€0.04
recraft, text-to-image, design

Recraft V3 SVG

Image · Recraft

Recraft's vector/SVG generation model. Editable illustrations and icons from text.

€0.08
recraft, text-to-svg, vector

Rembg

Image · Community

Open-source background-removal tool wrapping U2Net. Produces alpha mattes for photos, products and people with no manual masking.

€0.001
replicate, background-removal, matting

Shap-E (OpenAI)

Image · OpenAI

OpenAI Shap-E text/image to 3D. Generates implicit neural representations renderable as textured meshes or NeRFs.

€0.04
replicate, 3d-generation, openai

Stable Diffusion 3.5 Large (Stability)

Image · Custom

Stability AI's 8B-parameter flagship SD3.5 model. Strong prompt adherence and aesthetic quality.

€0.07
stability, text-to-image, open-weights

Stable Diffusion 3.5 Large Turbo

Image · Custom

Distilled 4-step variant of SD3.5 Large. 8B params, ~4x faster inference at competitive quality.

€0.04
stability, text-to-image, open-weights

Stable Diffusion 3.5 Medium

Image · Custom

Stability AI's 2.5B-parameter SD3.5 with strong quality/speed trade-off. Consumer-GPU friendly.

€0.04
stability, text-to-image, open-weights

Stable Diffusion XL

Image · Stability AI

Stability AI's SDXL model via Replicate. High-quality image generation with extensive customization.

€0.20 · 8.0s
open-source, customizable

SUPIR Upscaler

Image · Community

SUPIR (Scaling-Up Image Restoration) photo-real restoration model. Combines SDXL prior with language-guided controls for severely degraded inputs.

€0.06
replicate, upscaling, image-restore

Swin2SR

Image · Replicate

Transformer-based image super-resolution using Swin-V2 attention. Handles classical, lightweight, real-world, and compressed-input variants with 2x/4x upscaling.

€0.002
replicate, upscaling, transformer

T2I-Adapter Color

Image · Replicate

Tencent T2I-Adapter color-guided generation for SDXL. Lightweight adapter that conditions image generation on a color reference image.

€0.009
replicate, style-transfer, image-edit

Transparent Background

Image · Replicate

PyTorch background-removal tool supporting multiple modes: base, fast and high-quality. Produces RGBA outputs and is suitable for batch processing.

€0.001
replicate, background-removal, open-source

TRELLIS (3D)

Image · Replicate

Microsoft TRELLIS image-to-3D model. Generates textured 3D assets in GLB or Gaussian-splat format from a single reference image.

€0.18
replicate, 3d-generation, image-to-3d

TripoSR

Image · Replicate

Stability AI and Tripo single-image 3D reconstruction model. Generates 3D meshes from a single image in roughly half a second.

€0.03
replicate, 3d-generation, image-to-3d

U2Net Saliency

Image · Replicate

Salient-object detection network used for background removal and matting. Nested U-Net architecture trained on DUTS-TR for general scenes.

€0.001
replicate, background-removal, saliency

Top image generation picks

Hand-picked across four common criteria — resolved against the live catalog so the picks track price and performance changes.

Best overall
Flux 1.1 Pro Ultra

FLUX 1.1 Pro in ultra mode. Up to 4 megapixel images with raw mode for photorealism.

Cheapest
AuraFlow v0.3

fal.ai's fully open-source 6.8B flow-based text-to-image model. Up to 1536x1536 resolution.

Highest resolution
Flux 1.1 Pro Ultra

FLUX 1.1 Pro in ultra mode. Up to 4 megapixel images with raw mode for photorealism.

Fastest
Flux Schnell

The fastest Flux model. Generate images in under 2 seconds. Great for prototyping.
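
As a rough illustration of how the cheapest and fastest picks could be resolved from catalog data, the sketch below ranks a handful of rows copied from the listing on this page. The `Model` class and the `cheapest` and `fastest` helpers are hypothetical names introduced for this example, and treating "Free" as a price of 0.0 is an assumption; the page does not say how its picks are actually computed.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Model:
    name: str
    price_eur: float            # listed price; 0.0 stands in for "Free" (assumption)
    latency_s: Optional[float]  # None when the listing shows no latency


# A few rows copied from the listing on this page, not the full catalog.
CATALOG = [
    Model("Flux 1.1 Pro Ultra", 0.60, 15.0),
    Model("Flux Dev", 0.50, 10.0),
    Model("Ideogram 3.0", 0.09, 15.0),
    Model("Midjourney V7", 3.00, 30.0),
    Model("AuraFlow v0.3", 0.0, None),
    Model("Flux Schnell", 0.03, 2.0),
    Model("Stable Diffusion XL", 0.20, 8.0),
]


def cheapest(models):
    """Return the model with the lowest listed price."""
    return min(models, key=lambda m: m.price_eur)


def fastest(models):
    """Return the model with the lowest listed latency, skipping unlisted ones."""
    timed = [m for m in models if m.latency_s is not None]
    return min(timed, key=lambda m: m.latency_s)


print(cheapest(CATALOG).name)  # AuraFlow v0.3
print(fastest(CATALOG).name)   # Flux Schnell
```

Models without a listed latency are left out of the speed ranking rather than treated as instantaneous, which keeps a missing value from winning by default.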

Unlike text models, image generation is billed per call rather than per token. A 1024×1024 image costs between half a cent (open-weights SDXL Turbo) and fifteen cents (flagship Imagen or FLUX Pro). Higher resolutions and more sampling steps cost proportionally more. Some providers expose an editing endpoint at a separate rate; check the model card before integrating.
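
To make the per-call billing concrete, here is a minimal budgeting sketch. It assumes cost scales linearly with pixel count relative to a 1024×1024 baseline and linearly with step count; that is one reading of "proportionally", not a provider's published formula. The function name `estimate_batch_cost` and its parameters are hypothetical.

```python
def estimate_batch_cost(base_price_eur, n_images, width=1024, height=1024,
                        steps=None, base_steps=None):
    """Estimate the cost of a batch of generated images in euros.

    base_price_eur is the price of one 1024x1024 image at the default
    step count. Linear scaling is an assumption made for illustration;
    real providers may price in tiers instead.
    """
    if n_images < 0:
        raise ValueError("n_images must be non-negative")
    pixel_factor = (width * height) / (1024 * 1024)
    step_factor = 1.0
    if steps is not None and base_steps:
        step_factor = steps / base_steps
    return base_price_eur * pixel_factor * step_factor * n_images


# The two ends of the price range quoted above, for 1000 images.
low = estimate_batch_cost(0.005, n_images=1000)
high = estimate_batch_cost(0.15, n_images=1000)
print(f"1000 images at 1024x1024: EUR {low:.2f} to EUR {high:.2f}")
```

Under these assumptions, doubling both dimensions to 2048×2048 quadruples the per-image price, which is worth checking against the model card before committing to a high-resolution batch.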

The central trade-off is photorealism versus controllability. The diffusion flagships (FLUX 1.1 Pro, Imagen 3, Recraft V3) deliver magazine-grade quality but ignore detailed compositional instructions about half the time. Smaller models (SDXL, Playground V3, Stable Diffusion 3.5) cost ten times less, render in under two seconds, and let you steer the result through ControlNet, IP-Adapter or LoRA. For batch production, the small, steerable pipeline almost always wins; for a one-off hero shot, choose the flagship.

Watch out for prompt dilution: most diffusion models cap the useful prompt length at around 75 tokens, so loading twelve adjectives and three style references typically averages them all out instead of stacking them. Write the subject, the action and the lighting first; everything after the third clause has diminishing influence on the result.
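
One way to apply this ordering advice is to assemble prompts from prioritized clauses and stop once a budget is reached. The sketch below is illustrative only: `build_prompt` is a hypothetical helper, and it counts whitespace-separated words as a crude stand-in for tokens, so a real tokenizer will usually report a higher count than this estimate.

```python
def build_prompt(subject, action, lighting, extras=(), budget=75):
    """Join clauses in priority order, stopping before the budget is exceeded.

    Word count is a rough proxy for token count (assumption); use the
    model's own tokenizer when an exact limit matters.
    """
    clauses = [subject, action, lighting, *extras]
    kept, used = [], 0
    for clause in clauses:
        cost = len(clause.split())
        if used + cost > budget:
            break  # later clauses have lower priority, so stop here
        kept.append(clause)
        used += cost
    return ", ".join(kept)


prompt = build_prompt(
    "a ceramic mug",
    "steaming on a wooden desk",
    "soft window light",
    extras=["studio quality", "word " * 80],
)
print(prompt)
```

The oversized final clause is dropped while the subject, action and lighting are kept, which mirrors the advice to put the clauses that matter most first.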

Licenses matter: most providers grant a perpetual commercial-use license on generated images, but a few (the FLUX Schnell free tier, some open checkpoints) restrict use to non-commercial purposes. The model card states this; read it before putting an output on a billboard.

The top picks cover the photorealism flagship, the cheapest workhorse, the longest-prompt model and the fastest real-time option in the category.
