Image Generation

Create stunning visuals with state-of-the-art AI models

Image generation models for product, marketing, and design

Image models turn a text prompt, optionally with a reference image or a mask, into a finished raster image. The category covers everything from photorealistic product shots through vector-style illustrations to controllable inpainting and outpainting. Reach for an image model when you need on-brand visuals at scale, when the designer queue is the bottleneck, or when you want to ship a generative feature inside your own product.

Unlike text models, image generation is billed per call rather than per token. A single 1024×1024 image costs anywhere from half a cent (open-weights SDXL Turbo) to fifteen cents (flagship Imagen or FLUX Pro). Higher resolutions and more diffusion steps cost proportionally more. Some providers expose a separate editing endpoint at a different rate; check the model card before integrating.
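The per-call pricing described above can be turned into a rough budget estimate. The sketch below is only an illustration: the `ImageJob` type, the `estimate_cost` helper, and the 30-step baseline are assumptions introduced here, and the linear scaling simply takes "proportionally more" at face value. Real providers publish their own price tables, which should take precedence.

```python
from dataclasses import dataclass

BASE_PIXELS = 1024 * 1024  # reference resolution quoted in the text


@dataclass(frozen=True)
class ImageJob:
    """Hypothetical description of one batch of identical generations."""
    width: int
    height: int
    steps: int
    count: int = 1


def estimate_cost(job: ImageJob, base_price: float, base_steps: int = 30) -> float:
    """Estimate the batch cost, scaling linearly with pixels and steps.

    base_price is the price of one 1024x1024 image at base_steps steps.
    base_steps=30 is an assumed baseline, not a provider default.
    """
    if min(job.width, job.height, job.steps, job.count) <= 0:
        raise ValueError("width, height, steps and count must be positive")
    if base_price < 0 or base_steps <= 0:
        raise ValueError("base_price must be >= 0 and base_steps > 0")
    pixel_factor = (job.width * job.height) / BASE_PIXELS
    step_factor = job.steps / base_steps
    return base_price * pixel_factor * step_factor * job.count


if __name__ == "__main__":
    batch = ImageJob(width=1024, height=1024, steps=30, count=100)
    print(f"{estimate_cost(batch, base_price=0.005):.2f}")
```

Under these assumptions a 2048×2048 render at twice the baseline step count comes out eight times more expensive than the reference image, which is usually a good reason to prototype at the base resolution first.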

The main trade-off is photorealism versus controllability. Flagship diffusion models (FLUX 1.1 Pro, Imagen 3, Recraft V3) produce magazine-quality output but ignore detailed composition instructions roughly half the time. Smaller models (SDXL, Playground V3, Stable Diffusion 3.5) cost ten times less, render in under two seconds, and let you steer the result through ControlNet, IP-Adapter, or LoRA. For production batch work the smaller, controllable pipeline almost always wins; for one-off hero shots, reach for a flagship.

Watch for context dilution in image prompts: most diffusion models cap the useful prompt length at around 75 tokens, so cramming in twelve adjectives and three style references usually averages them all out instead of stacking them. Lead with subject, action, and lighting; everything after the third clause has diminishing influence on the result.
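The ordering advice above can be sketched as a small prompt builder. The function name `build_prompt` and its parameters are hypothetical, and the token count is a crude whitespace split used as a stand-in for the model's real tokenizer, which typically produces more tokens than words. Treat the budget as approximate.

```python
from typing import Iterable


def build_prompt(
    subject: str,
    action: str,
    lighting: str,
    extras: Iterable[str] = (),
    max_tokens: int = 75,
) -> str:
    """Place subject, action and lighting first, then add extras while they fit.

    Extras are taken in priority order; the first one that would exceed
    the budget ends the prompt, so later modifiers never displace earlier ones.
    """
    def n_tokens(text: str) -> int:
        # Assumption: whitespace words approximate tokens. A real tokenizer
        # will usually count higher, so leave some headroom.
        return len(text.split())

    clauses = [subject, action, lighting]
    used = sum(n_tokens(clause) for clause in clauses)
    for extra in extras:
        cost = n_tokens(extra)
        if used + cost > max_tokens:
            break
        clauses.append(extra)
        used += cost
    return ", ".join(clauses)


if __name__ == "__main__":
    print(build_prompt(
        "a ceramic mug",
        "steaming on a desk",
        "soft morning light",
        extras=["shallow depth of field", "film grain"],
    ))
```

The sketch does not trim the three core clauses themselves; if those alone exceed the budget, shortening them by hand is likely to give better results than automatic truncation.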

Licensing matters: most providers grant a perpetual commercial license for generated images, but some (the free tier of FLUX Schnell, some open checkpoints) restrict output to non-commercial use. The model card spells this out; read it before you put output on a billboard.

The top picks below cover the photorealism flagship, the cheapest model for everyday work, the model with the longest prompt, and the fastest real-time option in the category.

55 models available

Flux 1.1 Pro Ultra

Image · Black Forest Labs
Popular

FLUX 1.1 Pro in ultra mode. Up to 4 megapixel images with raw mode for photorealism.

€0.60 · 15.0s
high-quality, photorealistic

Flux Dev

Image · Black Forest Labs
Popular

Black Forest Labs' development model. Fast, high-quality image generation with LoRA support.

€0.50 · 10.0s
popular, fast, lora

Google Imagen 4

Image · Google DeepMind
Popular

Google's Imagen 4. Text-to-image with strong photorealism and improved typography support.

€0.04
google, imagen, text-to-image

Google Imagen 4 Ultra

Image · Google DeepMind
Popular

Premium Imagen 4 tier. Highest fidelity, prompt adherence and typography quality from Google.

€0.06
google, imagen, text-to-image

Ideogram 3.0

Image · Ideogram
Popular

Ideogram's flagship text-to-image model with industry-leading text rendering and prompt adherence.

€0.09 · 15.0s
ideogram, text-to-image, typography

Midjourney V7

Image · Replicate
New · Popular

The latest Midjourney model. Industry-leading aesthetic quality and prompt adherence for image generation.

€3.00 · 30.0s
high-quality, aesthetic, popular

AuraFlow v0.3

Image · Fal.ai

fal.ai's fully open-source 6.8B flow-based text-to-image model. Up to 1536x1536 resolution.

Free
auraflow, text-to-image, open-weights

BRIA RMBG-1.4

Image · Replicate

BRIA's first commercial-safe background-removal model. Trained on fully-licensed data, suitable for production e-commerce and design pipelines.

€0.03
replicate, background-removal, bria

BRIA RMBG-2.0

Image · Replicate

BRIA's professional background-removal model trained on fully-licensed data. Commercial-safe.

€0.04
bria, image-edit, background-removal

CCSR (Content-Consistent SR)

Image · Replicate

Content-Consistent Super-Resolution model. Reduces hallucination compared to typical diffusion-based upscalers while keeping perceptual quality high.

€0.04
replicate, upscaling, image-restore

Clarity Upscaler

Image · Community

High-resolution image upscaler with creative detail re-imagination via SD-based hallucination. Strong for photography and product shots.

€0.04
replicate, upscaling, creative

CodeFormer

Image · Community

Robust face-restoration model using a transformer-based codebook prior. Handles severe degradation, occlusion, and old-photo restoration with adjustable fidelity-quality tradeoff.

€0.002
replicate, face-restore, upscaling

ControlNet Canny

Image · Replicate

ControlNet conditioned on Canny edge maps. Preserves composition and outlines while restyling with Stable Diffusion 1.5 or SDXL backbones.

€0.01
replicate, style-transfer, image-edit

ControlNet Depth

Image · Replicate

ControlNet conditioned on depth maps. Preserves the 3D scene layout while letting the prompt change style, lighting and content.

€0.01
replicate, style-transfer, image-edit

DALL-E 3

Image · OpenAI

OpenAI's latest image generation model. Excellent at following complex prompts with high fidelity.

€4.00 · 15.0s
high-quality, prompt-following

DreamGaussian

Image · Replicate

Generative Gaussian-splatting model for fast image-to-3D synthesis. Produces textured meshes in two minutes via differentiable rasterization.

€0.09
replicate, 3d-generation, image-to-3d

ESRGAN Classic

Image · Replicate

Enhanced Super-Resolution GAN, the original 2018 architecture. Produces sharp 4x upscales with strong perceptual quality on natural images.

€0.001
replicate, upscaling, esrgan

Flux Schnell

Image · Black Forest Labs

The fastest Flux model. Generate images in under 2 seconds. Great for prototyping.

€0.03 · 2.0s
fast, affordable

FLUX.1 [Schnell]

Image · Black Forest Labs

Black Forest Labs' fastest open-weights image model. Apache-2.0 licensed, ~1-4 step inference.

€0.003
flux, black-forest-labs, open-weights

FLUX.1 Canny

Image · Replicate

FLUX structural control via Canny edge maps. Preserve composition while restyling.

€0.05
flux, black-forest-labs, image-edit

FLUX.1 Depth

Image · Replicate

FLUX structural control via depth maps. Keep 3D scene layout while changing style/content.

€0.05
flux, black-forest-labs, image-edit

FLUX.1 Fill

Image · Replicate

Black Forest Labs' inpainting/outpainting model for FLUX. Fill masked regions with prompt-guided content.

€0.05
flux, black-forest-labs, image-edit

FLUX.1 Redux

Image · Replicate

FLUX image-variation adapter. Generate variations and remixes from a reference image.

€0.03
flux, black-forest-labs, image-edit

Get3D (NVIDIA)

Image · Custom

NVIDIA GET3D generative model for textured 3D shapes. Trained on category-specific datasets producing meshes with high-quality textures.

Free
nvidia, 3d-generation, open-weights

GFPGAN v1.4

Image · Tencent ARC

Tencent ARC face-restoration GAN. Reconstructs realistic facial detail in low-quality or compressed photos using a pretrained StyleGAN2 prior.

€0.002
replicate, face-restore, upscaling

Hunyuan3D 2.0

Image · Tencent

Tencent's Hunyuan3D 2.0 image-to-3D pipeline. Two-stage shape and texture generation producing high-resolution textured meshes.

€0.21
replicate, 3d-generation, image-to-3d

Hunyuan3D 2.1

Image · Tencent
New

Refreshed Hunyuan3D 2.1 with improved texture fidelity and PBR-material support. Image-to-3D with textured GLB output.

€0.24
replicate, 3d-generation, image-to-3d

Ideogram 2.0 Turbo

Image · Ideogram

Ideogram's fast text-to-image variant. Strong typography and logo rendering at low latency.

€0.05
ideogram, text-to-image, typography

InstantMesh

Image · Replicate

Image-to-3D mesh generator from sparse-view diffusion. Produces textured meshes in under one minute on a single A100.

€0.12
replicate, 3d-generation, image-to-3d

InstructPix2Pix

Image · Replicate

Berkeley InstructPix2Pix. Edits an image from natural-language instructions in a single forward pass. Trained on GPT-3 plus Stable Diffusion synthetic pairs.

€0.01
replicate, style-transfer, image-edit

IP-Adapter FaceID Plus v2

Image · Replicate

Tencent's face-identity conditioning adapter for SD/SDXL. Face embedding + CLIP for ID-consistent generation.

Free
tencent, image-edit, face-id

Janus Pro 7B

Image · Replicate

DeepSeek's unified multimodal model. Decouples vision encoding for both understanding and generation tasks.

Free
deepseek, janus, open-weights

Kuaishou Kolors

Image · Replicate

Kuaishou's bilingual (CN/EN) latent diffusion text-to-image model with strong text rendering.

Free
kuaishou, text-to-image, open-weights

Magnific-Style Upscaler

Image · Replicate

Detail-hallucinating upscaler in the Magnific style. Adds plausible high-frequency texture using a Stable Diffusion refiner conditioned on the low-res input.

€0.06
replicate, upscaling, creative

PhotoMaker

Image · Tencent ARC

Tencent ARC PhotoMaker. Identity-preserving stylized photo generation from a stacked-ID embedding. Realistic re-styling of a subject in seconds.

€0.03
replicate, style-transfer, image-edit

Playground v3 (Design)

Image · Playground AI

Playground's text-to-image model focused on graphic design aesthetics and embedded typography.

Free
playground, text-to-image, design

Point-E

Image · OpenAI

OpenAI Point-E text-to-point-cloud system. Fast 3D point-cloud generation from text, optionally lifted to a mesh via marching cubes.

€0.03
replicate, 3d-generation, openai

Real-ESRGAN 4x

Image · Community

AI upscaler that increases image resolution up to 4x while preserving texture and detail. Trained on synthetic and real data to reduce common ESRGAN artifacts.

€0.001
replicate, upscaling, image-restore

Real-ESRGAN Anime 4x

Image · Replicate

Real-ESRGAN variant fine-tuned for anime, manga, and illustrated artwork. 4x upscaling with cartoon-aware artifact suppression.

€0.001
replicate, upscaling, anime

Recraft V3

Image · Replicate
New

State-of-the-art image generation optimized for design and branding. SVG vector output support.

€0.60 · 12.0s
design, vector, branding

Recraft V3 Realistic

Image · Recraft

Recraft's high-prompt-adherence raster image model. Strong layout control and brand-style consistency.

€0.04
recraft, text-to-image, design

Recraft V3 SVG

Image · Recraft

Recraft's vector/SVG generation model. Editable illustrations and icons from text.

€0.08
recraft, text-to-svg, vector

Rembg

Image · Community

Open-source background-removal tool wrapping U2Net. Produces alpha mattes for photos, products and people with no manual masking.

€0.001
replicate, background-removal, matting

Shap-E (OpenAI)

Image · OpenAI

OpenAI Shap-E text/image to 3D. Generates implicit neural representations renderable as textured meshes or NeRFs.

€0.04
replicate, 3d-generation, openai

Stable Diffusion 3.5 Large (Stability)

Image · Custom

Stability AI's 8B-parameter flagship SD3.5 model. Strong prompt adherence and aesthetic quality.

€0.07
stability, text-to-image, open-weights

Stable Diffusion 3.5 Large Turbo

Image · Custom

Distilled 4-step variant of SD3.5 Large. 8B params, ~4x faster inference at competitive quality.

€0.04
stability, text-to-image, open-weights

Stable Diffusion 3.5 Medium

Image · Custom

Stability AI's 2.5B-parameter SD3.5 with strong quality/speed trade-off. Consumer-GPU friendly.

€0.04
stability, text-to-image, open-weights

Stable Diffusion XL

Image · Stability AI

Stability AI's SDXL model via Replicate. High-quality image generation with extensive customization.

€0.20 · 8.0s
open-source, customizable

SUPIR Upscaler

Image · Community

SUPIR (Scaling-Up Image Restoration) photo-real restoration model. Combines SDXL prior with language-guided controls for severely degraded inputs.

€0.06
replicate, upscaling, image-restore

Swin2SR

Image · Replicate

Transformer-based image super-resolution using Swin-V2 attention. Handles classical, lightweight, real-world, and compressed-input variants with 2x/4x upscaling.

€0.002
replicate, upscaling, transformer

T2I-Adapter Color

Image · Replicate

Tencent T2I-Adapter color-guided generation for SDXL. Lightweight adapter that conditions image generation on a color reference image.

€0.009
replicate, style-transfer, image-edit

Transparent Background

Image · Replicate

PyTorch background-removal tool supporting multiple modes: base, fast and high-quality. Produces RGBA outputs and is suitable for batch processing.

€0.001
replicate, background-removal, open-source

TRELLIS (3D)

Image · Replicate

Microsoft TRELLIS image-to-3D model. Generates textured 3D assets in GLB or Gaussian-splat format from a single reference image.

€0.18
replicate, 3d-generation, image-to-3d

TripoSR

Image · Replicate

Stability AI and Tripo single-image 3D reconstruction model. Generates 3D meshes from a single image in roughly half a second.

€0.03
replicate, 3d-generation, image-to-3d

U2Net Saliency

Image · Replicate

Salient-object detection network used for background removal and matting. Nested U-Net architecture trained on DUTS-TR for general scenes.

€0.001
replicate, background-removal, saliency

Top image generation picks

Hand-picked across four common criteria and resolved against the live catalog, so the picks track price and performance changes.

Best overall
Flux 1.1 Pro Ultra

FLUX 1.1 Pro in ultra mode. Up to 4 megapixel images with raw mode for photorealism.

Cheapest
AuraFlow v0.3

fal.ai's fully open-source 6.8B flow-based text-to-image model. Up to 1536x1536 resolution.

Highest resolution
Flux 1.1 Pro Ultra

FLUX 1.1 Pro in ultra mode. Up to 4 megapixel images with raw mode for photorealism.

Fastest
Flux Schnell

The fastest Flux model. Generate images in under 2 seconds. Great for prototyping.
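The page does not describe how picks are resolved against the catalog. One plausible reading, sketched below purely as an assumption, is a minimum over listed price for the cheapest pick and over listed latency for the fastest. The `CatalogEntry` type and both helpers are hypothetical; the sample values are copied from a few listings on this page, with "Free" represented as 0.0.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class CatalogEntry:
    """Hypothetical in-memory form of one model card."""
    name: str
    price_eur: float                   # 0.0 stands for cards listed as "Free"
    latency_s: Optional[float] = None  # None when the card shows no latency


def cheapest(entries: List[CatalogEntry]) -> CatalogEntry:
    """Return the entry with the lowest listed price (first one on ties)."""
    return min(entries, key=lambda e: e.price_eur)


def fastest(entries: List[CatalogEntry]) -> CatalogEntry:
    """Return the entry with the lowest listed latency, ignoring untimed cards."""
    timed = [e for e in entries if e.latency_s is not None]
    if not timed:
        raise ValueError("no entry has a listed latency")
    return min(timed, key=lambda e: e.latency_s)


# A handful of listings from this page, used as sample data only.
SAMPLE = [
    CatalogEntry("Flux 1.1 Pro Ultra", 0.60, 15.0),
    CatalogEntry("Flux Dev", 0.50, 10.0),
    CatalogEntry("AuraFlow v0.3", 0.0),
    CatalogEntry("Flux Schnell", 0.03, 2.0),
    CatalogEntry("Stable Diffusion XL", 0.20, 8.0),
]

if __name__ == "__main__":
    print(cheapest(SAMPLE).name)
    print(fastest(SAMPLE).name)
```

Several cards on this page are listed as free, so a price-only minimum needs a tie-break rule in practice; this sketch simply keeps the first match.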
