184+ AI Models

AI Models

Browse and explore all available AI models.

184 models available

Claude Code

CodeAnthropic
NewPopular

Anthropic's specialized coding agent. Autonomous code writing, debugging, and refactoring with deep codebase understanding.

€0.03
agentautonomousrefactoring

Claude Opus 4

Text & ChatAnthropic
NewPopular

Anthropic's most powerful model. Exceptional at complex analysis, nuanced writing, math, and coding. Sets new benchmarks across evaluation suites.

€0.10
frontierreasoningcoding

Claude Sonnet 4

Text & ChatAnthropic
NewPopular

Anthropic's balanced model offering excellent performance at a lower cost than Opus. Great all-rounder for production workloads.

€0.02
balancedproductionreliable

Cursor (GPT-4o)

CodeOpenAI
Popular

AI-powered code editor backed by GPT-4o. Inline code completion, chat-based editing, and codebase-aware suggestions.

€0.01
IDEcompletionpopular

DALL-E 3

ImageOpenAI
Popular

OpenAI's latest image generation model. Creates highly detailed, accurate images from text descriptions with excellent prompt adherence.

€0.05
text-to-imagehigh-qualitypopular

DeepSeek R1

Text & ChatDeepSeek
Popular

DeepSeek's reasoning model trained with reinforcement learning. Performs chain-of-thought reasoning, rivaling OpenAI's o1 on math and science benchmarks.

€0.007
reasoningopen-weightmath

DeepSeek V3

Text & ChatDeepSeek
Popular

DeepSeek's flagship 671B MoE model. Competitive with GPT-4o on many benchmarks. Exceptional at coding, math, and Chinese language tasks.

€0.003
open-weightMoEcoding

ElevenLabs Multilingual v2

TTSElevenLabs
Popular

ElevenLabs' most capable TTS model. Natural-sounding speech in 29 languages with emotion control and voice cloning.

€0.04
29 languagesvoice-cloningnatural

Flux 1.1 Pro

ImageBlack Forest Labs
Popular

Black Forest Labs' most capable image model. Photorealistic outputs with exceptional text rendering and prompt following.

€0.07
photorealistictext-renderingpremium

Flux Pro 1.1 Ultra

ImageBlack Forest Labs
NewPopular

Black Forest Labs' highest resolution Flux model. Generates up to 4MP images with exceptional detail and prompt adherence.

€0.08
ultra-high-res4MPpremium

Gemini 2.0 Flash (Multimodal)

MultimodalGoogle DeepMind
Popular

Google's multimodal model accepting text, images, audio, and video. Native multimodal understanding across input types.

€0.007
visionaudiovideo-understanding

Gemini 2.5 Pro

Text & ChatGoogle DeepMind
NewPopular

Google's most advanced thinking model with built-in reasoning capabilities. Excels at complex tasks requiring multi-step reasoning.

€0.01
reasoning1M contextfrontier

GitHub Copilot

CodeOpenAI
Popular

GitHub's AI pair programmer. Real-time code suggestions, chat assistance, and PR reviews powered by OpenAI models.

€0.01
IDEpair-programmerGitHub

GPT-4.5 Preview

Text & ChatOpenAI
NewPopular

OpenAI's latest frontier model with improved reasoning, creativity, and instruction following. Significant improvements over GPT-4o.

€0.10
frontierreasoningcreative

GPT-4o

Text & ChatOpenAI
Popular

OpenAI's most capable multimodal model. Accepts text and image inputs, produces text outputs. Excellent for complex reasoning, creative writing, and analysis.

€0.01
multimodalreasoningflagship

GPT-4o (Vision)

MultimodalOpenAI
Popular

GPT-4o's vision capabilities. Analyze images, charts, documents, and screenshots with detailed understanding and reasoning.

€0.01
visiondocument-analysischarts

GR00T N1

RoboticsNVIDIA
NewPopular

NVIDIA's foundation model for humanoid robots. World-model-based VLA enabling whole-body control and human-like manipulation.

€0.13
humanoidNVIDIAwhole-body

Grok 3

Text & ChatxAI
NewPopular

xAI's most powerful model. Trained on massive compute with strong reasoning, humor, and real-time knowledge from X (Twitter).

€0.04
xAIreasoningreal-time

Llama 3.3 70B

Text & ChatMeta
Popular

Meta's latest 70B model delivering performance comparable to Llama 3.1 405B at a fraction of the cost. Excellent open-source option.

€0.004
open-sourceefficientMeta

Llama 4 Maverick

Text & ChatMeta
NewPopular

Meta's powerful Llama 4 Maverick model. A larger, more capable variant with strong reasoning, creative writing, and multilingual abilities.

€0.005
open-sourcenext-genpowerful

Luma Ray2

VideoLuma AI
NewPopular

Luma AI's latest video generation model. High-quality, realistic video creation with excellent motion dynamics.

€0.52
realisticmotionpremium

Midjourney v6.1

ImageMidjourney
Popular

Midjourney's latest model known for stunning artistic quality. Excels at creative, aesthetic images with a distinctive artistic style.

€0.13
artisticcreativepremium

o1

Text & ChatOpenAI
Popular

OpenAI's reasoning model that thinks before answering. Uses chain-of-thought to solve complex math, science, and coding problems.

€0.05
reasoningmathscience

OpenVLA

RoboticsOpenVLA
NewPopular

Open-source 7B Vision-Language-Action model built on Prismatic VLM and Llama 2. Converts visual observations and language goals into robot actions.

€0.05
open-source7BLlama-based

Pi0

RoboticsPhysical Intelligence
NewPopular

Physical Intelligence's foundation model for robot control. Combines vision-language understanding with dexterous manipulation across diverse tasks.

€0.13
dexterousfoundation-modelPhysical Intelligence

RT-2

RoboticsGoogle DeepMind
Popular

Google DeepMind's Robotic Transformer 2. Vision-Language-Action model that translates visual observations and language instructions directly into robot actions.

€0.13
roboticsGoogle DeepMindmanipulation

Runway Gen-3 Alpha

VideoRunway
Popular

Runway's latest video generation model. Professional-quality video creation with fine-grained control over motion and style.

€0.52
professionalmotion-controlcreative

Sora

VideoOpenAI
NewPopular

OpenAI's video generation model. Creates realistic and imaginative videos from text prompts with impressive temporal coherence.

€0.65
text-to-videorealisticflagship

Suno v4

AudioSuno
NewPopular

Suno's latest music AI. Generates complete songs with lyrics, vocals, and instrumentation. Supports many genres and custom lyrics.

€0.07
musiclyricspopular

text-embedding-3-large

EmbeddingOpenAI
Popular

OpenAI's most capable embedding model. 3072 dimensions with excellent retrieval performance for RAG and semantic search.

€0.001
3072 dimRAGhigh-quality

Udio

AudioUdio
Popular

AI music generation platform creating full songs with vocals and lyrics from text descriptions. Wide range of genres and styles.

€0.07
musicvocalsfull-songs

Veo 2

VideoGoogle DeepMind
NewPopular

Google DeepMind's video generation model. Creates high-fidelity, 1080p videos with strong understanding of physics and motion.

€0.52
1080pphysics-awareGoogle

Wan 2.1 14B

VideoWan AI
NewPopular

High-quality video generation from Wan AI. 14B parameter model creating detailed, coherent videos from text.

€0.52
high-quality14Btext-to-video

Whisper Large v3

STTOpenAI
Popular

OpenAI's state-of-the-art speech recognition model. Supports 100+ languages with exceptional accuracy for transcription and translation.

€0.007
100+ languagesaccuratepopular

AnimagineXL 3.1

ImageCagliostro Lab

Anime-focused SDXL fine-tune. High-quality anime and manga-style illustration generation.

€0.01
animeillustrationmanga

AnimateDiff

VideoCommunity

Animate images and create short video animations. Works with existing SD models for animated content.

€0.13
animationSD-compatibleshort-clips

AssemblyAI Universal-2

STTAssemblyAI

AssemblyAI's latest speech model. Excellent accuracy across accents and noisy environments with built-in speaker diarization.

€0.01
diarizationnoise-robustaccurate

AudioLDM 2

AudioAudioLDM

Audio generation from text descriptions. Create sound effects, music, and ambient audio from natural language.

€0.02
sound-effectsambienttext-to-audio

AuraFlow

ImageFal.ai

Open-source flow-based image generation model. Lightweight with fast generation and good quality output.

€0.01
flow-basedlightweightopen-source

Bark

AudioSuno

Suno's open-source TTS model. Generate realistic speech with laughter, music, and sound effects from text.

€0.03
TTSeffectsexpressive

BGE-M3

EmbeddingCustom

BAAI's versatile embedding model supporting dense, sparse, and multi-vector retrieval. Open-source and highly effective.

€0.001
open-sourcemulti-retrievalBAAI

BLIP-2

ImageSalesforce

Salesforce's image captioning model. Generate detailed descriptions of images with natural language.

€0.007
captioningdescriptionVQA

Clarity Upscaler

ImageCommunity

AI image upscaler with creative enhancement. Adds detail and clarity while upscaling images up to 4x.

€0.03
upscalerenhancement4x

Claude 3.5 Haiku

Text & ChatAnthropic

Anthropic's fastest and most affordable model. Ideal for high-volume tasks, customer support, and quick responses.

€0.007
fastaffordablehigh-volume

Claude 3.5 Sonnet

Text & ChatAnthropic

Previous generation balanced model from Anthropic. Still excellent for many tasks including coding, analysis, and creative writing.

€0.02
balancedcodingpopular

Claude 3.5 Sonnet (Vision)

MultimodalAnthropic

Claude's vision capabilities. Excellent at analyzing images, documents, and code screenshots with detailed, accurate descriptions.

€0.02
visiondocumentscode-screenshots

CLIP Interrogator

ImageCommunity

Generate text prompts from images. Reverse-engineer prompts that could reproduce a given image.

€0.005
prompt-generationreverse-engineerCLIP

CodeFormer

ImageCommunity

AI face restoration model. Restore severely degraded face photos with high fidelity.

€0.007
face-restorationrepairenhancement

CodeLlama 70B

CodeMeta

Meta's largest code-specialized Llama model. Trained on code-heavy data with strong performance on code generation and infilling.

€0.004
open-sourceMetainfilling

Codestral

Text & ChatMistral AI

Mistral's dedicated code model. Trained specifically for code generation, completion, and understanding across 80+ programming languages.

€0.004
code80+ languagesspecialized

CogVideoX

VideoReplicate

Open-source video generation model from Tsinghua University. Generates coherent videos from text with strong temporal consistency.

€0.39
open-sourceresearchtext-to-video

CogVideoX-5B

VideoCommunity

Tsinghua's 5B parameter video model. Creates coherent text-to-video with strong temporal consistency.

€0.39
5Btext-to-videocoherent

CogVLM

MultimodalCommunity

Powerful visual language model from Tsinghua. Deep image understanding with detailed visual reasoning.

€0.005
visionreasoningdetailed

Cohere Embed v3

EmbeddingCustom

Cohere's multilingual embedding model. Supports 100+ languages with separate search and classification modes.

€0.001
multilingualsearchCohere

Command R

Text & ChatCohere

Cohere's efficient model optimized for RAG and tool use. Great balance of quality and cost for production deployments.

€0.003
RAGefficientCohere

Command R+

Text & ChatCohere

Cohere's flagship model for enterprise RAG applications. Excellent at retrieval-augmented generation, summarization, and multi-step tasks.

€0.01
enterpriseRAGCohere

Consistent Character

ImageCommunity

Generate consistent character images across different poses and scenes from a single reference.

€0.07
characterconsistentposes

ControlNet SDXL

ImageCommunity

Controlled image generation with SDXL. Use depth, pose, canny edge, and other control signals.

€0.02
controlnetguidedprecision

DBRX Instruct

Text & ChatDatabricks

Databricks' open-source MoE model with 132B total parameters. Strong at enterprise tasks, SQL, and data-related queries.

€0.004
open-sourceMoEenterprise

Deepgram Nova 2

STTDeepgram

Deepgram's most accurate ASR model. Optimized for real-time transcription with industry-leading word error rates.

€0.007
real-timeaccurateenterprise

DeepSeek Coder V2

CodeDeepSeek

DeepSeek's dedicated coding model. Specialized for code generation, completion, and debugging across many programming languages.

€0.003
open-sourcemulti-languagespecialized

Depth Anything V2

ImageCommunity

State-of-the-art monocular depth estimation. Generate accurate depth maps from single images.

€0.005
depth3Destimation

Dolphin 2.5 Mixtral

Text & ChatCognitive Computations

Uncensored Mixtral fine-tune. Open-ended assistant without content restrictions for research purposes.

€0.003
uncensoredopen-endedresearch

DreamShaper XL

ImageCommunity

Versatile fine-tuned SDXL model. Excels at both realistic and stylized image generation with rich details.

€0.01
versatilestylizeddetailed

ElevenLabs Turbo v2.5

TTSElevenLabs

Low-latency TTS model from ElevenLabs. Optimized for real-time applications with natural-sounding output.

€0.03
low-latencyreal-timefast

Face to Sticker

ImageCommunity

Convert face photos to cartoon stickers. Fun, expressive sticker generation from selfies.

€0.01
stickercartoonfun

Florence 2

MultimodalMicrosoft

Microsoft's foundation vision model. Object detection, captioning, segmentation, and OCR in one model.

€0.003
multi-taskMicrosoftOCR

Flux Canny

ImageBlack Forest Labs

Edge-conditioned image generation. Use canny edge detection maps to control the structure of generated images.

€0.04
edge-detectionstructuralcontrol

Flux Depth

ImageBlack Forest Labs

Depth-conditioned image generation from Black Forest Labs. Generate images with precise 3D structural control.

€0.04
depth-control3Dstructural

Flux Dev

ImageBlack Forest Labs

Development version of Flux with high quality generation. Open-weight model suitable for fine-tuning and customization.

€0.03
open-weightcustomizablehigh-quality

Flux Fill

ImageBlack Forest Labs

Inpainting and outpainting model from Black Forest Labs. Edit and extend images seamlessly with text guidance.

€0.04
inpaintingoutpaintingediting

Flux Redux

ImageBlack Forest Labs

Image variation and remix model. Create new images inspired by reference images with text-guided modifications.

€0.03
image-variationremixstyle-transfer

Flux Schnell

ImageBlack Forest Labs

Ultra-fast image generation from Black Forest Labs. Generates images in under 2 seconds while maintaining good quality.

€0.004
fastopen-sourceefficient

Frame Interpolation (FILM)

VideoGoogle Research

Google's frame interpolation model. Create smooth slow-motion by generating intermediate frames.

€0.01
slow-motioninterpolationsmooth

Gemini 2.0 Flash

Text & ChatGoogle DeepMind

Google's fast, versatile multimodal model. Supports text, images, audio, and video inputs. Great balance of speed and capability.

€0.007
fastmultimodalversatile

Gemini 2.0 Flash Lite

Text & ChatGoogle DeepMind

Google's most cost-efficient model. Optimized for high-volume, lower-complexity tasks with excellent throughput.

€0.003
affordablefastefficient

Gemini Robotics

RoboticsGoogle DeepMind
New

Google DeepMind's Gemini model adapted for robotics. Leverages Gemini's multimodal understanding for zero-shot robot task planning and execution.

€0.10
Geminizero-shottask-planning

Gemma 2 27B

Text & ChatGoogle DeepMind

Google's open-source 27B model. Strong performance in reasoning and text generation, built with Google's research expertise.

€0.003
open-sourceGoogleefficient

Gemma 2 9B

Text & ChatGoogle DeepMind

Compact open-source model from Google. Excellent for on-device deployment and resource-constrained environments.

€0.001
open-sourcecompactGoogle

Gemma 2 9B (Replicate)

Text & ChatGoogle DeepMind

Google's compact open model on Replicate. Efficient 9B model with strong general capabilities.

€0.002
Googlecompactefficient

GFPGAN

ImageTencent ARC

Face restoration model. Fix and enhance degraded face photos with realistic detail recovery.

€0.007
face-restorationenhancementrepair

GPT-4o Mini

Text & ChatOpenAI

Small, fast, and affordable model from OpenAI. Great for lightweight tasks like classification, summarization, and simple Q&A.

€0.003
fastaffordablelightweight

Grok 2

Text & ChatxAI

xAI's previous flagship model. Known for its witty personality, strong reasoning, and ability to handle nuanced questions.

€0.01
xAIwittyreasoning

Grok 3 Mini

Text & ChatxAI
New

Smaller, faster version of Grok 3. Excellent for quick responses and lower-cost applications while maintaining strong capabilities.

€0.01
xAIfastaffordable

Grounding DINO

ImageIDEA Research

Open-set object detection with text prompts. Detect any object by describing it in natural language.

€0.005
detectiongroundingopen-set

Helix

RoboticsFigure AI
New

Figure AI's VLA model powering their humanoid robots. Combines language understanding with full-body motion planning for household and industrial tasks.

€0.13
humanoidFigure AIfull-body

Hunyuan Video

VideoTencent
New

Tencent's open-source video generation model. Creates high-quality videos with strong motion coherence.

€0.39
Tencentopen-sourcehigh-quality

Ideogram 2.0

ImageIdeogram

Ideogram's latest model excelling at typography and text in images. Best-in-class text rendering in generated images.

€0.07
typographytext-renderingcreative

Ideogram V2 Turbo

ImageIdeogram
New

Fast version of Ideogram's text-rendering model. Quick generation with excellent typography capabilities.

€0.03
typographyfasttext-rendering

Illusion Diffusion

ImageCommunity

Create optical illusion images. Generate images that contain hidden patterns and visual tricks.

€0.01
illusionscreativepatterns

Img2Img SDXL

ImageStability AI

Image-to-image translation with SDXL. Transform and modify existing images with text guidance.

€0.01
img2imgeditingtransformation

Incredibly Fast Whisper

STTCommunity

Optimized Whisper model for ultra-fast transcription. 10x faster than standard Whisper with comparable accuracy.

€0.005
fastoptimizedefficient

InstantID

ImageCommunity

Zero-shot identity-preserving generation. Create images of a person in any style using just one reference photo.

€0.03
face-swapidentityzero-shot

InternVL 2

MultimodalInternVL

Open-source vision-language model rivaling GPT-4V. Strong visual understanding across diverse domains.

€0.005
GPT-4V rivalopen-source26B

IP-Adapter FaceID

ImageCommunity

Face-preserving image generation using IP-Adapter. Generate images that maintain facial identity from reference photos.

€0.03
face-preservingidentityadapter

Jina Embeddings v3

EmbeddingCustom

Jina AI's latest embedding model with task-specific adapters. Supports flexible dimensions and multiple retrieval tasks.

€0.001
flexible-dimstask-adaptersopen-source

Juggernaut XL

ImageCommunity

Popular fine-tuned SDXL model. Known for photorealistic outputs, especially portraits and landscapes.

€0.01
photorealisticfine-tunedportraits

Kandinsky 2.2

ImageAI Forever

Open-source multilingual text-to-image model. Supports prompts in multiple languages with good creative output.

€0.01
multilingualopen-sourcecreative

Kling 1.5

VideoKuaishou

Kuaishou's video generation model. Creates high-quality videos with good motion consistency and diverse styles.

€0.39
high-qualitydiverse-stylesopen

Kolors

ImageKwai

Kwai's photorealistic image generation model. Strong at generating realistic human portraits and scenes.

€0.01
photorealisticportraitsKwai

Llama 3.1 405B

Text & ChatMeta

Meta's largest open-source model. 405 billion parameters delivering frontier-class performance on reasoning, coding, and multilingual tasks.

€0.007
open-source405Bfrontier-class

Llama 3.1 70B

Text & ChatMeta

Meta's highly capable 70B open-source model. Great balance of performance and efficiency for a wide range of tasks.

€0.004
open-sourcebalancedpopular

Llama 3.1 70B (Replicate)

Text & ChatMeta

Meta's popular 70B model on Replicate. Strong all-around performance for chat, coding, and reasoning.

€0.004
70Bpopularversatile

Llama 3.1 8B

Text & ChatMeta

Meta's compact 8B model. Surprisingly capable for its size, perfect for fast inference, edge deployment, and cost-sensitive applications.

€0.001
open-sourcecompactfast

Llama 3.1 8B (Replicate)

Text & ChatMeta

Efficient 8B Llama model on Replicate. Fast and affordable for straightforward tasks.

€0.001
fastaffordable8B

Llama 3.2 11B Vision

Text & ChatMeta

Compact multimodal Llama 3.2. Vision-language model for efficient image understanding and text generation.

€0.002
multimodalcompactefficient

Llama 3.2 1B

Text & ChatMeta

Smallest Llama model for on-device inference. 1B parameters, ideal for mobile and IoT applications.

€0.001
tinyon-device1B

Llama 3.2 3B

Text & ChatMeta

Ultra-compact Llama model for edge deployment. 3B parameters with surprising capability for its size.

€0.001
edgecompact3B

Llama 3.2 90B Vision

Text & ChatMeta

Meta's multimodal Llama 3.2. 90B parameter model with native image understanding and text generation.

€0.005
multimodal90BMeta

Llama 4 Scout

Text & ChatMeta
New

Meta's next-generation Llama 4 model optimized for efficiency. Built on a new architecture with improved reasoning and instruction following.

€0.003
open-sourcenext-genefficient

LLARVA

RoboticsLLARVA

Vision-Language-Action model using LLM backbones for structured robot action prediction. Bridges language models and low-level robot control.

€0.05
LLM-basedstructured-actionsresearch

LLaVA 1.6 34B

MultimodalTogether AI

Open-source multimodal model combining language and vision. Strong visual understanding with conversational capabilities.

€0.004
open-sourcevisionconversational

LLaVA v1.6 13B

MultimodalCommunity

Open-source multimodal model. Analyze and describe images with natural language understanding.

€0.003
visionopen-sourceanalysis

Logo Generator SDXL

ImageCommunity

Logo and icon generation using fine-tuned SDXL. Create professional logos and brand assets.

€0.01
logobrandingdesign

LTX Video

VideoLightricks
New

Lightweight text-to-video model. Fast generation with reasonable quality for prototyping and previews.

€0.20
fastlightweighttext-to-video

Luma Dream Machine

VideoLuma AI

Luma AI's video generation model. Creates dreamy, cinematic videos with excellent visual quality and creative flexibility.

€0.39
cinematiccreativehigh-quality

Material Transfer

ImageCommunity

Transfer materials and textures between images. Apply the material of one image onto objects in another.

€0.02
texturematerialtransfer

Minimax Image

ImageMinimax
New

Minimax's text-to-image model. High-quality image generation with strong prompt understanding.

€0.01
high-qualityMinimaxversatile

Minimax Video-01

VideoMinimax

Minimax's video generation model supporting up to 720p resolution. Good for short-form video content creation.

€0.39
720pshort-formaccessible

MiniMax Video-01 (Replicate)

VideoMinimax
New

Minimax's video model on Replicate. Generate short videos from text descriptions.

€0.39
text-to-videoMinimaxshort-form

Mistral Large 2

Text & ChatMistral AI

Mistral's most capable model. 123B parameters with strong reasoning, multilingual support, and function calling. Great for complex enterprise tasks.

€0.01
enterprisemultilingualfunction-calling

Mistral Medium

Text & ChatMistral AI

Mid-range model from Mistral AI offering a good balance of performance and cost for most business applications.

€0.004
balancedbusinessmultilingual

Mistral Nemo

Text & ChatMistral AI

12B open-weight model by Mistral and NVIDIA. Compact but capable, ideal for on-device or self-hosted deployments.

€0.003
open-weightcompactself-hosted

Mistral Small

Text & ChatMistral AI

Mistral's efficient small model. Fast and cost-effective for straightforward tasks like classification, text generation, and RAG.

€0.001
fastaffordableefficient

Mixtral 8x7B

Text & ChatMistral AI

Mistral's MoE model with 8 experts. Strong performance with efficient inference using sparse architecture.

€0.003
MoEefficientMistral

Mochi 1

VideoGenmo

Genmo's video generation model. Creative video generation with artistic style flexibility.

€0.26
creativeartisticGenmo

Moondream 2

MultimodalCommunity

Tiny but capable vision-language model. Only 1.8B params yet surprisingly good at image understanding.

€0.001
tiny1.8Befficient

MusicGen Large

AudioMeta

Meta's open-source music generation model. Creates high-quality music from text descriptions with control over style, tempo, and instruments.

€0.04
musicopen-sourceMeta

MusicGen Melody

AudioMeta

Meta's music generation with melody conditioning. Create music that follows a reference melody while matching text descriptions.

€0.04
musicmelody-guidedMeta

MusicGen Stereo Large

AudioMeta

Stereo music generation from Meta. Creates high-quality stereo music tracks from text descriptions.

€0.05
musicstereohigh-quality

Nous Hermes 2 Mixtral

Text & ChatNous Research

Nous Research fine-tune of Mixtral. Enhanced instruction following and conversational quality.

€0.003
fine-tunedinstructionsconversational

o1 Mini

Text & ChatOpenAI

Smaller, faster version of OpenAI's o1 reasoning model. Optimized for STEM tasks with lower latency and cost.

€0.02
reasoningfastSTEM

o3 Mini

Text & ChatOpenAI
New

OpenAI's latest small reasoning model. Highly efficient chain-of-thought reasoning with excellent cost-performance ratio.

€0.01
reasoningefficientnew

OCR with GPT-4o

MultimodalCommunity

Accurate text extraction from images using GPT-4o vision. Extract text, tables, and structured data.

€0.01
OCRtext-extractiontables

Octo

RoboticsUC Berkeley

Open-source generalist robot policy from UC Berkeley. Supports multiple robot embodiments and can be fine-tuned for new tasks with minimal data.

€0.07
open-sourcegeneralistfine-tunable

OpenAI TTS-1

TTSOpenAI

OpenAI's standard TTS model. Fast and affordable text-to-speech synthesis with good quality for most applications.

€0.02
fastaffordablestandard

OpenAI TTS-1 HD

TTSOpenAI

OpenAI's high-definition text-to-speech model. Natural, human-like voice synthesis with 6 preset voices.

€0.04
HD quality6 voicesnatural

Outpainting SDXL

ImageCommunity

Extend images beyond their borders. Seamlessly expand the canvas of any image with AI-generated content.

€0.02
outpaintingextensioncreative

Phi-4

Text & ChatMicrosoft
New

Microsoft's small but mighty 14B model. Punches well above its weight class on reasoning, math, and coding benchmarks.

€0.001
open-sourceefficientMicrosoft

PhotoMaker v2

ImageTencent ARC

Customizable realistic photo generation. Create photos of specific people in different scenes and styles.

€0.03
personalizationportraitscustomizable

Pi0.5

RoboticsPhysical Intelligence
New

Physical Intelligence's latest VLA model with improved generalization. Handles complex multi-step manipulation tasks with fewer demonstrations.

€0.13
next-genmulti-stepPhysical Intelligence

Pika 2.0

VideoPika

Pika's latest video model with improved motion quality and generation speed. User-friendly interface for video creation.

€0.26
user-friendlyfastcreative

PixArt Sigma

ImagePixArt

Efficient transformer-based image generation. High-quality 4K images with excellent text rendering capabilities.

€0.01
transformer4Ktext-rendering

Pixtral Large

MultimodalMistral AI

Mistral's vision-language model. 124B parameters with native image understanding, document analysis, and visual reasoning.

€0.01
vision124Bdocument-analysis

Playground v2.5

ImagePlayground AI

Playground AI's aesthetic-focused model. Trained for beautiful, photorealistic images with excellent color and composition.

€0.01
aestheticphotorealisticfree

Playground v3

ImagePlayground AI

Playground AI's latest model focused on photorealistic image generation with strong aesthetic quality and prompt adherence.

€0.01
photorealisticaestheticfree-tier

QR Code Generator

ImageCommunity

AI-powered artistic QR code generator. Create beautiful, functional QR codes with custom designs.

€0.03
QR-codedesignfunctional

Qwen 2.5 72B

Text & ChatAlibaba / Qwen

Alibaba's flagship 72B model. Exceptional at Chinese and English tasks, strong coding abilities, and competitive with leading closed-source models.

€0.004
open-sourcemultilingualChinese

Qwen 2.5 7B

Text & ChatAlibaba / Qwen

Compact 7B model from Alibaba's Qwen series. Fast and efficient while maintaining strong multilingual and coding capabilities.

€0.001
open-sourcecompactmultilingual

Qwen VL Plus

MultimodalCommunity

Alibaba's vision-language model. Strong at document understanding, charts, and multilingual visual QA.

€0.003
documentschartsmultilingual

Qwen2.5-Coder 32B

CodeAlibaba / Qwen
New

Alibaba's specialized coding model. Strong performance on code benchmarks with support for many programming languages.

€0.003
open-sourcecodingAlibaba

QwQ 32B

Text & ChatAlibaba / Qwen
New

Alibaba's reasoning model. Uses chain-of-thought to solve complex math, logic, and coding problems. Open-weight alternative to o1.

€0.003
reasoningopen-sourcemath

Real-ESRGAN

ImageCommunity

Powerful image upscaler using enhanced ESRGAN. Upscale images 2-4x with excellent detail preservation and artifact removal.

€0.007
upscalersuper-resolutionenhancement

Realistic Vision XL

ImageCommunity

SDXL fine-tune optimized for photorealism. Creates stunning realistic photos from text descriptions.

€0.01
photorealisticfine-tunedrealistic

Recraft V3

ImageRecraft

Recraft's SVG and design-focused generation model. Creates vector graphics, icons, and design assets from text descriptions.

€0.05
designSVGvector

Remove Background

ImageCommunity

Automatic background removal. Remove backgrounds from images with high accuracy and clean edges.

€0.005
background-removaleditingfast

Riffusion

AudioRiffusion

Real-time music generation through spectrograms. Create music by interpolating between text prompts.

€0.01
musicspectrogramsreal-time

RoboFlamingo

RoboticsRoboFlamingo

Robotics adaptation of the Flamingo vision-language model. Few-shot learning for robot tasks using language-conditioned visuomotor policies.

€0.05
few-shotFlamingo-basedresearch

RT-X

RoboticsGoogle DeepMind

Cross-embodiment robotic foundation model from the Open X-Embodiment collaboration. Trained on data from 22 robot types for generalized manipulation.

€0.10
cross-embodimentopen-datamanipulation

SDXL Lightning

ImageByteDance

Distilled SDXL model for ultra-fast generation. Creates high-quality images in 1-4 steps.

€0.005
ultra-fast1-4 stepsdistilled

Segment Anything (SAM)

ImageMeta

Meta's universal image segmentation model. Automatically detect and segment any object in images.

€0.01
segmentationdetectionMeta

Snowflake Arctic

Text & ChatSnowflake

Snowflake's enterprise-focused LLM. Optimized for SQL generation, data analysis, and enterprise coding tasks.

€0.004
enterpriseSQLdata-analysis

SpatialVLA

RoboticsSpatialVLA

VLA model with explicit 3D spatial reasoning. Uses depth perception and spatial understanding for more precise robotic manipulation.

€0.08
3D-spatialdepthprecision

Stable Audio

AudioUdio

Stability AI's audio generation model. Creates music and sound effects from text prompts with customizable duration.

€0.03
musicsound-effectscustomizable

Stable Audio Open

AudioStability AI

Stability AI's open-source audio model. Generate music and sound effects up to 47 seconds.

€0.02
musicsound-effectsopen-source

Stable Diffusion 3 Medium

ImageStability AI

Stability AI's efficient SD3 model. Good balance of quality and speed for general-purpose image generation.

€0.04
balancedefficientgeneral-purpose

Stable Diffusion 3.5 Large

ImageStability AI

Stability AI's latest open-source image model. 8B parameter model with improved prompt adherence, typography, and photorealism.

€0.05
open-source8B paramstypography

Stable Diffusion 3.5 Large Turbo

ImageStability AI

Accelerated version of SD3.5 Large. Few-step generation for near real-time image creation.

€0.03
turbofastfew-step

Stable Diffusion 3.5 Medium

ImageStability AI

Mid-size variant of SD3.5. Faster generation while maintaining strong visual quality and prompt adherence.

€0.04
mediumfastquality

Stable Diffusion XL

ImageStability AI

Stability AI's popular SDXL model. Widely adopted, extensive community support, and thousands of fine-tuned variants available.

€0.01
open-sourcepopularfine-tunable

Stable Video Diffusion

VideoStability AI

Stability AI's video generation model. Create short video clips from text or image prompts.

€0.20
text-to-videoStability AIshort-clips

StarCoder2 15B

CodeBigCode

BigCode's open-source code model trained on The Stack v2. Supports 600+ programming languages with strong completion quality.

€0.001
open-source600+ languagesBigCode

Style Transfer

ImageCommunity

Apply artistic styles from one image to another. Neural style transfer for creative image transformation.

€0.01
style-transferartisticcreative

SUPIR Upscaler

ImageCommunity

State-of-the-art AI image upscaler. Practice image restoration and upscaling with incredible detail generation.

€0.04
upscalerrestorationdetail

SwinIR

ImageCommunity

Swin Transformer-based image restoration. Denoising, super-resolution, and JPEG artifact removal.

€0.007
restorationdenoisingartifact-removal

text-embedding-3-small

EmbeddingOpenAI

OpenAI's efficient embedding model. 1536 dimensions with strong performance at lower cost, ideal for most use cases.

€0.000
1536 dimaffordableefficient

Tortoise TTS

TTSCommunity

High-quality multi-speaker TTS. Generates natural speech with voice cloning capabilities from short reference clips.

€0.03
voice-cloningmulti-speakernatural

Video Upscaler

VideoCommunity

AI-powered video upscaling. Enhance video resolution up to 4x with detail preservation.

€0.26
upscalerenhancement4x

Wan 2.1 1.3B

VideoWan AI

Lightweight video generation model. Fast text-to-video generation suitable for quick previews and prototyping.

€0.13
lightweightfasttext-to-video

Wan 2.1 Image-to-Video

VideoWan AI
New

Animate still images into videos. Transform a single image into a dynamic video sequence.

€0.39
image-to-videoanimation14B

Whisper (Replicate)

STTOpenAI

OpenAI's Whisper model on Replicate. Transcribe audio in 100+ languages with word-level timestamps.

€0.007
transcription100+ languagestimestamps

Whisper Diarize

STTCommunity

Whisper with speaker diarization. Transcribe conversations and identify individual speakers.

€0.01
diarizationspeakersconversations

XTTS-v2

TTSCommunity

Coqui's cross-lingual TTS model. Generate speech in 17 languages using voice cloning from a short reference clip.

€0.03
multilingualvoice-cloning17 languages

Yi Lightning

Text & Chat01.AI

01.AI's fast inference model. Optimized for speed with competitive quality, ideal for real-time applications.

€0.001
open-sourcefast01.AI

Start Building with AI

Access all models through a single API. OpenAI-compatible, no vendor lock-in.