Text &amp; Chat Models

Anthropic's most powerful model. Exceptional at complex analysis, agentic tasks, and extended reasoning.

0.15/1K tokens5.0s

flagshipreasoningagentic

Claude Sonnet 4

Popular

Anthropic's most capable model. Excellent for complex analysis, coding, math, and creative writing.

0.03/1K tokens3.0s

popularcodinganalysis

Gemini 2.0 Flash

Google's fastest multimodal model. Supports text, images, audio, and video input.

0.001/1K tokens1.2s

fastmultimodalaffordable

Gemini 2.5 Pro

reasoningcodingmultimodal

Google's latest thinking model. Excels at reasoning, coding, math, and science with massive context window.

0.0125/1K tokens4.0s

GPT-4.1

OpenAI's newest flagship model. Improved reasoning, instruction following, and coding over GPT-4o.

0.02/1K tokens2.5s

popularcodingreasoning

GPT-4.5 Preview

OpenAI's latest frontier model with improved reasoning, creativity, and instruction following. Significant improvements over GPT-4o.

frontierreasoningcreative

GPT-4o

Popular

OpenAI's most capable multimodal model. Excellent for complex reasoning, coding, and creative tasks.

0.025/1K tokens2.0s

popularfastmultimodal

Llama 4 Maverick

Meta's powerful Llama 4 Maverick model. A larger, more capable variant with strong reasoning, creative writing, and multilingual abilities.

open-sourcenext-genpowerful

o1

Popular

OpenAI's reasoning model that thinks before answering. Uses chain-of-thought to solve complex math, science, and coding problems.

reasoningmathscience

o3-mini

OpenAI's reasoning model optimized for STEM tasks, coding, and math. Uses chain-of-thought reasoning.

0.011/1K tokens10.0s

reasoningcodingmath

Claude 3.5 Haiku

Anthropic's fastest and most affordable model. Ideal for high-volume tasks, customer support, and quick responses.

fastaffordablehigh-volume

Claude 3.5 Sonnet

Previous generation balanced model from Anthropic. Still excellent for many tasks including coding, analysis, and creative writing.

balancedcodingpopular

Claude Haiku 3.5

Anthropic's fast and affordable model. Great for quick tasks, summarization, and simple coding.

0.008/1K tokens1.0s

fastaffordable

Command R

Cohere's efficient model optimized for RAG and tool use. Great balance of quality and cost for production deployments.

RAGefficientCohere

Command R+

Cohere's flagship model for enterprise RAG applications. Excellent at retrieval-augmented generation, summarization, and multi-step tasks.

enterpriseRAGCohere

DBRX Instruct

Databricks' open-source MoE model with 132B total parameters. Strong at enterprise tasks, SQL, and data-related queries.

open-sourceMoEenterprise

DeepSeek R1

DeepSeek

DeepSeek's reasoning model with chain-of-thought capabilities. Excellent for complex problem-solving.

0.0055/1K tokens8.0s

reasoningmath

DeepSeek V3

DeepSeek

Powerful open-weight model from DeepSeek. Strong at coding, math, and Chinese/English tasks.

0.0014/1K tokens2.0s

affordablecoding

Gemini 2.0 Flash

Google's fast, versatile multimodal model. Supports text, images, audio, and video inputs. Great balance of speed and capability.

fastmultimodalversatile

Gemini 2.0 Flash Lite

Google's most cost-efficient model. Optimized for high-volume, lower-complexity tasks with excellent throughput.

affordablefastefficient

Gemma 2 27B

Google's open-source 27B model. Strong performance in reasoning and text generation, built with Google's research expertise.

open-sourceGoogleefficient

Gemma 2 9B

Compact open-source model from Google. Excellent for on-device deployment and resource-constrained environments.

open-sourcecompactGoogle

GPT-4o Mini

Small, fast, and affordable model for lightweight tasks. Great balance of speed and capability.

0.0015/1K tokens800ms

fastaffordable

Grok 2

xAI's previous flagship model. Known for its witty personality, strong reasoning, and ability to handle nuanced questions.

xAIwittyreasoning

Grok 3

xAI's flagship model. Strong at reasoning, coding, and real-time knowledge with web search capabilities.

0.03/1K tokens3.0s

reasoningreal-time

Grok 3 Mini

Smaller, faster version of Grok 3. Excellent for quick responses and lower-cost applications while maintaining strong capabilities.

xAIfastaffordable

Llama 3.1 405B

Meta's largest open-source model. 405 billion parameters delivering frontier-class performance on reasoning, coding, and multilingual tasks.

open-source405Bfrontier-class

Llama 3.1 70B

Meta's highly capable 70B open-source model. Great balance of performance and efficiency for a wide range of tasks.

open-sourcebalancedpopular

Llama 3.1 8B

Meta's compact 8B model. Surprisingly capable for its size, perfect for fast inference, edge deployment, and cost-sensitive applications.

open-sourcecompactfast

Llama 3.3 70B

Meta's open-source 70B parameter model. Strong all-around performance with multilingual support.

0.0088/1K tokens2.5s

open-sourcepopular

Llama 4 Scout

Meta's next-generation Llama 4 model optimized for efficiency. Built on a new architecture with improved reasoning and instruction following.

open-sourcenext-genefficient

Mistral Large

Mistral's flagship model. Strong reasoning, multilingual, and coding capabilities.

0.02/1K tokens2.5s

multilingualcoding

Mistral Large 2

Mistral's most capable model. 123B parameters with strong reasoning, multilingual support, and function calling. Great for complex enterprise tasks.

enterprisemultilingualfunction-calling

Mistral Medium

Mid-range model from Mistral AI offering a good balance of performance and cost for most business applications.

balancedbusinessmultilingual

Mistral Nemo

12B open-weight model by Mistral and NVIDIA. Compact but capable, ideal for on-device or self-hosted deployments.

open-weightcompactself-hosted

Mistral Small

Mistral's efficient small model. Fast and cost-effective for straightforward tasks like classification, text generation, and RAG.

fastaffordableefficient

o1 Mini

Smaller, faster version of OpenAI's o1 reasoning model. Optimized for STEM tasks with lower latency and cost.

reasoningfastSTEM

Phi-4

Microsoft's small but mighty 14B model. Punches well above its weight class on reasoning, math, and coding benchmarks.

open-sourceefficientMicrosoft

Qwen 2.5 72B

open-sourcecodingmultilingual

Alibaba's powerful open-source model. Excellent at coding, math, and multilingual tasks.

0.012/1K tokens2.5s

Qwen 2.5 7B

Compact 7B model from Alibaba's Qwen series. Fast and efficient while maintaining strong multilingual and coding capabilities.

open-sourcecompactmultilingual

QwQ 32B

Alibaba's reasoning model. Uses chain-of-thought to solve complex math, logic, and coding problems. Open-weight alternative to o1.

reasoningopen-sourcemath

Yi Lightning

01.AI's fast inference model. Optimized for speed with competitive quality, ideal for real-time applications.