Text & Chat Models
Powerful language models for conversation, analysis, and content generation
42 models available
Claude Opus 4
Anthropic's most powerful model. Exceptional at complex analysis, agentic tasks, and extended reasoning.
Claude Sonnet 4
Anthropic's most capable model. Excellent for complex analysis, coding, math, and creative writing.
Gemini 2.0 Flash
Google's fastest multimodal model. Supports text, images, audio, and video input.
Gemini 2.5 Pro
Google's latest thinking model. Excels at reasoning, coding, math, and science with massive context window.
GPT-4.1
OpenAI's newest flagship model. Improved reasoning, instruction following, and coding over GPT-4o.
GPT-4.5 Preview
OpenAI's latest frontier model with improved reasoning, creativity, and instruction following. Significant improvements over GPT-4o.
GPT-4o
OpenAI's most capable multimodal model. Excellent for complex reasoning, coding, and creative tasks.
Llama 4 Maverick
Meta's powerful Llama 4 Maverick model. A larger, more capable variant with strong reasoning, creative writing, and multilingual abilities.
o1
OpenAI's reasoning model that thinks before answering. Uses chain-of-thought to solve complex math, science, and coding problems.
o3-mini
OpenAI's reasoning model optimized for STEM tasks, coding, and math. Uses chain-of-thought reasoning.
Claude 3.5 Haiku
Anthropic's fastest and most affordable model. Ideal for high-volume tasks, customer support, and quick responses.
Claude 3.5 Sonnet
Previous generation balanced model from Anthropic. Still excellent for many tasks including coding, analysis, and creative writing.
Claude Haiku 3.5
Anthropic's fast and affordable model. Great for quick tasks, summarization, and simple coding.
Command R
Cohere's efficient model optimized for RAG and tool use. Great balance of quality and cost for production deployments.
Command R+
Cohere's flagship model for enterprise RAG applications. Excellent at retrieval-augmented generation, summarization, and multi-step tasks.
DBRX Instruct
Databricks' open-source MoE model with 132B total parameters. Strong at enterprise tasks, SQL, and data-related queries.
DeepSeek R1
DeepSeek's reasoning model with chain-of-thought capabilities. Excellent for complex problem-solving.
DeepSeek V3
Powerful open-weight model from DeepSeek. Strong at coding, math, and Chinese/English tasks.
Gemini 2.0 Flash
Google's fast, versatile multimodal model. Supports text, images, audio, and video inputs. Great balance of speed and capability.
Gemini 2.0 Flash Lite
Google's most cost-efficient model. Optimized for high-volume, lower-complexity tasks with excellent throughput.
Gemma 2 27B
Google's open-source 27B model. Strong performance in reasoning and text generation, built with Google's research expertise.
Gemma 2 9B
Compact open-source model from Google. Excellent for on-device deployment and resource-constrained environments.
GPT-4o Mini
Small, fast, and affordable model for lightweight tasks. Great balance of speed and capability.
Grok 2
xAI's previous flagship model. Known for its witty personality, strong reasoning, and ability to handle nuanced questions.
Grok 3
xAI's flagship model. Strong at reasoning, coding, and real-time knowledge with web search capabilities.
Grok 3 Mini
Smaller, faster version of Grok 3. Excellent for quick responses and lower-cost applications while maintaining strong capabilities.
Llama 3.1 405B
Meta's largest open-source model. 405 billion parameters delivering frontier-class performance on reasoning, coding, and multilingual tasks.
Llama 3.1 70B
Meta's highly capable 70B open-source model. Great balance of performance and efficiency for a wide range of tasks.
Llama 3.1 8B
Meta's compact 8B model. Surprisingly capable for its size, perfect for fast inference, edge deployment, and cost-sensitive applications.
Llama 3.3 70B
Meta's open-source 70B parameter model. Strong all-around performance with multilingual support.
Llama 4 Scout
Meta's next-generation Llama 4 model optimized for efficiency. Built on a new architecture with improved reasoning and instruction following.
Mistral Large
Mistral's flagship model. Strong reasoning, multilingual, and coding capabilities.
Mistral Large 2
Mistral's most capable model. 123B parameters with strong reasoning, multilingual support, and function calling. Great for complex enterprise tasks.
Mistral Medium
Mid-range model from Mistral AI offering a good balance of performance and cost for most business applications.
Mistral Nemo
12B open-weight model by Mistral and NVIDIA. Compact but capable, ideal for on-device or self-hosted deployments.
Mistral Small
Mistral's efficient small model. Fast and cost-effective for straightforward tasks like classification, text generation, and RAG.
o1 Mini
Smaller, faster version of OpenAI's o1 reasoning model. Optimized for STEM tasks with lower latency and cost.
Phi-4
Microsoft's small but mighty 14B model. Punches well above its weight class on reasoning, math, and coding benchmarks.
Qwen 2.5 72B
Alibaba's powerful open-source model. Excellent at coding, math, and multilingual tasks.
Qwen 2.5 7B
Compact 7B model from Alibaba's Qwen series. Fast and efficient while maintaining strong multilingual and coding capabilities.
QwQ 32B
Alibaba's reasoning model. Uses chain-of-thought to solve complex math, logic, and coding problems. Open-weight alternative to o1.
Yi Lightning
01.AI's fast inference model. Optimized for speed with competitive quality, ideal for real-time applications.
Start Building with AI
Access all models through a single API. Get free credits when you sign up — no credit card required.