Text & Chat Models

Powerful language models for conversation, analysis, and content generation

Modelos de texto y chat para cargas de IA en producción

Los grandes modelos de lenguaje son el caballo de batalla de la IA moderna: chatbots, agentes, resumidores, clasificadores, traductores. Es la categoría más concurrida en Railwail — OpenAI, Anthropic, Google, Mistral, Meta, DeepSeek, xAI y decenas de laboratorios de pesos abiertos compiten aquí.

45 models available

Claude Opus 4

Text & ChatAnthropic
NewPopular

Anthropic's most powerful model. Exceptional at complex analysis, agentic tasks, and extended reasoning.

Free5.0s
flagshipreasoningagentic

Claude Sonnet 4

Text & ChatAnthropic
Popular

Anthropic's most capable model. Excellent for complex analysis, coding, math, and creative writing.

Free3.0s
popularcodinganalysis

DeepSeek V3.1

Text & ChatDeepSeek
Popular

DeepSeek's refreshed V3.1 release. 671B MoE / 37B active. Tops open-weights leaderboards on coding and reasoning.

Free
deepseekopen-weightsmoe

DeepSeek V4 Pro

Text & ChatDeepSeek
NewPopular

DeepSeek's April 2026 flagship. 1.6T MoE / 49B active params, 1M context, rivals top closed-source models on STEM and coding at a fraction of the price.

Free
deepseekopen-weightsmoe

Gemini 2.0 Flash

Text & ChatGoogle DeepMind
NewPopular

Google's fastest multimodal model. Supports text, images, audio, and video input.

Free1.2s
fastmultimodalaffordable

Gemini 2.5 Pro

Text & ChatGoogle DeepMind
NewPopular

Google's latest thinking model. Excels at reasoning, coding, math, and science with massive context window.

Free4.0s
reasoningcodingmultimodal

GPT-4.1

Text & ChatOpenAI
NewPopular

OpenAI's newest flagship model. Improved reasoning, instruction following, and coding over GPT-4o.

Free2.5s
popularcodingreasoning

GPT-4o

Text & ChatOpenAI
Popular

OpenAI's most capable multimodal model. Excellent for complex reasoning, coding, and creative tasks.

Free2.0s
popularfastmultimodal

Grok 4

Text & ChatxAI
Popular

xAI's flagship reasoning model with vision and tool use. 256k context, strong at complex reasoning and STEM tasks.

Free
xaiflagshipreasoning

Kimi K2 (Moonshot)

Text & ChatCustom
Popular

Moonshot AI's 1T-parameter MoE model. Industry-leading agentic coding and tool-use benchmarks.

Free
moonshotkimimoe

MiniMax-01

Text & ChatMinimax
Popular

MiniMax's 456B hybrid lightning-attention model with native 4M-token context. Industry-leading long-context.

Free
minimaxlong-contextlightning-attention

o3-mini

Text & ChatOpenAI
NewPopular

OpenAI's reasoning model optimized for STEM tasks, coding, and math. Uses chain-of-thought reasoning.

Free10.0s
reasoningcodingmath

Perplexity Sonar Pro

Text & ChatCustom
Popular

Perplexity's premium web-grounded search model with multi-step reasoning over live sources.

Free
perplexityweb-searchcitations

Qwen 3 235B Instruct

Text & ChatAlibaba / Qwen
Popular

Alibaba's Qwen 3 flagship MoE: 235B total / 22B active. Strong reasoning and tool use, open-weights.

Free
qwenalibabamoe

AI21 Jamba 1.5 Large

Text & ChatCustom

AI21's flagship hybrid Mamba-Transformer model with a 256k context window for long-document tasks.

Free
ai21long-contextmamba

AI21 Jamba 1.5 Mini

Text & ChatCustom

Cost-efficient hybrid Mamba-Transformer model with 256k context. Tuned for high-throughput RAG.

Free
ai21long-contextmamba

Claude Haiku 3.5

Text & ChatAnthropic

Anthropic's fast and affordable model. Great for quick tasks, summarization, and simple coding.

Free1.0s
fastaffordable

Cohere Aya 23 35B

Text & ChatCustom

Open-weights multilingual research model from Cohere covering 23 languages. 35B parameters.

Free
coheremultilingualopen-weights

Cohere Command Light (legacy)

Text & ChatCohere

Cohere's fast lightweight chat model (deprecated Sep 2025). Kept as comparison tombstone.

Free
coherelegacydeprecated

Cohere Command R (08-2024)

Text & ChatCohere

Cohere's mid-tier RAG/tool model. Cost-efficient sibling of Command R+ with 128k context.

Free
cohereragtools

Cohere Command R+ (08-2024)

Text & ChatCohere

Cohere's flagship RAG- and tool-optimized chat model. 128k context, refreshed August 2024.

Free
cohereragtools

DeepSeek R1

Text & ChatDeepSeek
New

DeepSeek's reasoning model with chain-of-thought capabilities. Excellent for complex problem-solving.

Free8.0s
reasoningmath

DeepSeek V3

Text & ChatDeepSeek

Powerful open-weight model from DeepSeek. Strong at coding, math, and Chinese/English tasks.

Free2.0s
affordablecoding

DeepSeek V4 Flash

Text & ChatDeepSeek
New

Efficiency-optimized variant of DeepSeek V4. 284B MoE / 13B active, 1M context, ultra-low pricing for high-throughput workloads.

Free
deepseekopen-weightsmoe

GPT-4o Mini

Text & ChatOpenAI

Small, fast, and affordable model for lightweight tasks. Great balance of speed and capability.

Free800ms
fastaffordable

Grok 3

Text & ChatxAI
New

xAI's flagship model. Strong at reasoning, coding, and real-time knowledge with web search capabilities.

Free3.0s
reasoningreal-time

Llama 3.3 70B

Text & ChatMeta

Meta's open-source 70B parameter model. Strong all-around performance with multilingual support.

Free2.5s
open-sourcepopular

M2M-100 12B

Text & ChatMeta

Meta M2M-100 12B many-to-many translation model. Direct translation between 100 languages without pivoting through English.

€0.006
replicatetranslationmeta

MADLAD-400 3B

Text & ChatGoogle DeepMind

Google MADLAD-400 3B multilingual translation model. 419 languages supported, trained on a 5T-token multilingual corpus with strong low-resource performance.

€0.004
replicatetranslationgoogle

mBART 50 Many-to-Many

Text & ChatMeta

Meta mBART-50 many-to-many translation model. 50 supported languages with strong performance on news and conversational text.

€0.003
replicatetranslationmeta

Microsoft Phi-3.5 MoE Instruct

Text & ChatMicrosoft

Mixture-of-experts Phi-3.5: 42B total / 6.6B active params. 128k context, multilingual.

Free
microsoftopen-weightsmoe

Mistral Large

Text & ChatMistral AI

Mistral's flagship model. Strong reasoning, multilingual, and coding capabilities.

Free2.5s
multilingualcoding

NLLB-200 3B

Text & ChatMeta

Meta's No Language Left Behind 3.3B translation model. Direct translation between any pair of 200+ languages including many low-resource African and Asian languages.

€0.003
replicatetranslationmeta

NLLB-200 Distilled 600M

Text & ChatMeta

Meta's distilled 600M NLLB. Same 200-language coverage as the 3B model with a fraction of the parameters, ideal for edge or high-throughput deployment.

€0.002
replicatetranslationmeta

Nous Hermes 3 405B

Text & ChatTogether AI

Full-parameter fine-tune of Llama 3.1 405B by Nous Research. Steerable, uncensored, strong tool use.

Free
nousopen-weightstools

Nous Hermes 3 70B

Text & ChatTogether AI

Llama-3.1-70B fine-tune from Nous Research with strong tool/agent capabilities and uncensored alignment.

Free
nousopen-weightstools

Perplexity Sonar

Text & ChatCustom

Perplexity's fastest and cheapest web-grounded chat model. Live-source citations included.

Free
perplexityweb-searchcitations

Perplexity Sonar Reasoning

Text & ChatCustom

Perplexity's reasoning model with chain-of-thought and integrated web search.

Free
perplexityweb-searchreasoning

Qwen 2.5 72B

Text & ChatAlibaba / Qwen

Alibaba's powerful open-source model. Excellent at coding, math, and multilingual tasks.

Free2.5s
open-sourcecodingmultilingual

Qwen 2.5-Max

Text & ChatCustom

Alibaba's flagship pretrained MoE model. Top-tier reasoning and code performance via DashScope API.

Free
qwenalibabamoe

SeamlessM4T v2 Large (Text)

Text & ChatMeta

Meta SeamlessM4T v2 Large. Universal multilingual translation across 100+ languages with text-to-text mode for documents and chat.

€0.006
replicatetranslationmeta

Snowflake Arctic Instruct

Text & ChatCustom

Snowflake's open MoE model: 480B total / 17B active params with dense+MoE hybrid architecture.

Free
snowflakemoeopen-weights

TII Falcon 180B Chat

Text & ChatTogether AI

TII's 180B causal decoder chat model fine-tuned on Ultrachat, Platypus and Airoboros.

Free
tiiopen-weightslegacy

TowerInstruct 13B

Text & ChatReplicate

Unbabel TowerInstruct 13B. Llama-2-based multilingual translation and post-editing model. Strong terminology consistency for enterprise localization.

€0.005
replicatetranslationunbabel

Yi Large

Text & ChatCustom

01.AI's larger general-purpose chat model with 32k context window and strong bilingual performance.

Free
01aichinesebilingual

Top text & chat models picks

Hand-picked across four common criteria — resolved against the live catalog so the picks track price and performance changes.

Mejor en general
Claude Opus 4

Anthropic's most powerful model. Exceptional at complex analysis, agentic tasks, and extended reasoning.

Learn more
Más barato
Cohere Command R (08-2024)

Cohere's mid-tier RAG/tool model. Cost-efficient sibling of Command R+ with 128k context.

Learn more
Contexto más largo
MiniMax-01

MiniMax's 456B hybrid lightning-attention model with native 4M-token context. Industry-leading long-context.

Learn more
Más rápido
GPT-4o Mini

Small, fast, and affordable model for lightweight tasks. Great balance of speed and capability.

Learn more

La tarificación en generación de texto es casi siempre por token, dividida en una tarifa de entrada más barata y una tarifa de salida más cara. Un millón de tokens de entrada equivale a unas 750 000 palabras en inglés, por lo que incluso prompts verbosos rara vez superan unos pocos céntimos por llamada. Es en la salida donde las facturas crecen: bucles agentivos que se vuelven a invocar, informes largos y ejemplos few-shot sin caché se acumulan rápidamente. Cachea el prompt de sistema, recorta el historial agresivamente y prefiere el modo JSON para tareas estructuradas para mantener bajos los recuentos de tokens.

El triángulo de compromiso es calidad, latencia y coste. Los modelos punteros (GPT-5, Claude 4.6, Gemini 2.5) cuestan de diez a cincuenta veces más que los niveles económicos (Haiku, Flash, Mini) y responden dos a cuatro veces más despacio, pero razonan más profundo, siguen instrucciones con mayor fiabilidad y alucinan menos. Para clasificación o extracción de alto volumen, un modelo económico es casi siempre la elección correcta. Para análisis de formato largo, revisión de código o cualquier cosa orientada al usuario, el modelo puntero normalmente se amortiza solo.

Cuidado con la dilución del contexto: cuando metes 200K tokens en la ventana, la atención del modelo se reparte y empieza a ignorar la parte central del prompt — incluso en los punteros de contexto largo. Recupera los 8 a 16K tokens relevantes con embeddings en lugar de pegar el documento entero.

Frequently asked questions

Start Building with AI

Access all models through a single API. Get free credits when you sign up — no credit card required.