Text & Chat Models

Powerful language models for conversation, analysis, and content generation

Modelos de texto e chat para cargas de trabalho de IA em produção

Os large language models são o cavalo de trabalho da IA moderna: chatbots, agentes, resumidores, classificadores, tradutores. É a categoria mais concorrida no Railwail — OpenAI, Anthropic, Google, Mistral, Meta, DeepSeek, xAI e dezenas de laboratórios open-weights competem aqui.

45 models available

Claude Opus 4

Text & ChatAnthropic
NewPopular

Anthropic's most powerful model. Exceptional at complex analysis, agentic tasks, and extended reasoning.

Free5.0s
flagshipreasoningagentic

Claude Sonnet 4

Text & ChatAnthropic
Popular

Anthropic's most capable model. Excellent for complex analysis, coding, math, and creative writing.

Free3.0s
popularcodinganalysis

DeepSeek V3.1

Text & ChatDeepSeek
Popular

DeepSeek's refreshed V3.1 release. 671B MoE / 37B active. Tops open-weights leaderboards on coding and reasoning.

Free
deepseekopen-weightsmoe

DeepSeek V4 Pro

Text & ChatDeepSeek
NewPopular

DeepSeek's April 2026 flagship. 1.6T MoE / 49B active params, 1M context, rivals top closed-source models on STEM and coding at a fraction of the price.

Free
deepseekopen-weightsmoe

Gemini 2.0 Flash

Text & ChatGoogle DeepMind
NewPopular

Google's fastest multimodal model. Supports text, images, audio, and video input.

Free1.2s
fastmultimodalaffordable

Gemini 2.5 Pro

Text & ChatGoogle DeepMind
NewPopular

Google's latest thinking model. Excels at reasoning, coding, math, and science with massive context window.

Free4.0s
reasoningcodingmultimodal

GPT-4.1

Text & ChatOpenAI
NewPopular

OpenAI's newest flagship model. Improved reasoning, instruction following, and coding over GPT-4o.

Free2.5s
popularcodingreasoning

GPT-4o

Text & ChatOpenAI
Popular

OpenAI's most capable multimodal model. Excellent for complex reasoning, coding, and creative tasks.

Free2.0s
popularfastmultimodal

Grok 4

Text & ChatxAI
Popular

xAI's flagship reasoning model with vision and tool use. 256k context, strong at complex reasoning and STEM tasks.

Free
xaiflagshipreasoning

Kimi K2 (Moonshot)

Text & ChatCustom
Popular

Moonshot AI's 1T-parameter MoE model. Industry-leading agentic coding and tool-use benchmarks.

Free
moonshotkimimoe

MiniMax-01

Text & ChatMinimax
Popular

MiniMax's 456B hybrid lightning-attention model with native 4M-token context. Industry-leading long-context.

Free
minimaxlong-contextlightning-attention

o3-mini

Text & ChatOpenAI
NewPopular

OpenAI's reasoning model optimized for STEM tasks, coding, and math. Uses chain-of-thought reasoning.

Free10.0s
reasoningcodingmath

Perplexity Sonar Pro

Text & ChatCustom
Popular

Perplexity's premium web-grounded search model with multi-step reasoning over live sources.

Free
perplexityweb-searchcitations

Qwen 3 235B Instruct

Text & ChatAlibaba / Qwen
Popular

Alibaba's Qwen 3 flagship MoE: 235B total / 22B active. Strong reasoning and tool use, open-weights.

Free
qwenalibabamoe

AI21 Jamba 1.5 Large

Text & ChatCustom

AI21's flagship hybrid Mamba-Transformer model with a 256k context window for long-document tasks.

Free
ai21long-contextmamba

AI21 Jamba 1.5 Mini

Text & ChatCustom

Cost-efficient hybrid Mamba-Transformer model with 256k context. Tuned for high-throughput RAG.

Free
ai21long-contextmamba

Claude Haiku 3.5

Text & ChatAnthropic

Anthropic's fast and affordable model. Great for quick tasks, summarization, and simple coding.

Free1.0s
fastaffordable

Cohere Aya 23 35B

Text & ChatCustom

Open-weights multilingual research model from Cohere covering 23 languages. 35B parameters.

Free
coheremultilingualopen-weights

Cohere Command Light (legacy)

Text & ChatCohere

Cohere's fast lightweight chat model (deprecated Sep 2025). Kept as comparison tombstone.

Free
coherelegacydeprecated

Cohere Command R (08-2024)

Text & ChatCohere

Cohere's mid-tier RAG/tool model. Cost-efficient sibling of Command R+ with 128k context.

Free
cohereragtools

Cohere Command R+ (08-2024)

Text & ChatCohere

Cohere's flagship RAG- and tool-optimized chat model. 128k context, refreshed August 2024.

Free
cohereragtools

DeepSeek R1

Text & ChatDeepSeek
New

DeepSeek's reasoning model with chain-of-thought capabilities. Excellent for complex problem-solving.

Free8.0s
reasoningmath

DeepSeek V3

Text & ChatDeepSeek

Powerful open-weight model from DeepSeek. Strong at coding, math, and Chinese/English tasks.

Free2.0s
affordablecoding

DeepSeek V4 Flash

Text & ChatDeepSeek
New

Efficiency-optimized variant of DeepSeek V4. 284B MoE / 13B active, 1M context, ultra-low pricing for high-throughput workloads.

Free
deepseekopen-weightsmoe

GPT-4o Mini

Text & ChatOpenAI

Small, fast, and affordable model for lightweight tasks. Great balance of speed and capability.

Free800ms
fastaffordable

Grok 3

Text & ChatxAI
New

xAI's flagship model. Strong at reasoning, coding, and real-time knowledge with web search capabilities.

Free3.0s
reasoningreal-time

Llama 3.3 70B

Text & ChatMeta

Meta's open-source 70B parameter model. Strong all-around performance with multilingual support.

Free2.5s
open-sourcepopular

M2M-100 12B

Text & ChatMeta

Meta M2M-100 12B many-to-many translation model. Direct translation between 100 languages without pivoting through English.

€0.006
replicatetranslationmeta

MADLAD-400 3B

Text & ChatGoogle DeepMind

Google MADLAD-400 3B multilingual translation model. 419 languages supported, trained on a 5T-token multilingual corpus with strong low-resource performance.

€0.004
replicatetranslationgoogle

mBART 50 Many-to-Many

Text & ChatMeta

Meta mBART-50 many-to-many translation model. 50 supported languages with strong performance on news and conversational text.

€0.003
replicatetranslationmeta

Microsoft Phi-3.5 MoE Instruct

Text & ChatMicrosoft

Mixture-of-experts Phi-3.5: 42B total / 6.6B active params. 128k context, multilingual.

Free
microsoftopen-weightsmoe

Mistral Large

Text & ChatMistral AI

Mistral's flagship model. Strong reasoning, multilingual, and coding capabilities.

Free2.5s
multilingualcoding

NLLB-200 3B

Text & ChatMeta

Meta's No Language Left Behind 3.3B translation model. Direct translation between any pair of 200+ languages including many low-resource African and Asian languages.

€0.003
replicatetranslationmeta

NLLB-200 Distilled 600M

Text & ChatMeta

Meta's distilled 600M NLLB. Same 200-language coverage as the 3B model with a fraction of the parameters, ideal for edge or high-throughput deployment.

€0.002
replicatetranslationmeta

Nous Hermes 3 405B

Text & ChatTogether AI

Full-parameter fine-tune of Llama 3.1 405B by Nous Research. Steerable, uncensored, strong tool use.

Free
nousopen-weightstools

Nous Hermes 3 70B

Text & ChatTogether AI

Llama-3.1-70B fine-tune from Nous Research with strong tool/agent capabilities and uncensored alignment.

Free
nousopen-weightstools

Perplexity Sonar

Text & ChatCustom

Perplexity's fastest and cheapest web-grounded chat model. Live-source citations included.

Free
perplexityweb-searchcitations

Perplexity Sonar Reasoning

Text & ChatCustom

Perplexity's reasoning model with chain-of-thought and integrated web search.

Free
perplexityweb-searchreasoning

Qwen 2.5 72B

Text & ChatAlibaba / Qwen

Alibaba's powerful open-source model. Excellent at coding, math, and multilingual tasks.

Free2.5s
open-sourcecodingmultilingual

Qwen 2.5-Max

Text & ChatCustom

Alibaba's flagship pretrained MoE model. Top-tier reasoning and code performance via DashScope API.

Free
qwenalibabamoe

SeamlessM4T v2 Large (Text)

Text & ChatMeta

Meta SeamlessM4T v2 Large. Universal multilingual translation across 100+ languages with text-to-text mode for documents and chat.

€0.006
replicatetranslationmeta

Snowflake Arctic Instruct

Text & ChatCustom

Snowflake's open MoE model: 480B total / 17B active params with dense+MoE hybrid architecture.

Free
snowflakemoeopen-weights

TII Falcon 180B Chat

Text & ChatTogether AI

TII's 180B causal decoder chat model fine-tuned on Ultrachat, Platypus and Airoboros.

Free
tiiopen-weightslegacy

TowerInstruct 13B

Text & ChatReplicate

Unbabel TowerInstruct 13B. Llama-2-based multilingual translation and post-editing model. Strong terminology consistency for enterprise localization.

€0.005
replicatetranslationunbabel

Yi Large

Text & ChatCustom

01.AI's larger general-purpose chat model with 32k context window and strong bilingual performance.

Free
01aichinesebilingual

Top text & chat models picks

Hand-picked across four common criteria — resolved against the live catalog so the picks track price and performance changes.

Melhor no global
Claude Opus 4

Anthropic's most powerful model. Exceptional at complex analysis, agentic tasks, and extended reasoning.

Learn more
Mais barato
Cohere Command R (08-2024)

Cohere's mid-tier RAG/tool model. Cost-efficient sibling of Command R+ with 128k context.

Learn more
Contexto mais longo
MiniMax-01

MiniMax's 456B hybrid lightning-attention model with native 4M-token context. Industry-leading long-context.

Learn more
Mais rápido
GPT-4o Mini

Small, fast, and affordable model for lightweight tasks. Great balance of speed and capability.

Learn more

O pricing na geração de texto é quase sempre por token, dividido numa tarifa de input mais económica e numa de output mais cara. Um milhão de tokens em input corresponde a cerca de 750.000 palavras em inglês, pelo que mesmo prompts verbosos raramente fazem subir o custo por chamada acima de uns cêntimos. É no output que as contas crescem: loops agênticos que se voltam a fazer prompts a si próprios, relatórios longos e exemplos few-shot sem cache acumulam-se rapidamente. Coloque o prompt de sistema em cache, recorte o histórico de forma agressiva e prefira o modo JSON para tarefas estruturadas para manter o número de tokens baixo.

O triângulo de compromissos é qualidade, latência e custo. Os modelos topo de gama (GPT-5, Claude 4.6, Gemini 2.5) custam dez a cinquenta vezes mais do que os tiers económicos (Haiku, Flash, Mini) e respondem duas a quatro vezes mais devagar, mas raciocinam em maior profundidade, seguem instruções de forma mais fiável e alucinam menos. Para classificação ou extração de grande volume, um modelo económico é quase sempre a escolha certa. Para análise longform, code review ou qualquer coisa virada ao utilizador final, o flagship normalmente paga-se a si próprio.

Atenção à diluição do contexto: quando enfiamos 200K tokens na janela, a atenção do modelo dispersa-se e começa a ignorar o meio do prompt — mesmo nos flagships de contexto longo. Recupere os 8-16K tokens relevantes com embeddings em vez de colar o documento inteiro.

Frequently asked questions

Start Building with AI

Access all models through a single API. Get free credits when you sign up — no credit card required.