Text & Chat Models

Powerful language models for conversation, analysis, and content generation

Modele tekstowe i czatowe do produkcyjnych obciążeń AI

Duże modele językowe to koń roboczy współczesnej AI: chatboty, agenci, narzędzia do streszczania, klasyfikatory i tłumacze. To najbardziej zatłoczona kategoria na Railwail — OpenAI, Anthropic, Google, Mistral, Meta, DeepSeek, xAI oraz dziesiątki laboratoriów open-weights rywalizują tutaj.

45 models available

Claude Opus 4

Text & ChatAnthropic
NewPopular

Anthropic's most powerful model. Exceptional at complex analysis, agentic tasks, and extended reasoning.

Free5.0s
flagshipreasoningagentic

Claude Sonnet 4

Text & ChatAnthropic
Popular

Anthropic's most capable model. Excellent for complex analysis, coding, math, and creative writing.

Free3.0s
popularcodinganalysis

DeepSeek V3.1

Text & ChatDeepSeek
Popular

DeepSeek's refreshed V3.1 release. 671B MoE / 37B active. Tops open-weights leaderboards on coding and reasoning.

Free
deepseekopen-weightsmoe

DeepSeek V4 Pro

Text & ChatDeepSeek
NewPopular

DeepSeek's April 2026 flagship. 1.6T MoE / 49B active params, 1M context, rivals top closed-source models on STEM and coding at a fraction of the price.

Free
deepseekopen-weightsmoe

Gemini 2.0 Flash

Text & ChatGoogle DeepMind
NewPopular

Google's fastest multimodal model. Supports text, images, audio, and video input.

Free1.2s
fastmultimodalaffordable

Gemini 2.5 Pro

Text & ChatGoogle DeepMind
NewPopular

Google's latest thinking model. Excels at reasoning, coding, math, and science with massive context window.

Free4.0s
reasoningcodingmultimodal

GPT-4.1

Text & ChatOpenAI
NewPopular

OpenAI's newest flagship model. Improved reasoning, instruction following, and coding over GPT-4o.

Free2.5s
popularcodingreasoning

GPT-4o

Text & ChatOpenAI
Popular

OpenAI's most capable multimodal model. Excellent for complex reasoning, coding, and creative tasks.

Free2.0s
popularfastmultimodal

Grok 4

Text & ChatxAI
Popular

xAI's flagship reasoning model with vision and tool use. 256k context, strong at complex reasoning and STEM tasks.

Free
xaiflagshipreasoning

Kimi K2 (Moonshot)

Text & ChatCustom
Popular

Moonshot AI's 1T-parameter MoE model. Industry-leading agentic coding and tool-use benchmarks.

Free
moonshotkimimoe

MiniMax-01

Text & ChatMinimax
Popular

MiniMax's 456B hybrid lightning-attention model with native 4M-token context. Industry-leading long-context.

Free
minimaxlong-contextlightning-attention

o3-mini

Text & ChatOpenAI
NewPopular

OpenAI's reasoning model optimized for STEM tasks, coding, and math. Uses chain-of-thought reasoning.

Free10.0s
reasoningcodingmath

Perplexity Sonar Pro

Text & ChatCustom
Popular

Perplexity's premium web-grounded search model with multi-step reasoning over live sources.

Free
perplexityweb-searchcitations

Qwen 3 235B Instruct

Text & ChatAlibaba / Qwen
Popular

Alibaba's Qwen 3 flagship MoE: 235B total / 22B active. Strong reasoning and tool use, open-weights.

Free
qwenalibabamoe

AI21 Jamba 1.5 Large

Text & ChatCustom

AI21's flagship hybrid Mamba-Transformer model with a 256k context window for long-document tasks.

Free
ai21long-contextmamba

AI21 Jamba 1.5 Mini

Text & ChatCustom

Cost-efficient hybrid Mamba-Transformer model with 256k context. Tuned for high-throughput RAG.

Free
ai21long-contextmamba

Claude Haiku 3.5

Text & ChatAnthropic

Anthropic's fast and affordable model. Great for quick tasks, summarization, and simple coding.

Free1.0s
fastaffordable

Cohere Aya 23 35B

Text & ChatCustom

Open-weights multilingual research model from Cohere covering 23 languages. 35B parameters.

Free
coheremultilingualopen-weights

Cohere Command Light (legacy)

Text & ChatCohere

Cohere's fast lightweight chat model (deprecated Sep 2025). Kept as comparison tombstone.

Free
coherelegacydeprecated

Cohere Command R (08-2024)

Text & ChatCohere

Cohere's mid-tier RAG/tool model. Cost-efficient sibling of Command R+ with 128k context.

Free
cohereragtools

Cohere Command R+ (08-2024)

Text & ChatCohere

Cohere's flagship RAG- and tool-optimized chat model. 128k context, refreshed August 2024.

Free
cohereragtools

DeepSeek R1

Text & ChatDeepSeek
New

DeepSeek's reasoning model with chain-of-thought capabilities. Excellent for complex problem-solving.

Free8.0s
reasoningmath

DeepSeek V3

Text & ChatDeepSeek

Powerful open-weight model from DeepSeek. Strong at coding, math, and Chinese/English tasks.

Free2.0s
affordablecoding

DeepSeek V4 Flash

Text & ChatDeepSeek
New

Efficiency-optimized variant of DeepSeek V4. 284B MoE / 13B active, 1M context, ultra-low pricing for high-throughput workloads.

Free
deepseekopen-weightsmoe

GPT-4o Mini

Text & ChatOpenAI

Small, fast, and affordable model for lightweight tasks. Great balance of speed and capability.

Free800ms
fastaffordable

Grok 3

Text & ChatxAI
New

xAI's flagship model. Strong at reasoning, coding, and real-time knowledge with web search capabilities.

Free3.0s
reasoningreal-time

Llama 3.3 70B

Text & ChatMeta

Meta's open-source 70B parameter model. Strong all-around performance with multilingual support.

Free2.5s
open-sourcepopular

M2M-100 12B

Text & ChatMeta

Meta M2M-100 12B many-to-many translation model. Direct translation between 100 languages without pivoting through English.

€0.006
replicatetranslationmeta

MADLAD-400 3B

Text & ChatGoogle DeepMind

Google MADLAD-400 3B multilingual translation model. 419 languages supported, trained on a 5T-token multilingual corpus with strong low-resource performance.

€0.004
replicatetranslationgoogle

mBART 50 Many-to-Many

Text & ChatMeta

Meta mBART-50 many-to-many translation model. 50 supported languages with strong performance on news and conversational text.

€0.003
replicatetranslationmeta

Microsoft Phi-3.5 MoE Instruct

Text & ChatMicrosoft

Mixture-of-experts Phi-3.5: 42B total / 6.6B active params. 128k context, multilingual.

Free
microsoftopen-weightsmoe

Mistral Large

Text & ChatMistral AI

Mistral's flagship model. Strong reasoning, multilingual, and coding capabilities.

Free2.5s
multilingualcoding

NLLB-200 3B

Text & ChatMeta

Meta's No Language Left Behind 3.3B translation model. Direct translation between any pair of 200+ languages including many low-resource African and Asian languages.

€0.003
replicatetranslationmeta

NLLB-200 Distilled 600M

Text & ChatMeta

Meta's distilled 600M NLLB. Same 200-language coverage as the 3B model with a fraction of the parameters, ideal for edge or high-throughput deployment.

€0.002
replicatetranslationmeta

Nous Hermes 3 405B

Text & ChatTogether AI

Full-parameter fine-tune of Llama 3.1 405B by Nous Research. Steerable, uncensored, strong tool use.

Free
nousopen-weightstools

Nous Hermes 3 70B

Text & ChatTogether AI

Llama-3.1-70B fine-tune from Nous Research with strong tool/agent capabilities and uncensored alignment.

Free
nousopen-weightstools

Perplexity Sonar

Text & ChatCustom

Perplexity's fastest and cheapest web-grounded chat model. Live-source citations included.

Free
perplexityweb-searchcitations

Perplexity Sonar Reasoning

Text & ChatCustom

Perplexity's reasoning model with chain-of-thought and integrated web search.

Free
perplexityweb-searchreasoning

Qwen 2.5 72B

Text & ChatAlibaba / Qwen

Alibaba's powerful open-source model. Excellent at coding, math, and multilingual tasks.

Free2.5s
open-sourcecodingmultilingual

Qwen 2.5-Max

Text & ChatCustom

Alibaba's flagship pretrained MoE model. Top-tier reasoning and code performance via DashScope API.

Free
qwenalibabamoe

SeamlessM4T v2 Large (Text)

Text & ChatMeta

Meta SeamlessM4T v2 Large. Universal multilingual translation across 100+ languages with text-to-text mode for documents and chat.

€0.006
replicatetranslationmeta

Snowflake Arctic Instruct

Text & ChatCustom

Snowflake's open MoE model: 480B total / 17B active params with dense+MoE hybrid architecture.

Free
snowflakemoeopen-weights

TII Falcon 180B Chat

Text & ChatTogether AI

TII's 180B causal decoder chat model fine-tuned on Ultrachat, Platypus and Airoboros.

Free
tiiopen-weightslegacy

TowerInstruct 13B

Text & ChatReplicate

Unbabel TowerInstruct 13B. Llama-2-based multilingual translation and post-editing model. Strong terminology consistency for enterprise localization.

€0.005
replicatetranslationunbabel

Yi Large

Text & ChatCustom

01.AI's larger general-purpose chat model with 32k context window and strong bilingual performance.

Free
01aichinesebilingual

Top text & chat models picks

Hand-picked across four common criteria — resolved against the live catalog so the picks track price and performance changes.

Najlepszy ogólnie
Claude Opus 4

Anthropic's most powerful model. Exceptional at complex analysis, agentic tasks, and extended reasoning.

Learn more
Najtańszy
Cohere Command R (08-2024)

Cohere's mid-tier RAG/tool model. Cost-efficient sibling of Command R+ with 128k context.

Learn more
Najdłuższy kontekst
MiniMax-01

MiniMax's 456B hybrid lightning-attention model with native 4M-token context. Industry-leading long-context.

Learn more
Najszybszy
GPT-4o Mini

Small, fast, and affordable model for lightweight tasks. Great balance of speed and capability.

Learn more

Cennik w generowaniu tekstu jest niemal zawsze za token i dzieli się na tańszą stawkę za input i droższą za output. Milion tokenów wejściowych odpowiada mniej więcej 750 000 słów po angielsku, więc nawet rozbudowane prompty rzadko podnoszą koszt jednego wywołania powyżej kilku centów. To w outpucie rosną rachunki: pętle agentowe, które same się reprompują, długie raporty i niezcachowane przykłady few-shot szybko się sumują. Cache prompt systemowy, agresywnie przycinaj historię i preferuj tryb JSON dla zadań ustrukturyzowanych, aby utrzymać liczbę tokenów na niskim poziomie.

Trójkąt kompromisu to jakość, opóźnienie i koszt. Modele flagship (GPT-5, Claude 4.6, Gemini 2.5) kosztują od dziesięciu do pięćdziesięciu razy więcej niż tańsze tiery (Haiku, Flash, Mini) i odpowiadają dwa do czterech razy wolniej, ale rozumują głębiej, dokładniej trzymają się instrukcji i mniej halucynują. Dla wielkoskalowej klasyfikacji albo ekstrakcji niemal zawsze właściwym wyborem jest model budżetowy. Dla analizy długoformatowej, code review i wszystkiego, co trafia do użytkownika końcowego, flagship zwykle zwraca koszt.

Uwaga na rozmycie kontekstu: gdy upchasz 200 tys. tokenów do okna, uwaga modelu rozprasza się i zaczyna ignorować środkową część promptu — nawet we flagshipach z długim kontekstem. Pobieraj istotne 8-16 tys. tokenów za pomocą embeddingów zamiast wklejać cały dokument.

Frequently asked questions

Start Building with AI

Access all models through a single API. Get free credits when you sign up — no credit card required.