AI Model Leaderboard

Last updated about 3 hours ago

Live ranking of 369+ AI models across 8 dimensions. Updated every 60 minutes.

369 models10 providersLast updated Jul 9, 2026, 1:32 AM

Leaders at a glance

Cheapest input

Lowest input price per 1M tokens. Top 25 models, updated hourly.

Rank	Model	Metric	Context	Input cost	Output cost	Action
1	Gemini 1.5 Pro (vision) Google	$1.00 / 1M tokens	2.1M	$1.00	$5.00	Leader
2	Grok 4.3 Custom New	$1.00 / 1M tokens	1.0M	$1.00	$3.00	Compare
3	Gemini 3 Flash Google New	$1.00 / 1M tokens	1.0M	$1.00	$3.00	Compare
4	GPT-5.4 Mini OpenAI New	$1.00 / 1M tokens	400K	$1.00	$5.00	Compare
5	Claude Haiku 4.5 Anthropic New	$1.00 / 1M tokens	200K	$1.00	$5.00	Compare
6	Gemini 3.1 Pro Google New	$2.00 / 1M tokens	2.0M	$2.00	$12.00	Compare
7	Claude 3.5 Sonnet (vision) Anthropic	$3.00 / 1M tokens	200K	$3.00	$15.00	Compare
8	GPT-4o (vision) OpenAI	$3.00 / 1M tokens	128K	$3.00	$10.00	Compare
9	GPT-5.4 OpenAI New	$3.00 / 1M tokens	1.1M	$3.00	$15.00	Compare
10	Claude Sonnet 4.6 Anthropic New	$3.00 / 1M tokens	200K	$3.00	$15.00	Compare
11	Claude Opus 4.7 Anthropic New	$5.00 / 1M tokens	200K	$5.00	$25.00	Compare
12	Jina Embeddings v3 (Multilingual) Custom	$20.00 / 1M tokens	8K	$20.00	Free	Compare
13	OpenAI text-embedding-3-small OpenAI	$20.00 / 1M tokens	8K	$20.00	Free	Compare
14	Cartesia Sonic Custom	$30.00 / 1M tokens	n/a	$30.00	Free	Compare
15	Voyage AI voyage-3 Custom	$60.00 / 1M tokens	32K	$60.00	Free	Compare
16	Cohere embed-multilingual-v3 Custom	$100.00 / 1M tokens	512	$100.00	Free	Compare
17	Reka Edge Custom	$100.00 / 1M tokens	16K	$100.00	$100.00	Compare
18	OpenAI text-embedding-3-large OpenAI	$130.00 / 1M tokens	8K	$130.00	Free	Compare
19	Cohere Command R (08-2024) Custom	$150.00 / 1M tokens	128K	$150.00	$600.00	Compare
20	Voyage AI voyage-code-3 Custom	$180.00 / 1M tokens	32K	$180.00	Free	Compare
21	MiniMax-01 Custom	$200.00 / 1M tokens	4.1M	$200.00	$1100.00	Compare
22	Reka Flash Custom	$200.00 / 1M tokens	128K	$200.00	$800.00	Compare
23	AI21 Jamba 1.5 Mini Custom	$200.00 / 1M tokens	256K	$200.00	$400.00	Compare
24	GPT-5 Mini OpenAI	$250.00 / 1M tokens	400K	$250.00	$2000.00	Compare
25	DeepSeek V3.1 DeepSeek	$270.00 / 1M tokens	131K	$270.00	$1100.00	Compare

How we rank

Cost (input / output): Normalised to USD per 1M tokens, sourced from public provider list prices, refreshed weekly. Free models are excluded from cost rankings so the leaderboard reflects production economics.
Context window: Taken from each provider's official model card. Capped at the input-side window — output-only context is reported separately.
Latency: p50 measured from Railwail's own request logs over the trailing 30 days, with a minimum sample threshold of 100 requests per model. Latency is end-to-end (queue + provider + network).
Popularity: Total job count over the last 30 days. Excludes test traffic and synthetic load. A single user's repeat usage is weighted to avoid skew from large customers.

Freshness: Provider's official public release date. Models markedNewwere released in the last 30 days.
Community rating: ELO derived from head-to-head Arena votes by Railwail users. Default 1500 for unrated models. We require >30 matches before a rating is considered stable.
Best for code: Models tagged for coding (category code or tags including coding / developer), ordered by popularity within the cohort. Empirically tracks real developer adoption better than synthetic benchmarks.

Why no benchmarks?

MMLU, HumanEval, MT-Bench and similar are increasingly contaminated by training-set leakage and gamed via prompt engineering. They tell you nothing about a model's real cost in production, its tail latency, or whether developers actually keep choosing it after the launch hype fades. This leaderboard uses observable, real-world signals only — what people pay, how long they wait, and what they choose again.

Spot something off? We update prices and specs every week — but errors creep in.

Submit a correction →

Explore further

Browse all AI models →Vote in the Arena →See API pricing →Compare any two models →Read the docs →Latest blog posts →