How much does Mistral Pixtral Large (124B) cost via Railwail?

Input: €2.00 per 1M tokens. Output: €6.00 per 1M tokens. No monthly minimum, no subscription. Start with €5 in free credits.
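
As a rough illustration of the per-token rates above, the sketch below estimates the cost of a single request in euros. The helper name `estimateCostEur` and the constant names are hypothetical. Only the two rates come from this page, and the calculation ignores free credits and any rounding applied at billing time.

```typescript
// Rates quoted on this page, in EUR per 1M tokens.
const INPUT_EUR_PER_MILLION = 2.0;
const OUTPUT_EUR_PER_MILLION = 6.0;

// Hypothetical helper: estimates the cost of one request in EUR.
function estimateCostEur(inputTokens: number, outputTokens: number): number {
  if (inputTokens < 0 || outputTokens < 0) {
    throw new RangeError("token counts must be non-negative");
  }
  const inputCost = (inputTokens / 1_000_000) * INPUT_EUR_PER_MILLION;
  const outputCost = (outputTokens / 1_000_000) * OUTPUT_EUR_PER_MILLION;
  return inputCost + outputCost;
}

// Example: a 10,000-token prompt with a 500-token reply.
const cost = estimateCostEur(10_000, 500);
console.log(`Estimated cost: €${cost.toFixed(4)}`);
```

For that example the estimate comes to about €0.023: €0.02 for input plus €0.003 for output.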

What is the context window of Mistral Pixtral Large (124B)?

Mistral Pixtral Large (124B) supports a 131.1K-token context window (131,072 tokens), enough for long books, technical manuals, and extended analysis.

How fast is Mistral Pixtral Large (124B)?

Latency depends on prompt length and load, typically 200 ms to 2 s for short prompts. We measure p50/p95 latency in real time on /rankings.
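
For readers unfamiliar with p50/p95, here is a minimal sketch of how such percentiles can be computed from your own latency measurements. It uses the nearest-rank method, which is an assumption: this page does not say how /rankings derives its figures. The function name `percentile` and the sample values are made up for illustration.

```typescript
// Nearest-rank percentile over latency samples in milliseconds.
function percentile(samplesMs: number[], p: number): number {
  if (samplesMs.length === 0) {
    throw new RangeError("need at least one sample");
  }
  if (p < 0 || p > 100) {
    throw new RangeError("p must be between 0 and 100");
  }
  // Sort a copy so the caller's array is left untouched.
  const sorted = [...samplesMs].sort((a, b) => a - b);
  // Rank is 1-based; clamp to 1 so p = 0 returns the minimum.
  const rank = Math.max(1, Math.ceil((p * sorted.length) / 100));
  return sorted[rank - 1];
}

// Made-up latencies, for illustration only.
const samples = [200, 250, 300, 400, 2000];
console.log("p50:", percentile(samples, 50)); // 300
console.log("p95:", percentile(samples, 95)); // 2000
```

The example shows why both numbers matter: a single slow request leaves p50 unchanged but dominates p95.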

Is Mistral Pixtral Large (124B) better than Claude Opus 4.7?

It depends on your use case. Mistral Pixtral Large (124B) (Mistral AI) and Claude Opus 4.7 (Anthropic) are both strong multimodal models. Compare them side by side at /compare/pixtral-large-124b-vs-claude-opus-4-7.

Does Mistral Pixtral Large (124B) support image input (vision)?

Yes. Mistral Pixtral Large (124B) accepts image inputs in addition to text. Send images via the standard OpenAI-compatible `messages` array with `image_url` content blocks. Supported input types: text, image.
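
A minimal sketch of what such a message might look like, using the content-block shapes common to OpenAI-style chat APIs. The helper `buildVisionMessage` and the type names are hypothetical, and the example URL is a placeholder. The 30-image cap is taken from the model description on this page.

```typescript
// Content-block shapes used by OpenAI-style chat APIs.
type TextBlock = { type: "text"; text: string };
type ImageBlock = { type: "image_url"; image_url: { url: string } };
type UserMessage = { role: "user"; content: Array<TextBlock | ImageBlock> };

// Limit stated on this page: up to 30 images per request.
const MAX_IMAGES_PER_REQUEST = 30;

// Hypothetical helper: builds one user message from a prompt and image URLs.
function buildVisionMessage(prompt: string, imageUrls: string[]): UserMessage {
  if (imageUrls.length > MAX_IMAGES_PER_REQUEST) {
    throw new RangeError(`at most ${MAX_IMAGES_PER_REQUEST} images per request`);
  }
  const images = imageUrls.map((url): ImageBlock => ({
    type: "image_url",
    image_url: { url },
  }));
  // Text first, then the images, all inside a single user turn.
  return { role: "user", content: [{ type: "text", text: prompt }, ...images] };
}

const message = buildVisionMessage("What does this chart show?", [
  "https://example.com/chart.png", // placeholder URL
]);
console.log(JSON.stringify([message], null, 2));
```

The resulting array can then be passed as the message list of a chat call. Whether the `railwail` SDK forwards array-valued `content` unchanged is an assumption worth checking against its documentation.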

Mistral Pixtral Large (124B)

Mistral AI

Multimodal

Mistral's 124B multimodal flagship: a 123B decoder plus a 1B vision encoder, a 128K (131,072-token) context window, and up to 30 images per request.

TL;DR · Last updated May 16, 2026

Mistral Pixtral Large (124B) is a multimodal AI model from Mistral AI, priced at €2.00 per 1M input tokens with a 131.1K-token context window.

Pricing

Price per Generation

Per generation: Free

API Integration

Use our OpenAI-compatible API to integrate Mistral Pixtral Large (124B) into your application.

Install

npm install railwail

JavaScript / TypeScript

import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

// Simple — just pass a string
const reply = await rw.run("pixtral-large-124b", "Hello! What can you do?");
console.log(reply);

// With message history
const reply2 = await rw.run("pixtral-large-124b", [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);

// Full response with usage info
const res = await rw.chat("pixtral-large-124b", [
  { role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);

Specifications

Context window

131,072 tokens

Max output

8,192 tokens

Developer

Mistral AI
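
The two limits above interact when you choose `max_tokens`. The sketch below clamps a requested value to what fits, assuming the prompt and the completion share the 131,072-token window (a common convention, but not confirmed on this page). The helper name `maxTokensFor` is hypothetical, and counting prompt tokens is left to the caller.

```typescript
// Limits from the Specifications list on this page.
const CONTEXT_WINDOW_TOKENS = 131_072;
const MAX_OUTPUT_TOKENS = 8_192;

// Hypothetical helper: largest usable max_tokens for a given prompt size,
// assuming prompt and completion share the context window.
function maxTokensFor(promptTokens: number, requested: number): number {
  if (promptTokens < 0 || requested < 0) {
    throw new RangeError("token counts must be non-negative");
  }
  const remaining = CONTEXT_WINDOW_TOKENS - promptTokens;
  if (remaining <= 0) {
    throw new RangeError("prompt does not fit in the context window");
  }
  return Math.min(requested, MAX_OUTPUT_TOKENS, remaining);
}

console.log(maxTokensFor(1_000, 500));     // 500: request fits as-is
console.log(maxTokensFor(1_000, 20_000));  // 8192: capped by max output
console.log(maxTokensFor(130_000, 8_192)); // 1072: capped by remaining context
```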

Deep dive — Mistral AI's Mistral Pixtral Large (124B)

About Mistral AI

Founded 2023 · Paris, France

Mistral AI was founded in April 2023 in Paris by Arthur Mensch (CEO, ex-DeepMind), Guillaume Lample (Chief Scientist, ex-Meta FAIR LLaMA lead) and Timothée Lacroix (CTO, ex-Meta FAIR). The company is the European poster child for sovereign AI. It raised a record-setting $113M seed round (June 2023), followed by a $415M Series A (December 2023), a $645M Series B (June 2024) and a reported €1B+ funding round in 2025 at a valuation above $6B, with backers including Andreessen Horowitz, Lightspeed, General Catalyst, BPI France and Cisco. Mistral's model line includes Mistral 7B, Mixtral 8x7B and 8x22B (MoE), Mistral Large 1/2, Codestral, and the multimodal Pixtral family. Pixtral 12B launched in September 2024, and Pixtral Large 124B followed in November 2024 as the flagship multimodal model, released under the Mistral Research Licence and offered commercially via la Plateforme.

Architecture

Decoder-only Transformer with Pixtral-ViT vision encoder

Pixtral Large is a 124B-parameter multimodal model built on top of Mistral Large 2 (a 123B decoder-only Transformer) and Mistral's custom Pixtral-ViT vision encoder (1B parameters, up from the 400M-parameter encoder in Pixtral 12B). The vision encoder processes each image at its native aspect ratio in patches of 16x16 pixels and projects the resulting visual tokens directly into the LLM token stream (without cross-attention or a fixed-size resampler), which lets the model handle high-resolution images and many images per prompt in the same 131K-token context window. The vision encoder uses 2D rotary position embeddings and an attention mask that distinguishes patches from different images. Training used a multi-stage curriculum: vision-encoder pretraining on image-text pairs, joint vision-language pretraining, and multimodal supervised fine-tuning with chain-of-thought style instruction data. Pixtral Large posts top scores on MathVista, DocVQA and VQAv2, often matching GPT-4o and Claude 3.5 Sonnet at lower cost. The model is released under the Mistral Research Licence with a separate commercial licence available.

Parameters: 124B (123B LLM + 1B vision encoder)
Context: 131.1K tokens
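
To get a feel for how image size translates into context usage, the sketch below counts 16x16-pixel patches for an image at its native resolution. It is a back-of-the-envelope estimate under stated assumptions: partial patches are rounded up, no resizing is applied, and any separator or end-of-image tokens the model may add are ignored. The helper name `patchCount` is hypothetical.

```typescript
// Patch edge length in pixels, as described in the architecture notes.
const PATCH_SIZE_PX = 16;

// Hypothetical helper: number of 16x16 patches covering an image.
// Assumes partial patches at the right/bottom edges are rounded up.
function patchCount(widthPx: number, heightPx: number): number {
  if (widthPx <= 0 || heightPx <= 0) {
    throw new RangeError("image dimensions must be positive");
  }
  const cols = Math.ceil(widthPx / PATCH_SIZE_PX);
  const rows = Math.ceil(heightPx / PATCH_SIZE_PX);
  return cols * rows;
}

console.log(patchCount(512, 512));  // 1024 patches (32 x 32)
console.log(patchCount(1024, 768)); // 3072 patches (64 x 48)
```

By this estimate, 30 images at 1024x768 would contribute roughly 30 x 3,072 = 92,160 visual tokens, which still leaves room for text in the 131,072-token window. Actual token counts may differ.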

What it can do

Native variable-resolution image input (no fixed grid)
Up to 30 images per request
Top-tier MathVista, DocVQA, VQAv2 and ChartQA scores
131K-token context window
Multilingual: English, German, French, Spanish, Italian, Portuguese, Dutch, Russian, Arabic, Hindi, Japanese, Chinese, Korean
JSON output and tool use inherited from Mistral Large 2
Open weights under Mistral Research Licence (commercial via separate licence)
Best for: high-end open-weights multimodal apps, document AI, math-with-image, European data sovereignty

Training & License

Mistral has not disclosed exact dataset sizes. The technical report describes a 'multimodal corpus of public web data, licensed image-text pairs and curated instruction data' with a knowledge cutoff in 2024.

License: Weights under Mistral Research Licence (non-commercial). Commercial use via Mistral la Plateforme API or a paid Mistral Commercial Licence.

Known limitations