How much does GPT-5.4 Mini cost via Railwail?

Input: €0.001 per 1M tokens. Output: €0.005 per 1M tokens. No monthly minimum, no subscription. Start with €5 free credits.

What is the context window of GPT-5.4 Mini?

GPT-5.4 Mini supports a 400K tokens context window — enough for entire codebases or research papers in one prompt.

How fast is GPT-5.4 Mini?

Latency depends on prompt length and load — typically 200ms to 2s for short prompts. We measure p50/p95 in real-time on /rankings.

Is GPT-5.4 Mini better than BLIP?

It depends on your use case. GPT-5.4 Mini (OpenAI) and BLIP (Salesforce) are both strong choices in multimodal. Compare them side-by-side at /compare/gpt-5-4-mini-vs-blip-captioning.

Does GPT-5.4 Mini support image input (vision)?

Yes — GPT-5.4 Mini accepts image inputs in addition to text. Send images via the standard OpenAI-compatible `messages` array with `image_url` content blocks. Supported formats: text, image.

GPT-5.4 Mini

Name: GPT-5.4 Mini
Brand: OpenAI
SKU: gpt-5-4-mini
Price: 1e-6 EUR
Availability: InStock

New

Popular

OpenAI

Multimodal

OpenAI's efficient mid-tier model. 2x faster than its predecessor, 400k context, approaches GPT-5.4 quality on SWE-Bench Pro at a fraction of the cost.

Try GPT-5.4 Mini now

Send a single prompt and stream a response inline. Hit Cmd+Enter to submit.

Press Cmd+Enter to send

Response appears here.

TL;DR·Last updated June 24, 2026

GPT-5.4 Mini is multimodal AI model from OpenAI, priced at €0.001 per 1M input tokens with a 400K tokens context window.

About this model

Released March 17, 2026, GPT-5.4 mini brings the strengths of GPT-5.4 to a smaller, faster model designed for the subagent era. 400K context, vision input, integrated tool use, and 2x faster latency than GPT-5 mini. Significant gains on coding, reasoning, multimodal understanding and tool use; approaches the full GPT-5.4 on SWE-Bench Pro and OSWorld-Verified. Recommended for subagent workflows, customer-facing chat, coding assistants and high-volume API workloads.

Try GPT-5.4 Mini

System Prompt

Message

Temperature

0.7

Max Tokens

Pricing

Price per Generation

Per generationFree

API Integration

Use our OpenAI-compatible API to integrate GPT-5.4 Mini into your application.

Install

npm install railwail

JavaScript / TypeScript

import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

// Simple — just pass a string
const reply = await rw.run("gpt-5-4-mini", "Hello! What can you do?");
console.log(reply);

// With message history
const reply2 = await rw.run("gpt-5-4-mini", [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);

// Full response with usage info
const res = await rw.chat("gpt-5-4-mini", [
  { role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);

Specifications

Context window

400,000 tokens

Max output

128,000 tokens

Developer

OpenAI

Deep dive — OpenAI's GPT-5.4 Mini

About OpenAI

Founded 2015 · San Francisco, USA

OpenAI was founded in December 2015 as a non-profit AI research organisation and transitioned to a capped-profit structure in 2019. The GPT lineage spans GPT-1 (2018) through GPT-5 (mid-2025) and the GPT-5.x family (2025-2026) which unified the o-series reasoning models with the general-purpose GPT line. GPT-5.4 mini and nano were announced on March 17, 2026 as the small-model tier of the GPT-5.4 generation. OpenAI is backed by Microsoft, Khosla, Andreessen Horowitz, Thrive Capital and Sequoia, with total funding above $60 billion and a 2026 valuation above $300 billion.

Visit OpenAI →

Architecture

Unified Transformer (mid-tier, with integrated 'Thinking' tier)

GPT-5.4 mini was announced March 17, 2026 alongside GPT-5.4 nano as the small-model tier of the GPT-5.4 generation. It is a smaller variant of the unified GPT-5.4 architecture, retaining native text + image input, an integrated 'Thinking' tier for reasoning on demand, and the full tool-use API, while running more than 2x faster than GPT-5 mini at significantly lower cost. Pretraining used a similar multi-trillion-token mixture as GPT-5.4 with heavier distillation pressure from larger teacher models. Post-training included supervised fine-tuning, RLHF and reinforcement learning against verifiable rewards on coding, reasoning and tool-use trajectories. On evaluations such as SWE-Bench Pro and OSWorld-Verified, GPT-5.4 mini approaches the performance of full GPT-5.4 while costing roughly one-third as much.

Parameters: Undisclosed (estimated tens of billions of parameters, likely sparse MoE)
Context: 400K tokens

What it can do

2x faster than GPT-5 mini at lower cost
Approaches full GPT-5.4 on SWE-Bench Pro and OSWorld-Verified
400K token context window
Native multimodal input: text and images
Integrated 'Thinking' tier activates for harder reasoning
Native tool use, function calling and parallel tool calls
Designed for the subagent era: works well under an orchestrator
Strong coding, classification and extraction performance
Available in ChatGPT, Codex CLI and the OpenAI API
Regional processing endpoints available with 10% uplift
Best for: subagent workflows, customer-facing chat, coding assistants, high-volume API workloads.

Training & License

Pretrained on a multi-trillion-token mixture of web text, code, scientific papers and licensed data; heavy distillation from larger GPT-5.4 teacher models. Post-training uses supervised fine-tuning, RLHF and RL against verifiable rewards. Knowledge cutoff approximately late 2025.

License: Proprietary commercial license via OpenAI API and Azure OpenAI.

Known limitations

Below full GPT-5.4 on the hardest agentic and reasoning benchmarks
Smaller context window than GPT-5.4 (400K vs 1.05M)
No native audio or video input
Knowledge cutoff in late 2025
Thinking mode adds latency and token cost when enabled

Research papers

Frequently asked questions

Related Models

View all Multimodal

BLIP

Salesforce

Salesforce BLIP. Vision-language model for image captioning and visual question answering. Given an image it writes a short natural-language caption, or answers a question about the image when one is supplied. A widely used baseline for automatic captioning.

€1.00

CLIP Interrogator

Community

pharmapsychotic's CLIP Interrogator. Takes an image and produces a Stable-Diffusion-style text prompt by combining BLIP captioning with CLIP to rank likely subjects, artists, mediums and styles. Commonly used to reverse-engineer a prompt from an existing picture.

€1.00

Claude 3.5 Sonnet (vision)

Anthropic

Anthropic Claude 3.5 Sonnet with image input. 200k context, strong on dense documents, tables, charts and handwriting. Reliable structured extraction from screenshots and scans.

Free

Claude Opus 4.7