How much does GPT-5.4 cost via Railwail?

Input: €0.003 per 1M tokens. Output: €0.015 per 1M tokens. No monthly minimum, no subscription. Start with €5 free credits.

What is the context window of GPT-5.4?

GPT-5.4 supports a 1.1M tokens context window — enough for entire codebases or research papers in one prompt.

Latency depends on prompt length and load — typically 200ms to 2s for short prompts. We measure p50/p95 in real-time on /rankings.

Is GPT-5.4 better than BLIP?

It depends on your use case. GPT-5.4 (OpenAI) and BLIP (Salesforce) are both strong choices in multimodal. Compare them side-by-side at /compare/gpt-5-4-vs-blip-captioning.

Does GPT-5.4 support image input (vision)?

Yes — GPT-5.4 accepts image inputs in addition to text. Send images via the standard OpenAI-compatible `messages` array with `image_url` content blocks. Supported formats: text, image.

GPT-5.4

Name: GPT-5.4
Brand: OpenAI
SKU: gpt-5-4
Price: 3e-6 EUR
Availability: InStock

New

Popular

OpenAI

Multimodal

OpenAI's unified flagship combining GPT and o-series reasoning into one model. 1M context, multimodal, top SWE-Bench Pro and OSWorld scores.

Try GPT-5.4 now

Send a single prompt and stream a response inline. Hit Cmd+Enter to submit.

Press Cmd+Enter to send

Response appears here.

TL;DR·Last updated June 24, 2026

GPT-5.4 is multimodal AI model from OpenAI, priced at €0.003 per 1M input tokens with a 1.1M tokens context window.

About this model

GPT-5.4 is OpenAI's frontier model as of 2026, merging the previously separate GPT and o-series reasoning lines into a single unified architecture. Native multimodal input (text + images), 1.05M-token experimental context window (272K standard), 128K max output, integrated 'Thinking' tier for reasoning on demand. Best for: high-context coding, agentic engineering, multimodal analysis, complex tool-use workflows. The Codex line has been absorbed into this model.

Try GPT-5.4

System Prompt

Message

Temperature

0.7

Max Tokens

Pricing

Price per Generation

Per generationFree

API Integration

Use our OpenAI-compatible API to integrate GPT-5.4 into your application.

Install

npm install railwail

JavaScript / TypeScript

import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

// Simple — just pass a string
const reply = await rw.run("gpt-5-4", "Hello! What can you do?");
console.log(reply);

// With message history
const reply2 = await rw.run("gpt-5-4", [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);

// Full response with usage info
const res = await rw.chat("gpt-5-4", [
  { role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);

Specifications

Context window

1,050,000 tokens

Max output

128,000 tokens

Developer

OpenAI

Deep dive — OpenAI's GPT-5.4

About OpenAI

Founded 2015 · San Francisco, USA

OpenAI was founded in December 2015 by Sam Altman, Elon Musk, Greg Brockman, Ilya Sutskever, Wojciech Zaremba and John Schulman as a non-profit AI research organisation; it transitioned to a 'capped-profit' structure in 2019. OpenAI authored the GPT lineage starting with GPT-1 (2018), GPT-2 (2019), GPT-3 (2020), the ChatGPT launch with GPT-3.5 (November 2022), GPT-4 (March 2023), GPT-4o (May 2024), the o-series reasoning models o1 (December 2024) and o3 (early 2025), then GPT-5 (mid-2025). The GPT-5.x series unified the reasoning o-series with the general-purpose GPT line: GPT-5.2 shipped December 2025, GPT-5.3-Codex February 2026, GPT-5.4 in March 2026 and the GPT-5.5 family later in 2026. OpenAI's investors include Microsoft (~$13B+), Khosla Ventures, Andreessen Horowitz, Thrive Capital and Sequoia, with total funding exceeding $60 billion and a 2026 valuation above $300 billion.

Visit OpenAI →

Architecture

Unified Transformer with integrated reasoning ('Thinking') tier

GPT-5.4 is OpenAI's frontier model as of March 2026 and marks the full convergence of the previously separate o-series reasoning models and general-purpose GPT line into a single unified architecture. The o1, o3, o3-pro and o4-mini reasoning models were retired from ChatGPT on February 13, 2026; their capabilities are now folded into GPT-5.x as a 'Thinking' tier that activates on demand. GPT-5.4 also absorbs the Codex line for agentic coding. Pretraining used a multi-trillion-token mixture of web text, code repositories, scientific papers, books and licensed datasets on OpenAI's most recent supercomputer cluster. Post-training combined supervised fine-tuning, RLHF and large-scale reinforcement learning against verifiable rewards on coding, math and tool-use trajectories. The model supports native text + image input and a 1.05M-token experimental context window (272K standard), 128K max output, integrated tool-use, function calling, structured outputs, and built-in 'Tool Search' for delegating to subagents. Safety training followed OpenAI's Preparedness Framework with red-teaming, capability evaluations and external assessments.

Parameters: Undisclosed (estimated multi-hundred billion parameters, likely sparse MoE)
Context: 1.1M tokens

What it can do

Unified GPT and o-series: integrated 'Thinking' tier activates on demand
1.05M-token experimental context window (272K standard, 128K max output)
Native multimodal input: text and images in a single pass
Top scores on SWE-Bench Pro and OSWorld-Verified at launch
Native tool use, function calling, parallel tool calls and structured outputs
Integrated 'Tool Search' for delegating to subagent models
Absorbed Codex agentic coding capabilities
Strong multilingual performance and instruction following
Available via OpenAI API, Azure OpenAI, ChatGPT and Codex CLI
Regional processing (data residency) endpoints available with 10% uplift
Best for: high-context coding, agentic engineering, multimodal analysis, complex tool-use workflows.

Training & License

Pretrained on a multi-trillion-token mixture of web text, code, scientific papers, books and licensed datasets. Post-training combines supervised fine-tuning, RLHF and large-scale reinforcement learning against verifiable rewards on coding, math and tool-use tasks. Knowledge cutoff approximately late 2025.

License: Proprietary commercial license via OpenAI API and Azure OpenAI. Commercial use permitted under OpenAI's Usage Policies.

Known limitations

1M context is experimental and must be enabled explicitly (default is 272K)
Prompts above 272K input tokens are billed at 2x input / 1.5x output for the full session
No native audio or video input in this variant
Higher latency and cost than mini/nano tiers, especially with Thinking enabled
Knowledge cutoff in late 2025

Research papers

Frequently asked questions

Related Models

View all Multimodal

BLIP

Salesforce

Salesforce BLIP. Vision-language model for image captioning and visual question answering. Given an image it writes a short natural-language caption, or answers a question about the image when one is supplied. A widely used baseline for automatic captioning.

€1.00

CLIP Interrogator

Community

pharmapsychotic's CLIP Interrogator. Takes an image and produces a Stable-Diffusion-style text prompt by combining BLIP captioning with CLIP to rank likely subjects, artists, mediums and styles. Commonly used to reverse-engineer a prompt from an existing picture.

€1.00

Claude 3.5 Sonnet (vision)

Anthropic

Anthropic Claude 3.5 Sonnet with image input. 200k context, strong on dense documents, tables, charts and handwriting. Reliable structured extraction from screenshots and scans.

Free

Claude Opus 4.7