GPT-5.4 Nano

New
OpenAI
Multimodal

OpenAI's smallest and cheapest GPT-5.4 variant. Built for high-volume classification, extraction and coding subagents at edge-grade latency.

Try GPT-5.4 Nano now
Send a single prompt and stream a response inline. Hit Cmd+Enter to submit.
Sign in to try this model with €5 free credits.
Sign in
Press Cmd+Enter to send
Response appears here.
TL;DRΒ·Last updated May 16, 2026

GPT-5.4 Nano is multimodal AI model from OpenAI, priced at €0.000 per 1M input tokens with a 400K tokens context window.

About this model

GPT-5.4 nano is the lightest member of the GPT-5.4 family, released alongside Mini in March 2026. 400K context window, vision input, optimized for classification, data extraction, ranking and coding subagents that handle simpler supporting tasks. A major upgrade over GPT-5 nano, designed for the subagent era of agentic workflows where orchestrator models delegate to many cheap workers.
Try GPT-5.4 Nano

0.7

Sign in to generate β€” 50 free credits on sign-up

Pricing

Price per Generation
Per generationFree

API Integration

Use our OpenAI-compatible API to integrate GPT-5.4 Nano into your application.

Install
npm install railwail
JavaScript / TypeScript
import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

// Simple β€” just pass a string
const reply = await rw.run("gpt-5-4-nano", "Hello! What can you do?");
console.log(reply);

// With message history
const reply2 = await rw.run("gpt-5-4-nano", [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);

// Full response with usage info
const res = await rw.chat("gpt-5-4-nano", [
  { role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);
Specifications
Context window
400,000 tokens
Max output
32,000 tokens
Developer
OpenAI
Category
Multimodal
Supported Formats
text
image
Tags
openai
cost-efficient
low-latency
vision
subagents
edge

Deep dive β€” OpenAI's GPT-5.4 Nano

About OpenAI
Founded 2015 Β· San Francisco, USA

OpenAI is the AI research lab founded in December 2015 by Sam Altman, Elon Musk, Greg Brockman, Ilya Sutskever and others. The company shipped GPT-1 through GPT-5 and the unified GPT-5.x line (2025-2026). GPT-5.4 nano was released alongside GPT-5.4 mini on March 17, 2026 as the smallest, cheapest variant of the GPT-5.4 generation, aimed at the 'subagent era' of agentic workflows. OpenAI is backed by Microsoft and other major investors with a 2026 valuation above $300 billion.

Visit OpenAI β†’
Architecture
Unified Transformer (small, optimized for high-throughput subagent workloads)

GPT-5.4 nano is the lightest member of the GPT-5.4 family, released March 17, 2026 as OpenAI's smallest and cheapest reasoning-capable model. It is heavily distilled from larger GPT-5.4 teacher models and trained to excel at classification, extraction, ranking and coding subagent tasks. Architecturally it retains the unified GPT-5.4 design: native text + image input, integrated 'Thinking' tier on demand, and full tool-use API including parallel tool calls. The 400K context window is unusually large for a nano-class model and supports long-document subagent work. Post-training emphasized reliability on structured outputs, JSON schema adherence and function calling, since nano-class models are typically deployed in deterministic pipelines.

Parameters
Undisclosed (estimated single-digit-billion-parameter class)
Context
400K tokens
What it can do
  • Smallest and cheapest GPT-5.4 variant
  • 400K token context window
  • Native multimodal input: text and images
  • Integrated 'Thinking' tier for occasional harder tasks
  • Strong on classification, extraction, ranking and coding subagent tasks
  • Native tool use, function calling and parallel tool calls
  • Reliable structured JSON output and schema adherence
  • Edge-grade latency suitable for real-time pipelines
  • Major upgrade over GPT-5 nano on every measured benchmark
  • Available in ChatGPT (free tier) and the OpenAI API
  • Best for: classification, data extraction, ranking, coding subagents under an orchestrator.
Training & License

Heavily distilled from larger GPT-5.4 teacher models. Pretraining uses a multi-trillion-token mixture; post-training combines supervised fine-tuning, RLHF and RL against verifiable rewards with a strong emphasis on structured-output reliability. Knowledge cutoff approximately late 2025.

License: Proprietary commercial license via OpenAI API and Azure OpenAI.

Known limitations
  • Below GPT-5.4 mini on hard reasoning and agentic coding benchmarks
  • Limited depth on niche or long-tail topics
  • No native audio or video input
  • Knowledge cutoff in late 2025
  • Thinking mode is available but slow relative to model size

Frequently asked questions

Start using GPT-5.4 Nano today

Get started with free credits. No credit card required. Access GPT-5.4 Nano and 100+ other models through a single API.