OpenAI Sora 2

Popular
OpenAI
Video Generation

OpenAI's second-generation Sora video model. Realistic motion, improved physics, audio support.

Queue video with OpenAI Sora 2
Video generation runs asynchronously — we'll queue a job and you can track it in your history.
Sign in to try this model with €5 free credits.
Sign in
Generates as an async job — typically 30 s to 2 min.
TL;DR·Last updated May 16, 2026

OpenAI Sora 2 is video generation AI model from OpenAI, priced at €0.000 per 1M input tokens with a unknown context window.

Try OpenAI Sora 2
Sign in to generate — 50 free credits on sign-up

Pricing

Price per Generation
Per generation€0.50

API Integration

Use our OpenAI-compatible API to integrate OpenAI Sora 2 into your application.

Install
npm install railwail
JavaScript / TypeScript
import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

// Simple — just pass a string
const reply = await rw.run("sora-2", "Hello! What can you do?");
console.log(reply);

// With message history
const reply2 = await rw.run("sora-2", [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);

// Full response with usage info
const res = await rw.chat("sora-2", [
  { role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);
Specifications
Price
€0.50
Developer
OpenAI
Category
Video Generation
Supported Formats
text
image
Tags
openai
sora
text-to-video
flagship
audio

Deep dive — OpenAI's OpenAI Sora 2

About OpenAI
Founded 2015 · San Francisco, USA

OpenAI was founded in December 2015 by Sam Altman, Elon Musk, Greg Brockman, Ilya Sutskever, Wojciech Zaremba, John Schulman, Andrej Karpathy and others. The company shipped Sora 1 in December 2024 inside ChatGPT and the standalone Sora app. Sora 2 launched in 2025 as the substantially upgraded successor: longer durations, much better physics, native synchronized audio (sound effects, music and dialogue) and an entirely new social product called the Sora app, which features a TikTok-like feed of user-generated AI clips and the 'cameo' feature that lets users insert their own and friends' likenesses into generated videos. Sora 2 cemented OpenAI's position alongside Google DeepMind's Veo 3 family as the frontier of consumer-accessible video generation in 2025.

Visit OpenAI →
Architecture
Diffusion Transformer on spacetime patches with joint audio-video diffusion

Sora 2 builds on the Sora 1 'spacetime patches' DiT architecture: a 3D causal VAE encodes video into a spatio-temporal latent and a transformer denoiser is trained on patches drawn from that latent. The headline architectural change is a joint audio-video diffusion pipeline that produces synchronized soundtrack -- including sound effects, ambient noise, music swells and short dialogue with lip-sync -- alongside the visual track. Sora 2 also adds an explicit 'cameo' conditioning channel that locks character identity to a user-provided reference video / images while enforcing OpenAI's identity-consent and impersonation policies. Native generation runs up to ~25-60 seconds depending on tier, at up to 1080p. Conditioning uses a GPT-family text encoder with dense recaptioning, plus optional image and video references. Public benchmark results and OpenAI's own internal evaluations report substantial gains over Sora 1 in motion fidelity, physical plausibility and prompt adherence.

Parameters
Undisclosed
Context
unknown
What it can do
  • Native synchronized audio (sound effects, ambient sound, dialogue with lip-sync)
  • Up to ~25-60 second clips at up to 1080p depending on tier
  • 'Cameo' feature: insert user / friend likeness with consent verification
  • Strong physical plausibility (water, fabric, crowds, gravity)
  • Text-to-video, image-to-video, video-to-video, in-painting and extension
  • Social Sora app with TikTok-like feed of AI clips
  • Dense GPT-class recaptioning pipeline for strong prompt adherence
  • Available via Sora app, ChatGPT Plus / Pro and limited API
  • Best for: cinematic shorts with sound, social-media content, character cameos.
Training & License

Massive curated multilingual audio-video corpus combining licensed footage, web data and partner sources, with dense synthetic audio-visual captions; exact size undisclosed.

License: Proprietary commercial licence via OpenAI terms; commercial use on paid plans subject to content policy; mandatory C2PA metadata and visible watermark on outputs; cameo feature gated by identity-verification flow.

Known limitations
  • Strict cameo / public-figure / impersonation moderation
  • Per-clip cost and queue times can be high
  • Closed model without a peer-reviewed paper
  • Audio is short-form (limited musical sophistication)
  • Some content categories blocked entirely (political ads, violent scenes)

Frequently asked questions

Start using OpenAI Sora 2 today

Get started with free credits. No credit card required. Access OpenAI Sora 2 and 100+ other models through a single API.