OpenAI Sora 2

Popular
OpenAI
Video Generation

OpenAI's second-generation Sora video model. Realistic motion, improved physics, audio support.

Queue video with OpenAI Sora 2
Video generation runs asynchronously β€” we'll queue a job and you can track it in your history.
Sign in to try this model with €5 free credits.
Sign in
Generates as an async job β€” typically 30 s to 2 min.
TL;DRΒ·Last updated May 16, 2026

OpenAI Sora 2 is video generation AI model from OpenAI, priced at €0.000 per 1M input tokens with a unknown context window.

Try OpenAI Sora 2
Sign in to generate β€” 50 free credits on sign-up

Pricing

Price per Generation
Per generation€0.50

API Integration

Use our OpenAI-compatible API to integrate OpenAI Sora 2 into your application.

Install
npm install railwail
JavaScript / TypeScript
import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

// Simple β€” just pass a string
const reply = await rw.run("sora-2", "Hello! What can you do?");
console.log(reply);

// With message history
const reply2 = await rw.run("sora-2", [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);

// Full response with usage info
const res = await rw.chat("sora-2", [
  { role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);
Specifications
Price
€0.50
Developer
OpenAI
Category
Video Generation
Supported Formats
text
image
Tags
openai
sora
text-to-video
flagship
audio

Deep dive β€” OpenAI's OpenAI Sora 2

About OpenAI
Founded 2015 Β· San Francisco, USA

OpenAI was founded in December 2015 by Sam Altman, Elon Musk, Greg Brockman, Ilya Sutskever, Wojciech Zaremba, John Schulman, Andrej Karpathy and others. The company shipped Sora 1 in December 2024 inside ChatGPT and the standalone Sora app. Sora 2 launched in 2025 as the substantially upgraded successor: longer durations, much better physics, native synchronized audio (sound effects, music and dialogue) and an entirely new social product called the Sora app, which features a TikTok-like feed of user-generated AI clips and the 'cameo' feature that lets users insert their own and friends' likenesses into generated videos. Sora 2 cemented OpenAI's position alongside Google DeepMind's Veo 3 family as the frontier of consumer-accessible video generation in 2025.

Visit OpenAI β†’
Architecture
Diffusion Transformer on spacetime patches with joint audio-video diffusion

Sora 2 builds on the Sora 1 'spacetime patches' DiT architecture: a 3D causal VAE encodes video into a spatio-temporal latent and a transformer denoiser is trained on patches drawn from that latent. The headline architectural change is a joint audio-video diffusion pipeline that produces synchronized soundtrack -- including sound effects, ambient noise, music swells and short dialogue with lip-sync -- alongside the visual track. Sora 2 also adds an explicit 'cameo' conditioning channel that locks character identity to a user-provided reference video / images while enforcing OpenAI's identity-consent and impersonation policies. Native generation runs up to ~25-60 seconds depending on tier, at up to 1080p. Conditioning uses a GPT-family text encoder with dense recaptioning, plus optional image and video references. Public benchmark results and OpenAI's own internal evaluations report substantial gains over Sora 1 in motion fidelity, physical plausibility and prompt adherence.

Parameters
Undisclosed
Context
unknown
What it can do
  • Native synchronized audio (sound effects, ambient sound, dialogue with lip-sync)
  • Up to ~25-60 second clips at up to 1080p depending on tier
  • 'Cameo' feature: insert user / friend likeness with consent verification
  • Strong physical plausibility (water, fabric, crowds, gravity)
  • Text-to-video, image-to-video, video-to-video, in-painting and extension
  • Social Sora app with TikTok-like feed of AI clips
  • Dense GPT-class recaptioning pipeline for strong prompt adherence
  • Available via Sora app, ChatGPT Plus / Pro and limited API
  • Best for: cinematic shorts with sound, social-media content, character cameos.
Training & License

Massive curated multilingual audio-video corpus combining licensed footage, web data and partner sources, with dense synthetic audio-visual captions; exact size undisclosed.

License: Proprietary commercial licence via OpenAI terms; commercial use on paid plans subject to content policy; mandatory C2PA metadata and visible watermark on outputs; cameo feature gated by identity-verification flow.

Known limitations
  • Strict cameo / public-figure / impersonation moderation
  • Per-clip cost and queue times can be high
  • Closed model without a peer-reviewed paper
  • Audio is short-form (limited musical sophistication)
  • Some content categories blocked entirely (political ads, violent scenes)

Frequently asked questions

Start using OpenAI Sora 2 today

Get started with free credits. No credit card required. Access OpenAI Sora 2 and 100+ other models through a single API.