Google Veo 3 Fast

Google DeepMind
Video Generation

Faster, cheaper Veo 3 with audio

Queue video with Google Veo 3 Fast
Video generation runs asynchronously: we'll queue a job and you can track it in your history.
Generates as an async job, typically in 30 s to 2 min.
TL;DR · Last updated March 25, 2026

Google Veo 3 Fast is a video generation AI model from Google DeepMind, priced at €3.20 per generation; the listing shows €0.000 per 1M input tokens and an unknown context window.


Examples

See what Google Veo 3 Fast can generate

Quick (0:06): "Timelapse of flower blooming"

Pricing

Price per Generation
Per generation: €3.20
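
Because pricing is flat per generation, a batch budget is simple multiplication. The sketch below assumes only the €3.20 figure listed above; the function names are hypothetical and the amounts are kept in integer cents to avoid floating-point rounding.

```typescript
// €3.20 per generation, taken from the pricing listed above.
const PRICE_PER_GENERATION_CENTS = 320;

// Total cost in cents for a number of generations (hypothetical helper).
function estimateCostCents(generations: number): number {
  if (!Number.isInteger(generations) || generations < 0) {
    throw new RangeError("generations must be a non-negative integer");
  }
  return generations * PRICE_PER_GENERATION_CENTS;
}

// Format an integer cent amount as euros with two decimals.
function formatEuros(cents: number): string {
  return `€${(cents / 100).toFixed(2)}`;
}

console.log(formatEuros(estimateCostCents(25))); // €80.00
```

For example, an A/B test with 25 variants would cost €80.00 at the listed price, before any retries.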

API Integration

Use our OpenAI-compatible API to integrate Google Veo 3 Fast into your application.

Install
npm install railwail
JavaScript / TypeScript
import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

// Simple: just pass a string
const reply = await rw.run("veo-3-fast", "Hello! What can you do?");
console.log(reply);

// With message history
const reply2 = await rw.run("veo-3-fast", [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);

// Full response with usage info
const res = await rw.chat("veo-3-fast", [
  { role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);
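
The page says video generation is queued as an async job that typically takes 30 s to 2 min, but it does not show how a job is tracked through the API. The following is a minimal polling sketch under stated assumptions: `JobSnapshot`, its `status` values, the `videoUrl` field, and the `fetchStatus` callback are all hypothetical and are not part of the documented `railwail` client. You would supply `fetchStatus` with whatever status call the service actually provides.

```typescript
// Hypothetical job states; the real API may use different names.
type JobStatus = "queued" | "running" | "succeeded" | "failed";

interface JobSnapshot {
  status: JobStatus;
  videoUrl?: string; // hypothetical field holding the finished mp4 location
}

// Delay before the next poll: starts at 2 s, grows 1.5x per attempt, capped at 10 s.
function pollDelayMs(attempt: number, baseMs = 2000, capMs = 10000): number {
  return Math.min(capMs, Math.round(baseMs * Math.pow(1.5, attempt)));
}

// Poll until the job succeeds, fails, or the timeout elapses.
async function waitForJob(
  fetchStatus: () => Promise<JobSnapshot>,
  timeoutMs = 5 * 60 * 1000,
  sleep: (ms: number) => Promise<void> = (ms) =>
    new Promise<void>((resolve) => setTimeout(resolve, ms)),
): Promise<JobSnapshot> {
  const deadline = Date.now() + timeoutMs;
  for (let attempt = 0; Date.now() < deadline; attempt++) {
    const snapshot = await fetchStatus();
    if (snapshot.status === "succeeded") return snapshot;
    if (snapshot.status === "failed") throw new Error("video job failed");
    await sleep(pollDelayMs(attempt));
  }
  throw new Error("timed out waiting for video job");
}
```

The default five-minute timeout is an arbitrary margin over the stated 30 s to 2 min range; the capped backoff keeps the number of status requests small for jobs that take about a minute.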
Specifications
Price: €3.20
Avg. latency: 59.0 s
Est. duration: 59 s
Developer: Google DeepMind
Category: Video Generation
Supported formats: mp4
Tags: fast, audio

Deep dive: Google DeepMind's Google Veo 3 Fast

About Google DeepMind
Founded 2010 · London, United Kingdom

Google DeepMind is the merged AI research organisation formed in April 2023 from Google Brain and DeepMind under Demis Hassabis. The Veo video-generation programme delivered Veo 1 (May 2024), Veo 2 (December 2024) and Veo 3 (May 2025), the first major commercial video model with natively generated synchronized audio. Veo 3 Fast was launched alongside Veo 3 as the speed and cost tier: a distilled, low-step variant designed for high-volume creative iteration on Vertex AI, the Gemini API and consumer surfaces such as VideoFX and YouTube Dream Screen. It became the most widely used Veo variant for high-throughput workflows because it costs significantly less than full Veo 3 while quality remains broadly cinematic.

Architecture
Distilled latent video diffusion / DiT speed tier with joint audio-video diffusion

Veo 3 Fast is the distilled speed-tier variant of Veo 3. Like Veo 3, it is a latent video diffusion model with a transformer-based denoiser and a coupled audio diffusion module that produces synchronized music, ambient sound and lip-synced dialogue. The Fast variant is produced via consistency-style sampler distillation and quantization, which compress the diffusion sampler into far fewer denoising steps while preserving most of Veo 3's prompt adherence, motion physics and audio-video sync. Conditioning uses Gemini-family text encoders and optional image embeddings. Native generation produces clips of up to 8 seconds at 1080p with synchronized audio, at noticeably lower latency and per-clip cost than full Veo 3. Training is partly a distillation of Veo 3 over the same curated multilingual audio-video corpus, with auxiliary supervision tuned for sampler-step efficiency.
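
To make the link between denoising steps and latency concrete, here is a toy one-dimensional sketch. It is not Veo's sampler and does not perform distillation: it is a generic deterministic Euler sampler over a linearly decreasing noise schedule, with a stand-in denoiser, and every name in it is hypothetical. It only illustrates that each step costs one denoiser call, so cutting the step count cuts compute proportionally.

```typescript
// A denoiser maps a noisy value and its noise level to a clean estimate.
type Denoiser = (x: number, sigma: number) => number;

// Deterministic Euler sampler: walks the noise level from sigmaMax down to 0
// in `steps` equal increments, calling the denoiser once per step.
function eulerSample(
  denoise: Denoiser,
  xStart: number,
  sigmaMax: number,
  steps: number,
): { x: number; calls: number } {
  let x = xStart;
  let calls = 0;
  for (let i = 0; i < steps; i++) {
    const sigma = sigmaMax * (1 - i / steps);
    const sigmaNext = sigmaMax * (1 - (i + 1) / steps);
    const predicted = denoise(x, sigma);
    calls++;
    const slope = (x - predicted) / sigma; // direction away from the clean estimate
    x += (sigmaNext - sigma) * slope; // Euler step toward lower noise
  }
  return { x, calls };
}

// Toy denoiser that always predicts the same clean value (an idealised stand-in).
const toyDenoiser: Denoiser = () => 0.25;

const manySteps = eulerSample(toyDenoiser, 3, 1, 50);
const fewSteps = eulerSample(toyDenoiser, 3, 1, 4);
console.log(manySteps.calls, fewSteps.calls); // 50 4
```

With this idealised denoiser both runs land on the same value, which is exactly what a real model cannot do without help: the purpose of sampler distillation, as described above, is to train the fast model so that a few-step run stays close to the many-step result.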

Parameters: Undisclosed
Context: unknown
What it can do
  • Speed-tier Veo 3 with native synchronized audio retained
  • Up to 8-second 1080p clips at substantially lower cost
  • Text-to-video and image-to-video
  • Rich cinematographic prompt vocabulary
  • Multilingual prompts via Gemini text encoders
  • Available via Vertex AI, Gemini API, VideoFX, Whisk and YouTube Dream Screen
  • SynthID audio + visual watermarking
  • Materially lower latency than full Veo 3
  • Best for: high-volume creative iteration, ad A/B testing, social shorts with sound.
Training & License

Distilled from Veo 3 over the same curated multilingual audio-video corpus, with auxiliary supervision tuned for sampler-step efficiency and audio-video synchronization.

License: Proprietary commercial licence via Google Cloud / Vertex AI and Gemini API; commercial use under Google's generative-AI terms; mandatory SynthID watermarking.

Known limitations
  • Slight quality regression vs full Veo 3 on fine detail
  • 8-second native clip limit
  • Audio is short-form and English-leaning
  • Closed model without peer-reviewed paper
  • Strict moderation on people, brands and political content
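
One practical consequence of the 8-second native clip limit is that longer pieces have to be planned as several generations. The sketch below assumes only the 8-second limit and the €3.20 per-generation price listed on this page; `planClips` is a hypothetical helper, and stitching the clips together (and keeping them visually consistent) is outside its scope.

```typescript
const MAX_CLIP_SECONDS = 8; // native clip limit listed above
const PRICE_PER_CLIP_CENTS = 320; // €3.20 per generation, from the pricing section

// Split a target duration into clip lengths that each fit the native limit.
function planClips(totalSeconds: number, maxClip = MAX_CLIP_SECONDS): number[] {
  if (!(totalSeconds > 0) || !(maxClip > 0)) {
    throw new RangeError("durations must be positive");
  }
  const clips: number[] = [];
  let remaining = totalSeconds;
  while (remaining > 0) {
    const length = Math.min(maxClip, remaining);
    clips.push(length);
    remaining -= length;
  }
  return clips;
}

const plan = planClips(20);
console.log(plan); // [ 8, 8, 4 ]
console.log(plan.length * PRICE_PER_CLIP_CENTS); // 960 (cents)
```

A 20-second sequence therefore needs three generations, which comes to €9.60 at the listed price if every clip is accepted on the first attempt.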


Start using Google Veo 3 Fast today

Get started with free credits. No credit card required. Access Google Veo 3 Fast and 100+ other models through a single API.