Google Veo 3 Fast
Faster cheaper Veo 3 with audio
Google Veo 3 Fast is video generation AI model from Google DeepMind, priced at €0.000 per 1M input tokens with a unknown context window.
Examples
See what Google Veo 3 Fast can generate
Quick
"Timelapse of flower blooming"
Pricing
API Integration
Use our OpenAI-compatible API to integrate Google Veo 3 Fast into your application.
npm install railwailimport railwail from "railwail";
const rw = railwail("YOUR_API_KEY");
// Simple — just pass a string
const reply = await rw.run("veo-3-fast", "Hello! What can you do?");
console.log(reply);
// With message history
const reply2 = await rw.run("veo-3-fast", [
{ role: "system", content: "You are a helpful assistant." },
{ role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);
// Full response with usage info
const res = await rw.chat("veo-3-fast", [
{ role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);Deep dive — Google DeepMind's Google Veo 3 Fast
Google DeepMind is the merged AI research organisation formed in April 2023 from Google Brain and DeepMind under Demis Hassabis. The Veo video-generation programme delivered Veo 1 (May 2024), Veo 2 (December 2024) and Veo 3 (May 2025), the first major commercial video model with natively generated synchronized audio. Veo 3 Fast was launched alongside Veo 3 as the speed and cost tier: a distilled, low-step variant designed for high-volume creative iteration on Vertex AI, the Gemini API and consumer surfaces such as VideoFX and YouTube Dream Screen. It became the most widely-used Veo variant for high-throughput workflows because the cost difference versus full Veo 3 is significant while quality remains broadly cinematic.
Visit Google DeepMind →Veo 3 Fast is the distilled speed-tier variant of Veo 3. Like Veo 3 it is a latent video diffusion model with a transformer-based denoiser plus a coupled audio diffusion module that produces synchronized music, ambient sound and dialogue with lip-sync. Fast is produced via consistency-style sampler distillation and quantization, compressing the diffusion sampler into far fewer denoising steps while preserving most of Veo 3's prompt adherence, motion physics and audio-video sync. Conditioning uses Gemini-family text encoders and optional image embeddings. Native generation produces up to 8-second clips at 1080p with synchronized audio, at noticeably lower latency and per-clip cost than full Veo 3. Training is partly a distillation of Veo 3 over the same curated multilingual audio-video corpus, with auxiliary supervision tuned for sampler-step efficiency.
- Parameters
- Undisclosed
- Context
- unknown
- Speed-tier Veo 3 with native synchronized audio retained
- Up to 8-second 1080p clips at substantially lower cost
- Text-to-video and image-to-video
- Rich cinematographic prompt vocabulary
- Multilingual prompts via Gemini text encoders
- Available via Vertex AI, Gemini API, VideoFX, Whisk and YouTube Dream Screen
- SynthID audio + visual watermarking
- Materially lower latency than full Veo 3
- Best for: high-volume creative iteration, ad A/B testing, social shorts with sound.
Distilled from Veo 3 over the same curated multilingual audio-video corpus, with auxiliary supervision tuned for sampler-step efficiency and audio-video synchronization.
License: Proprietary commercial licence via Google Cloud / Vertex AI and Gemini API; commercial use under Google's generative-AI terms; mandatory SynthID watermarking.
Known limitations
- Slight quality regression vs full Veo 3 on fine detail
- 8-second native clip limit
- Audio short-form and English-leaning
- Closed model without peer-reviewed paper
- Strict moderation on people, brands and political content
Research papers
Frequently asked questions
Related Models
View all Video GenerationGoogle Veo 2
Google's state-of-the-art video generation model. Simulates real-world physics with various visual styles.
Google Veo 3
Google's Veo 3. High-fidelity text-to-video with native audio generation, up to 8s clips.
Google Veo 3.1
Latest Veo with image-to-video and context-aware audio
Kling v3
Cinematic video up to 15s with multi-shot and native audio
Start using Google Veo 3 Fast today
Get started with free credits. No credit card required. Access Google Veo 3 Fast and 100+ other models through a single API.