Grok Imagine Video
xAI video with native audio and lip-sync, up to 15s
Grok Imagine Video is video generation AI model from Replicate, priced at €0.000 per 1M input tokens with a unknown context window.
Image References
Examples
See what Grok Imagine Video can generate
Fun
"Golden retriever catching frisbee in slow motion"
Pricing
API Integration
Use our OpenAI-compatible API to integrate Grok Imagine Video into your application.
npm install railwailimport railwail from "railwail";
const rw = railwail("YOUR_API_KEY");
// Simple — just pass a string
const reply = await rw.run("grok-video", "Hello! What can you do?");
console.log(reply);
// With message history
const reply2 = await rw.run("grok-video", [
{ role: "system", content: "You are a helpful assistant." },
{ role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);
// Full response with usage info
const res = await rw.chat("grok-video", [
{ role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);Deep dive — xAI's Grok Imagine Video
xAI was founded in March 2023 by Elon Musk and a team of researchers from OpenAI, DeepMind, Google, Microsoft and Tesla, including Igor Babuschkin, Tony Wu, Christian Szegedy and Jimmy Ba. The company's stated mission is to 'understand the true nature of the universe' and it operates closely with X (formerly Twitter) and Tesla. xAI launched Grok-1 in November 2023 (later open-sourced), followed by Grok-1.5, Grok-2 and Grok-3 with native image generation (Aurora) and image-to-video (Grok Imagine). xAI runs the Colossus supercluster in Memphis (originally 100k H100s, expanded to 200k+ by 2025) and was valued above $50B in late 2024.
Visit xAI →Grok Imagine Video is xAI's image-to-video generation feature shipped inside the Grok app and on X (Twitter). It is a closed model and xAI has published almost no technical detail beyond it being a 'native video generation' system that complements the Aurora image generator. Industry reporting and behaviour suggest a latent video diffusion architecture with a transformer denoiser, conditioned on text embeddings (likely from a Grok-family text encoder) and on an initial image. The product targets short social-media clips (6-15 seconds) at HD resolution with optional NSFW 'spicy' mode that bypasses many of the moderation filters used by competitors. Audio support (sound effects, ambient noise) was added in late 2025. Training data are presumed to draw heavily from xAI's privileged access to X video uploads, though xAI have not confirmed this.
- Parameters
- Undisclosed
- Context
- unknown
- Image-to-video and text-to-video for short social clips
- Tight integration with X (Twitter) and the Grok app
- NSFW / 'spicy' mode (within posted guidelines) unlike most rivals
- Native audio (sound effects, ambient) since late 2025
- Fast generation tuned for Colossus hardware
- Available to X Premium / SuperGrok subscribers
- Strong meme and pop-culture coverage
- Mobile-first UX
- Best for: meme content, social shorts, viral clips, rapid prototyping.
Undisclosed; presumed to include licensed footage, web video and X platform uploads. xAI have not released a model card or data sheet.
License: Proprietary commercial licence under X / xAI terms of service.
Known limitations
- Short clip duration (typically 6-15 seconds)
- No public model card or technical paper
- Quality and consistency below Veo 3 / Sora 2 in independent tests
- Subscription-gated, no general API at launch
- Permissive moderation has drawn criticism
Research papers
Frequently asked questions
Related Models
View all Video GenerationGoogle Veo 2
Google's state-of-the-art video generation model. Simulates real-world physics with various visual styles.
Google Veo 3
Google's Veo 3. High-fidelity text-to-video with native audio generation, up to 8s clips.
Google Veo 3.1
Latest Veo with image-to-video and context-aware audio
Kling v3
Cinematic video up to 15s with multi-shot and native audio
Start using Grok Imagine Video today
Get started with free credits. No credit card required. Access Grok Imagine Video and 100+ other models through a single API.