Kling v3 Omni

Replicate
Video Generation

Most versatile: multi-reference images, video editing, native audio

Video generation runs asynchronously: a job is queued, and you can track it in your history. Jobs typically take 30 s to 2 min.
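
Because generation is asynchronous, client code usually has to wait for the job to finish. The following is a minimal polling sketch, not the platform's documented API: the `VideoJob` shape, its status values and the `fetchJob` callback are all assumptions made for illustration, and the caller would supply `fetchJob` as a wrapper around whatever status call the service actually provides.

```typescript
// Hypothetical job shape; the real API's field names may differ.
type JobStatus = "queued" | "running" | "succeeded" | "failed";

interface VideoJob {
  id: string;
  status: JobStatus;
  videoUrl?: string; // assumed to be set once the job has succeeded
}

interface PollOptions {
  intervalMs: number; // delay between status checks
  timeoutMs: number; // give up once the next check would pass this budget
}

const sleep = (ms: number): Promise<void> =>
  new Promise((resolve) => setTimeout(resolve, ms));

// Polls `fetchJob` until the job succeeds, fails, or the timeout budget runs out.
async function waitForVideo(
  fetchJob: () => Promise<VideoJob>,
  { intervalMs, timeoutMs }: PollOptions = { intervalMs: 5000, timeoutMs: 300000 },
): Promise<VideoJob> {
  const deadline = Date.now() + timeoutMs;
  while (true) {
    const job = await fetchJob();
    if (job.status === "succeeded") return job;
    if (job.status === "failed") throw new Error(`Job ${job.id} failed`);
    if (Date.now() + intervalMs > deadline) {
      throw new Error(`Job ${job.id} still ${job.status} after ${timeoutMs} ms`);
    }
    await sleep(intervalMs);
  }
}
```

The default timeout of five minutes is an arbitrary choice that leaves headroom over the stated 30 s to 2 min range.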
TL;DR · Last updated March 25, 2026

Kling v3 Omni is a video generation AI model developed by Kuaishou Technology and served through Replicate. It is priced at €2.50 per generation rather than per token, and no context window is listed for it.

Examples

See what Kling v3 Omni can generate

Character (0:05): "Cartoon character waving and dancing"

Pricing

Price per Generation
Per generation: €2.50
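
Since billing is flat per generation, budgeting for a batch is a single multiplication. A small sketch follows, assuming the €2.50 price above applies to every clip regardless of length or inputs (the page does not say otherwise); the function name is illustrative.

```typescript
const PRICE_PER_GENERATION_EUR = 2.5; // from the price listed above

// Computes the cost of a batch in integer cents to avoid floating-point drift,
// then converts back to euros.
function estimateCostEur(generations: number): number {
  if (!Number.isInteger(generations) || generations < 0) {
    throw new RangeError("generations must be a non-negative integer");
  }
  const cents = Math.round(PRICE_PER_GENERATION_EUR * 100) * generations;
  return cents / 100;
}

console.log(estimateCostEur(4).toFixed(2)); // prints "10.00"
```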

API Integration

Use our OpenAI-compatible API to integrate Kling v3 Omni into your application.

Install
npm install railwail
JavaScript / TypeScript
import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

// Simple — just pass a string
const reply = await rw.run("kling-v3-omni", "Hello! What can you do?");
console.log(reply);

// With message history
const reply2 = await rw.run("kling-v3-omni", [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);

// Full response with usage info
const res = await rw.chat("kling-v3-omni", [
  { role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);

Specifications

Price: €2.50 per generation
Avg. latency: 120.0 s
Est. duration: 2 min
Developer: Kuaishou Technology (served via Replicate)
Category: Video Generation
Supported Formats: mp4
Tags: popular, audio, i2v, editing

Deep dive — Kuaishou Technology's Kling v3 Omni

About Kuaishou Technology
Founded 2011 · Beijing, China

Kuaishou Technology (founded 2011 by Su Hua and Cheng Yixiao) operates one of China's two dominant short-video platforms and runs the KLING AI lab. Following the success of Kling 1.x and Kling v3, the team released Kling v3 Omni in 2025 as the multimodal-conditioning variant of v3. Omni accepts not only text and a single image but also audio reference, video reference, character-reference images, sketches, depth maps and pose skeletons, making it the broadest conditioning surface among Chinese video models. Omni is exposed through klingai.com, the Kling Studio editor and the Kling API.

Architecture
Diffusion Transformer (DiT) with multimodal conditioning adapters

Kling v3 Omni shares the same core DiT denoiser as Kling v3 but adds a suite of conditioning adapters that inject heterogeneous control signals into the diffusion backbone. The model accepts text prompts (bilingual LLM encoder), first frame, end frame, subject-reference images (for locked identity), pose / skeleton videos (for motion transfer), depth maps (for 3D-aware control), sketches (for layout), audio reference (for music-driven motion and lip-sync) and short video references (for style transfer). Each modality is encoded by a dedicated adapter network and merged via cross-attention layers into the 3D spatio-temporal latent stream. Joint audio-video diffusion produces synchronized sound, including lip-sync to a provided dialogue audio track. Training uses paired multimodal-conditioning data including motion-capture libraries, sketch-to-video pairs and music videos.
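
To make the cross-attention merge concrete, here is a toy single-head sketch in which latent tokens attend over conditioning tokens from several adapters. It illustrates the general technique only: Kling's actual layers are not public, the token values are made up, and the learned query/key/value projections, multiple heads and residual connections of a real DiT block are omitted.

```typescript
type Vec = number[];
type Matrix = Vec[]; // rows are tokens, columns are feature dimensions

const dot = (a: Vec, b: Vec): number => a.reduce((sum, x, i) => sum + x * b[i], 0);

// Numerically stable softmax over one row of attention scores.
function softmax(scores: Vec): Vec {
  const max = Math.max(...scores);
  const exps = scores.map((s) => Math.exp(s - max));
  const total = exps.reduce((a, b) => a + b, 0);
  return exps.map((e) => e / total);
}

// Single-head cross-attention: each latent token (query) attends over all
// conditioning tokens, which serve directly as keys and values here.
function crossAttend(latents: Matrix, conditioning: Matrix): { output: Matrix; weights: Matrix } {
  const scale = 1 / Math.sqrt(conditioning[0].length);
  const weights = latents.map((q) => softmax(conditioning.map((k) => dot(q, k) * scale)));
  const output = weights.map((row) =>
    conditioning[0].map((_, d) => row.reduce((sum, w, j) => sum + w * conditioning[j][d], 0)),
  );
  return { output, weights };
}

// Toy adapter outputs, one token per modality (values are made up).
const textTokens: Matrix = [[1, 0, 0, 0]];
const poseTokens: Matrix = [[0, 1, 0, 0]];
const audioTokens: Matrix = [[0, 0, 1, 0]];
const conditioning: Matrix = [...textTokens, ...poseTokens, ...audioTokens];

// Two latent tokens: the first aligns with the text token, the second with the pose token.
const latents: Matrix = [[4, 0, 0, 0], [0, 4, 0, 0]];
const { output, weights } = crossAttend(latents, conditioning);
console.log(weights.map((row) => row.map((w) => w.toFixed(2))));
```

Each row of `weights` sums to 1, so every latent token receives a weighted blend of the modalities, with the largest weight on the conditioning token it aligns with most.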

Parameters: Undisclosed
Context: unknown
What it can do
  • Text-, image-, pose-, sketch-, depth-, audio- and video-reference conditioning
  • Lip-sync to a user-provided dialogue audio track
  • Motion transfer from a pose / skeleton video
  • Subject-reference for locked character identity
  • Sketch- and depth-controlled layout for storyboards
  • 1080p / 30 fps, native synchronized audio
  • Available via Kling Studio and API
  • Broadest conditioning surface among Chinese video models
  • Best for: animation studios, lip-sync content, motion-capture-driven workflows.
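
One way to keep multi-signal requests manageable is to model the conditioning inputs listed above as a typed object. The sketch below is purely hypothetical: the field names are invented for illustration and do not come from the Kling or railwail APIs, whose real request format is not shown on this page.

```typescript
// Hypothetical request shape; field names are illustrative only.
interface OmniConditioning {
  prompt: string;
  firstFrame?: string; // image URL
  endFrame?: string; // image URL
  subjectReferences?: string[]; // image URLs for locked character identity
  poseVideo?: string; // pose / skeleton video URL for motion transfer
  depthMap?: string; // depth map URL for layout control
  sketch?: string; // sketch image URL for layout control
  audioReference?: string; // dialogue or music track URL
  videoReference?: string; // short reference clip URL
}

// Returns the names of the optional conditioning signals that are actually set,
// ignoring the mandatory prompt and empty reference lists.
function activeModalities(input: OmniConditioning): string[] {
  return Object.entries(input)
    .filter(
      ([key, value]) =>
        key !== "prompt" && value !== undefined && (!Array.isArray(value) || value.length > 0),
    )
    .map(([key]) => key);
}

const request: OmniConditioning = {
  prompt: "Cartoon character waving and dancing",
  poseVideo: "dance-skeleton.mp4",
  subjectReferences: [],
};
console.log(activeModalities(request)); // prints [ 'poseVideo' ]
```

Logging the active signals before submitting a job makes it easier to see which combination was in play when a generation comes back wrong.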
Training & License

Closed corpus including platform video, licensed footage, motion-capture libraries, sketch / depth annotation pairs and music videos with dense multimodal captions.

License: Proprietary commercial license under Kling AI / Kuaishou terms.

Known limitations
  • Complex conditioning workflows can be brittle
  • Heavy political and public-figure moderation
  • Higher per-clip cost than vanilla v3
  • Audio sync is geared toward short clips and works best in English and Chinese
  • Closed model with limited public documentation

Start using Kling v3 Omni today

Get started with free credits. No credit card required. Access Kling v3 Omni and 100+ other models through a single API.