Kling 1.6 Pro
Kuaishou's Kling 1.6 Pro. Premium cinematic motion and physics realism, ~$0.07/sec.
Kling 1.6 Pro is video generation AI model from Kuaishou, priced at €0.000 per 1M input tokens with a unknown context window.
Pricing
API Integration
Use our OpenAI-compatible API to integrate Kling 1.6 Pro into your application.
npm install railwailimport railwail from "railwail";
const rw = railwail("YOUR_API_KEY");
// Simple — just pass a string
const reply = await rw.run("kling-1-6-pro", "Hello! What can you do?");
console.log(reply);
// With message history
const reply2 = await rw.run("kling-1-6-pro", [
{ role: "system", content: "You are a helpful assistant." },
{ role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);
// Full response with usage info
const res = await rw.chat("kling-1-6-pro", [
{ role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);Deep dive — Kuaishou Technology's Kling 1.6 Pro
Kuaishou (Kwai outside China) is a major Chinese short-video platform founded in 2011 by Su Hua and Cheng Yixiao, often described as the principal domestic competitor to ByteDance's Douyin / TikTok. Kuaishou's AI Group (KLING team) launched the Kling video model in June 2024, instantly impressing the international community by producing 2-minute photorealistic clips before Sora was publicly available. The Pro tier (Kling 1.0 Pro, 1.5 Pro, 1.6 Pro) is positioned as Kuaishou's premium video model with higher resolution, better prompt adherence and longer durations than the Standard tier. Kling 1.6 Pro (released late 2024) was widely cited as the strongest publicly accessible Chinese video model at its time of launch, ahead of Hailuo and earlier Wan releases.
Visit Kuaishou Technology →Kling is a closed-source latent video diffusion model that Kuaishou describe as a Diffusion-Transformer (DiT) operating on a 3D spatio-temporal Variational Autoencoder. The 1.6 Pro variant generates clips up to 10 seconds at 1080p / 30 fps natively, with extension chains supporting up to 2 minutes. Text conditioning uses a bilingual large-language-model text encoder; image conditioning supports both first frame and end frame for guided shots. The architecture is reported to use 3D attention with full spatio-temporal mixing rather than factorised space/time attention, yielding stronger motion and identity coherence. Training uses a curated video corpus drawn in part from Kuaishou's vast platform data with synthetic dense captions; post-training reportedly combines reward-model alignment for aesthetics with caption-fidelity supervision and selective reinforcement on human-judged quality.
- Parameters
- Undisclosed
- Context
- unknown
- 1080p / 30 fps clips up to 10 seconds natively, up to 2 minutes with extension
- First-frame and end-frame image conditioning
- Strong bilingual prompt understanding (Chinese, English)
- Director-style camera-control vocabulary
- Lip-sync, motion-brush and elements features in Kling editor
- Available via klingai.com web app, mobile app and API
- Strong character consistency across extensions
- Highly cinematic look on portrait and product shots
- Best for: cinematic shorts, fashion, product video, music videos.
Closed corpus including licensed video, web video and Kuaishou platform data with bilingual dense captions; exact dataset size undisclosed.
License: Proprietary commercial licence via Kling AI / Kuaishou terms; commercial use permitted on paid tiers.
Known limitations
- 10-second native clip limit, longer durations require extension chaining
- Heavy political and public-figure moderation on Chinese surfaces
- No native audio in 1.6 Pro (audio added in v3)
- Closed model with limited technical documentation
- Output queue times can be long during peak hours
Research papers
Frequently asked questions
Related Models
View all Video GenerationGoogle Veo 2
Google's state-of-the-art video generation model. Simulates real-world physics with various visual styles.
Google Veo 3
Google's Veo 3. High-fidelity text-to-video with native audio generation, up to 8s clips.
Google Veo 3.1
Latest Veo with image-to-video and context-aware audio
Kling v3
Cinematic video up to 15s with multi-shot and native audio
Start using Kling 1.6 Pro today
Get started with free credits. No credit card required. Access Kling 1.6 Pro and 100+ other models through a single API.