How much does Kuaishou Kolors cost via Railwail?

Kuaishou Kolors is billed per generation and is currently listed as free. There is no monthly minimum and no subscription, and you start with €5 in free credits.

What is the context window of Kuaishou Kolors?

Kuaishou Kolors is an image generation model, so its limit applies to the prompt: up to 256 tokens.

How fast is Kuaishou Kolors?

Latency depends on prompt length and load, typically 200 ms to 2 s for short prompts. We measure p50 and p95 latency in real time on /rankings.
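If you want to compare your own measurements with the p50/p95 figures, the sketch below computes percentiles with the nearest-rank method. The page does not say which method /rankings uses, so treat nearest-rank as an assumption; the helper name percentile and the sample latencies are made up for illustration.

```typescript
// Nearest-rank percentile: sort ascending, then take the value at
// rank ceil(p / 100 * n), counting from 1.
function percentile(samples: number[], p: number): number {
  if (samples.length === 0) {
    throw new Error("need at least one sample");
  }
  if (p <= 0 || p > 100) {
    throw new Error("p must be in the range (0, 100]");
  }
  // Copy before sorting so the caller's array is left untouched.
  const sorted = [...samples].sort((a, b) => a - b);
  const rank = Math.ceil((p / 100) * sorted.length);
  return sorted[rank - 1];
}

// Illustrative latencies in milliseconds (not real measurements).
const samplesMs = [1900, 220, 310, 450, 480, 520, 610, 900, 1200, 1500];

console.log(`p50: ${percentile(samplesMs, 50)} ms`); // p50: 520 ms
console.log(`p95: ${percentile(samplesMs, 95)} ms`); // p95: 1900 ms
```

With only ten samples the p95 is simply the slowest request, so collect far more samples before drawing conclusions.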

Is Kuaishou Kolors better than FLUX 1.1 Pro?

It depends on your use case. Kuaishou Kolors (hosted via Replicate) is the stronger choice for Chinese-language prompts and Chinese text rendering, while FLUX 1.1 Pro (Black Forest Labs) produces better photorealism. Compare them side by side at /compare/kolors-kuaishou-vs-flux-1-1-pro.

Kuaishou Kolors

Replicate

Image Generation

Kuaishou's bilingual (CN/EN) latent diffusion text-to-image model with strong text rendering.

TL;DR · Last updated June 24, 2026

Kuaishou Kolors is an image generation model developed by Kuaishou and hosted via Replicate. It is priced per generation (currently listed as free) and accepts prompts of up to 256 tokens.

Pricing

Price per Generation

Per generation: Free

API Integration

Use our OpenAI-compatible API to integrate Kuaishou Kolors into your application.

Install

npm install railwail

JavaScript / TypeScript

import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

const images = await rw.run("kolors-kuaishou", "A beautiful sunset over Tokyo");
console.log(images[0].url);

// Or use the image() method for full control
const res = await rw.image("kolors-kuaishou", "A cat in space", {
  size: "1024x1024",
  n: 1,
});
console.log(res.data[0].url);
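Because the API is described as OpenAI-compatible, the same request can presumably be sent without the SDK. The sketch below only builds the request and does not send it. The base URL https://api.example.invalid/v1, the /images/generations path and the helper name buildImageRequest are assumptions for illustration, not documented Railwail values; the body fields (model, prompt, size, n) mirror the options used in the SDK example above.

```typescript
// Placeholder base URL: NOT a documented Railwail endpoint.
const BASE_URL = "https://api.example.invalid/v1";

interface ImageRequestOptions {
  size?: string; // "WIDTHxHEIGHT", e.g. "1024x1024"
  n?: number; // number of images to generate
}

interface PreparedRequest {
  url: string;
  init: { method: string; headers: Record<string, string>; body: string };
}

// Hypothetical helper: validates options and builds an OpenAI-style
// image generation request without performing any network I/O.
function buildImageRequest(
  apiKey: string,
  model: string,
  prompt: string,
  options: ImageRequestOptions = {},
): PreparedRequest {
  const size = options.size ?? "1024x1024";
  const n = options.n ?? 1;
  if (!/^\d+x\d+$/.test(size)) {
    throw new Error(`size must look like "1024x1024", got "${size}"`);
  }
  if (!Number.isInteger(n) || n < 1) {
    throw new Error(`n must be a positive integer, got ${n}`);
  }
  return {
    url: `${BASE_URL}/images/generations`,
    init: {
      method: "POST",
      headers: {
        Authorization: `Bearer ${apiKey}`,
        "Content-Type": "application/json",
      },
      body: JSON.stringify({ model, prompt, size, n }),
    },
  };
}

const prepared = buildImageRequest("YOUR_API_KEY", "kolors-kuaishou", "A cat in space");
console.log(prepared.url);
console.log(prepared.init.body);
// Once the real base URL is known: const res = await fetch(prepared.url, prepared.init);
```

Validating size and n before the call keeps malformed requests from consuming credits.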

Specifications

Developer

Kuaishou (Kolors Team), hosted via Replicate

Deep dive — Kuaishou (Kolors Team)'s Kuaishou Kolors

About Kuaishou (Kolors Team)

Founded 2011 · Beijing, China

Kuaishou Technology (快手) is a Chinese short-video platform founded in 2011 in Beijing by Su Hua and Cheng Yixiao, listed on the Hong Kong Stock Exchange since 2021. Its AI research division, often referred to as the Kuaishou KwaiVGI or Kling AI team, has produced several generative-media foundation models including Kling (video) and Kolors (image). Kolors was open-sourced in July 2024 with weights released on Hugging Face and GitHub (Kwai-Kolors/Kolors). The model is positioned as the best open Chinese-English bilingual text-to-image model, particularly strong on Chinese-language prompts, calligraphy and culturally specific content.

Architecture

Latent diffusion U-Net with ChatGLM3-6B text encoder (bilingual Chinese-English)

Kolors is a latent diffusion text-to-image model that Kuaishou released as open source in July 2024. The architecture follows the Stable Diffusion XL family: a latent-space U-Net with cross-attention conditioning on text embeddings. Instead of the standard CLIP text encoder, it uses the much larger ChatGLM3-6B, a bilingual Chinese-English instruction-tuned LLM developed by Tsinghua KEG and Zhipu AI. This gives Kolors a strong understanding of Chinese-language prompts, idioms, calligraphy and culturally specific content (food, festivals, Chinese architecture, traditional clothing) that Western models such as SDXL and FLUX often struggle with. The U-Net has ~2.6B parameters and is trained at 1024x1024 resolution with the standard noise-prediction objective. Kuaishou also released a Kolors-Inpainting variant and a Kolors-ControlNet collection (Canny/Depth/Pose). The model is released under the Apache 2.0 license, which positions it as a leading open bilingual model for commercial use.
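To make "latent-space" concrete, the sketch below works out the tensor the U-Net would denoise for a 1024x1024 image. It assumes an SDXL-style VAE with 8x spatial downsampling and 4 latent channels; this page does not state those values for Kolors, so both constants and the helper name latentShape are assumptions.

```typescript
// Assumed SDXL-style VAE settings (not confirmed by this page).
const VAE_DOWNSAMPLE = 8;
const LATENT_CHANNELS = 4;

interface LatentShape {
  channels: number;
  height: number;
  width: number;
}

// Maps pixel dimensions to the latent tensor shape the U-Net operates on.
function latentShape(widthPx: number, heightPx: number): LatentShape {
  if (widthPx % VAE_DOWNSAMPLE !== 0 || heightPx % VAE_DOWNSAMPLE !== 0) {
    throw new Error(`dimensions must be multiples of ${VAE_DOWNSAMPLE}`);
  }
  return {
    channels: LATENT_CHANNELS,
    height: heightPx / VAE_DOWNSAMPLE,
    width: widthPx / VAE_DOWNSAMPLE,
  };
}

const shape = latentShape(1024, 1024);
const pixelValues = 3 * 1024 * 1024; // RGB image
const latentValues = shape.channels * shape.height * shape.width;

console.log(`${shape.channels}x${shape.height}x${shape.width}`); // 4x128x128
console.log(pixelValues / latentValues); // 48
```

Under these assumptions the U-Net processes 48 times fewer values than the pixel image contains, which is the main reason latent diffusion is cheaper than diffusing in pixel space.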

Parameters: ~2.6B U-Net + 6B ChatGLM3 text encoder
Context: 256 tokens
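For self-hosting, the parameter counts above give a rough lower bound on weight memory. The sketch below is back-of-the-envelope arithmetic only: it uses the approximate counts listed here, ignores the VAE, activations and framework overhead, and the helper name weightGigabytes is hypothetical.

```typescript
// Approximate parameter counts taken from the figures above.
const UNET_PARAMS = 2.6e9;
const TEXT_ENCODER_PARAMS = 6e9;

// Weight storage in decimal gigabytes for a given numeric precision.
function weightGigabytes(params: number, bytesPerParam: number): number {
  return (params * bytesPerParam) / 1e9;
}

const totalParams = UNET_PARAMS + TEXT_ENCODER_PARAMS;

console.log(`fp16: ${weightGigabytes(totalParams, 2).toFixed(1)} GB`); // fp16: 17.2 GB
console.log(`fp32: ${weightGigabytes(totalParams, 4).toFixed(1)} GB`); // fp32: 34.4 GB
console.log(`text encoder share: ${((TEXT_ENCODER_PARAMS / totalParams) * 100).toFixed(0)}%`); // 70%
```

The text encoder accounts for roughly 70% of the weights, so it is the first place to look when memory is tight.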

What it can do

Best-in-class understanding of Chinese-language prompts
Strong English performance comparable to SDXL
1024x1024 native resolution
Open weights under Apache 2.0
Excellent rendering of Chinese characters and calligraphy
Culturally accurate Chinese-food, festival, architecture and clothing imagery
Companion models: Kolors-Inpainting, Kolors-ControlNet (Canny/Depth/Pose)
Best for: Chinese-market apps, bilingual creative tools, culturally specific content, self-hosted Chinese services.

Training & License

Trained on a large mixed Chinese-English image-text corpus including high-quality Chinese-captioned data. Exact composition is not disclosed.

License: Apache 2.0 — open weights, full commercial use permitted including redistribution and derivatives.

Known limitations

Image quality below FLUX 1.1 [pro] on photorealism
Smaller model than SD3.5 Large or FLUX
Some Chinese-regulatory filters baked into training
Open weights have no integrated safety classifier

Related Models

FLUX 1.1 Pro

Black Forest Labs

Black Forest Labs' flagship text-to-image model. Faster generation than FLUX.1 Pro at higher prompt adherence, with strong photorealism and reliable spatial composition. Runs as a hosted Replicate model.

€4.00

FLUX 1.1 Pro Ultra

Black Forest Labs

FLUX 1.1 Pro in Ultra mode by Black Forest Labs. Generates up to 4 megapixel images with a raw mode for less processed, more natural-looking photography. Best FLUX option when output resolution and fine detail matter.

€5.00