Kuaishou Kolors

Replicate
Image Generation

Kuaishou's bilingual (CN/EN) latent diffusion text-to-image model with strong text rendering.

Generate with Kuaishou Kolors
Describe what you want and pick a size β€” the image renders inline.
Sign in to try this model with €5 free credits.
Sign in
Result appears here
TL;DRΒ·Last updated May 16, 2026

Kuaishou Kolors is image generation AI model from Replicate, priced at €0.000 per 1M input tokens with a unknown context window.

Try Kuaishou Kolors
Sign in to generate β€” 50 free credits on sign-up

Pricing

Price per Generation
Per generationFree

API Integration

Use our OpenAI-compatible API to integrate Kuaishou Kolors into your application.

Install
npm install railwail
JavaScript / TypeScript
import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

const images = await rw.run("kolors-kuaishou", "A beautiful sunset over Tokyo");
console.log(images[0].url);

// Or use the image() method for full control
const res = await rw.image("kolors-kuaishou", "A cat in space", {
  size: "1024x1024",
  n: 1,
});
console.log(res.data[0].url);
Specifications
Developer
Replicate
Category
Image Generation
Supported Formats
text
Tags
kuaishou
text-to-image
open-weights
chinese
pricing-tbd

Deep dive β€” Kuaishou (Kolors Team)'s Kuaishou Kolors

About Kuaishou (Kolors Team)
Founded 2011 Β· Beijing, China

Kuaishou Technology (快手) is a Chinese short-video platform founded in 2011 in Beijing by Su Hua and Cheng Yixiao, listed on the Hong Kong Stock Exchange since 2021. Its AI research division, often referred to as the Kuaishou KwaiVGI or Kling AI team, has produced several generative-media foundation models including Kling (video) and Kolors (image). Kolors was open-sourced in July 2024 with weights released on Hugging Face and GitHub (Kwai-Kolors/Kolors). The model is positioned as the best open Chinese-English bilingual text-to-image model, particularly strong on Chinese-language prompts, calligraphy and culturally specific content.

Visit Kuaishou (Kolors Team) β†’
Architecture
Latent diffusion U-Net with ChatGLM3-6B text encoder (bilingual Chinese-English)

Kolors is a latent diffusion text-to-image model released open-source by Kuaishou in July 2024. The architecture follows the Stable Diffusion XL family β€” a latent-space U-Net with cross-attention conditioning on text embeddings β€” but replaces the standard CLIP text encoder with the much larger ChatGLM3-6B, a bilingual Chinese-English instruction-tuned LLM developed by Tsinghua KEG and Zhipu AI. This gives Kolors strong understanding of Chinese-language prompts, idioms, calligraphy and culturally specific content (food, festivals, Chinese architecture, traditional clothing) that Western models like SDXL and FLUX often struggle with. The U-Net has ~2.6B parameters and is trained at 1024x1024 resolution with the standard noise-prediction objective. Kuaishou also released a Kolors-Inpainting variant and a Kolors-ControlNet collection (Canny/Depth). The model is open-source under the Apache 2.0 license, making it the leading commercial-use open bilingual model.

Parameters
~2.6B U-Net + 6B ChatGLM3 text encoder
Context
256 tokens
What it can do
  • Best-in-class understanding of Chinese-language prompts
  • Strong English performance comparable to SDXL
  • 1024x1024 native resolution
  • Open weights under Apache 2.0
  • Excellent rendering of Chinese characters and calligraphy
  • Culturally accurate Chinese-food, festival, architecture and clothing imagery
  • Companion models: Kolors-Inpainting, Kolors-ControlNet (Canny/Depth/Pose)
  • Best for: Chinese-market apps, bilingual creative tools, culturally specific content, self-hosted Chinese services.
Training & License

Trained on a large mixed Chinese-English image-text corpus including high-quality Chinese-captioned data. Exact composition is not disclosed.

License: Apache 2.0 β€” open weights, full commercial use permitted including redistribution and derivatives.

Known limitations
  • Image quality below FLUX 1.1 [pro] on photorealism
  • Smaller model than SD3.5 Large or FLUX
  • Some Chinese-regulatory filters baked into training
  • Open weights have no integrated safety classifier

Frequently asked questions

Start using Kuaishou Kolors today

Get started with free credits. No credit card required. Access Kuaishou Kolors and 100+ other models through a single API.