Yi Large
01.AI's larger general-purpose chat model with 32k context window and strong bilingual performance.
Yi Large is text & chat AI model from Custom, priced at €3.00 per 1M input tokens with a 32.8K tokens context window.
0.7
Pricing
API Integration
Use our OpenAI-compatible API to integrate Yi Large into your application.
npm install railwailimport railwail from "railwail";
const rw = railwail("YOUR_API_KEY");
// Simple — just pass a string
const reply = await rw.run("yi-large", "Hello! What can you do?");
console.log(reply);
// With message history
const reply2 = await rw.run("yi-large", [
{ role: "system", content: "You are a helpful assistant." },
{ role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);
// Full response with usage info
const res = await rw.chat("yi-large", [
{ role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);Deep dive — 01.AI (Lingyi Wanwu)'s Yi Large
01.AI (零一万物, Lingyi Wanwu) was founded in March 2023 by Kai-Fu Lee — former president of Google China, founder of Sinovation Ventures and a globally recognised AI investor and author. The lab launched the Yi-6B and Yi-34B open-weights base models in November 2023, which briefly led the Hugging Face Open LLM Leaderboard. 01.AI followed with Yi-9B (2024), Yi-VL (vision, 2024) and the closed-API flagship Yi Large in May 2024. The company raised $200M+ in seed funding in 2023 at a reported $1B valuation, with investors including Alibaba Cloud, Sinovation Ventures and others. 01.AI's positioning emphasises strong bilingual Chinese-English performance, data-quality-focused training and pragmatic product applications.
Visit 01.AI (Lingyi Wanwu) →Yi Large is a dense decoder-only transformer following the modern LLM recipe — grouped-query attention, RoPE positional embeddings, SwiGLU activations and RMSNorm. 01.AI has been less specific about Yi Large's exact size than about the open Yi-34B/9B/6B variants. The Yi technical report describes the training methodology in detail: a heavily filtered bilingual Chinese-English data pipeline using CCNet-style quality filtering, aggressive deduplication and curated additions of books, code and scientific content. The Yi-34B base was trained on 3.1T tokens; Yi Large is reported to scale further. Post-training is supervised fine-tuning plus RLHF with proprietary specifics. Yi Large is closed-API only — only the smaller Yi base and chat variants (Yi-34B, Yi-9B, Yi-6B) have open weights under the Yi Series Models Community License Agreement. Released May 2024 via 01.AI Platform and partner clouds.
- Parameters
- Undisclosed (estimated ~100B based on tier sibling Yi-34B and contemporaneous public details)
- Context
- 32K tokens
- Top-tier bilingual Chinese-English performance
- Frequent #1 or #2 ranking on SuperCLUE Chinese leaderboards
- Top-10 LMSYS Chatbot Arena ranking at release in 2024
- Strong Chinese reasoning, math, writing and exam-style tasks
- Claimed parity with GPT-4 on selected English benchmarks at release
- 32K context window
- Available via 01.AI Platform and hosted API
- Best for: Chinese-market applications, bilingual content workflows, Chinese exam and education AI.
Trained on heavily filtered bilingual Chinese-English web data, books, code and academic content. Smaller Yi-34B base used 3.1T tokens; Yi Large is reported to scale further. Data pipeline uses CCNet-style quality filtering plus aggressive deduplication (described in the Yi technical report). Knowledge cutoff approximately early 2024. Post-training is supervised fine-tuning plus RLHF.
License: Yi Large itself is closed-API only under 01.AI's commercial terms of service. The open Yi base / chat variants (Yi-34B, Yi-9B, Yi-6B) use the Yi Series Models Community License Agreement, which permits commercial use after free registration with 01.AI.
Known limitations
- Closed weights — Yi Large itself is API-only
- 32K context lags 128K+ frontier models
- No vision modality (separate Yi-VL line covers multimodal)
- Tool-use / function-calling polish behind GPT-4 / Claude / Command R+
- Filters politically sensitive Chinese topics consistent with regulations
- Update cadence less transparent than US labs
Frequently asked questions
Related Models
View all Text & ChatClaude Opus 4
Anthropic's most powerful model. Exceptional at complex analysis, agentic tasks, and extended reasoning.
Claude Sonnet 4
Anthropic's most capable model. Excellent for complex analysis, coding, math, and creative writing.
DeepSeek V3.1
DeepSeek's refreshed V3.1 release. 671B MoE / 37B active. Tops open-weights leaderboards on coding and reasoning.
DeepSeek V4 Pro
DeepSeek's April 2026 flagship. 1.6T MoE / 49B active params, 1M context, rivals top closed-source models on STEM and coding at a fraction of the price.
Start using Yi Large today
Get started with free credits. No credit card required. Access Yi Large and 100+ other models through a single API.