How much does Yi Large cost via Railwail?

Input: €3.00 per 1M tokens. Output: €3.00 per 1M tokens. No monthly minimum, no subscription. Start with €5 free credits.

What is the context window of Yi Large?

Yi Large supports a 32.8K tokens context window — enough for long books, technical manuals, and extended analysis.

How fast is Yi Large?

Latency depends on prompt length and load — typically 200ms to 2s for short prompts. We measure p50/p95 in real-time on /rankings.

Is Yi Large better than Bio_ClinicalBERT?

It depends on your use case. Yi Large (Custom) and Bio_ClinicalBERT (huggingface) are both strong choices in text & chat. Compare them side-by-side at /compare/yi-large-vs-bio-clinicalbert.

Yi Large

Name: Yi Large
Brand: Custom
SKU: yi-large
Price: 0.003 EUR
Availability: InStock

Custom

Text & Chat

01.AI's larger general-purpose chat model with 32k context window and strong bilingual performance.

Try Yi Large now

Send a single prompt and stream a response inline. Hit Cmd+Enter to submit.

Press Cmd+Enter to send

Response appears here.

TL;DR·Last updated June 24, 2026

Yi Large is text & chat AI model from Custom, priced at €3.00 per 1M input tokens with a 32.8K tokens context window.

Try Yi Large

System Prompt

Message

Temperature

0.7

Max Tokens

Direct API access coming soon

Pricing

Price per Generation

Per generationFree

API Integration

Use our OpenAI-compatible API to integrate Yi Large into your application.

Install

npm install railwail

JavaScript / TypeScript

import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

// Simple — just pass a string
const reply = await rw.run("yi-large", "Hello! What can you do?");
console.log(reply);

// With message history
const reply2 = await rw.run("yi-large", [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);

// Full response with usage info
const res = await rw.chat("yi-large", [
  { role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);

Specifications

Context window

32,768 tokens

Max output

4,096 tokens

Developer

Custom

Deep dive — 01.AI (Lingyi Wanwu)'s Yi Large

About 01.AI (Lingyi Wanwu)

Founded 2023 · Beijing, China

01.AI (零一万物, Lingyi Wanwu) was founded in March 2023 by Kai-Fu Lee — former president of Google China, founder of Sinovation Ventures and a globally recognised AI investor and author. The lab launched the Yi-6B and Yi-34B open-weights base models in November 2023, which briefly led the Hugging Face Open LLM Leaderboard. 01.AI followed with Yi-9B (2024), Yi-VL (vision, 2024) and the closed-API flagship Yi Large in May 2024. The company raised $200M+ in seed funding in 2023 at a reported $1B valuation, with investors including Alibaba Cloud, Sinovation Ventures and others. 01.AI's positioning emphasises strong bilingual Chinese-English performance, data-quality-focused training and pragmatic product applications.

Visit 01.AI (Lingyi Wanwu) →

Architecture

Decoder-only Transformer (proprietary)

Yi Large is a dense decoder-only transformer following the modern LLM recipe — grouped-query attention, RoPE positional embeddings, SwiGLU activations and RMSNorm. 01.AI has been less specific about Yi Large's exact size than about the open Yi-34B/9B/6B variants. The Yi technical report describes the training methodology in detail: a heavily filtered bilingual Chinese-English data pipeline using CCNet-style quality filtering, aggressive deduplication and curated additions of books, code and scientific content. The Yi-34B base was trained on 3.1T tokens; Yi Large is reported to scale further. Post-training is supervised fine-tuning plus RLHF with proprietary specifics. Yi Large is closed-API only — only the smaller Yi base and chat variants (Yi-34B, Yi-9B, Yi-6B) have open weights under the Yi Series Models Community License Agreement. Released May 2024 via 01.AI Platform and partner clouds.

Parameters: Undisclosed (estimated ~100B based on tier sibling Yi-34B and contemporaneous public details)
Context: 32K tokens

What it can do

Top-tier bilingual Chinese-English performance
Frequent #1 or #2 ranking on SuperCLUE Chinese leaderboards
Top-10 LMSYS Chatbot Arena ranking at release in 2024
Strong Chinese reasoning, math, writing and exam-style tasks
Claimed parity with GPT-4 on selected English benchmarks at release
32K context window
Available via 01.AI Platform and hosted API
Best for: Chinese-market applications, bilingual content workflows, Chinese exam and education AI.

Training & License

Trained on heavily filtered bilingual Chinese-English web data, books, code and academic content. Smaller Yi-34B base used 3.1T tokens; Yi Large is reported to scale further. Data pipeline uses CCNet-style quality filtering plus aggressive deduplication (described in the Yi technical report). Knowledge cutoff approximately early 2024. Post-training is supervised fine-tuning plus RLHF.

License: Yi Large itself is closed-API only under 01.AI's commercial terms of service. The open Yi base / chat variants (Yi-34B, Yi-9B, Yi-6B) use the Yi Series Models Community License Agreement, which permits commercial use after free registration with 01.AI.

Known limitations

Closed weights — Yi Large itself is API-only
32K context lags 128K+ frontier models
No vision modality (separate Yi-VL line covers multimodal)
Tool-use / function-calling polish behind GPT-4 / Claude / Command R+
Filters politically sensitive Chinese topics consistent with regulations
Update cadence less transparent than US labs

Research papers

Frequently asked questions

Related Models

View all Text & Chat

Bio_ClinicalBERT

huggingface

The original Bio_ClinicalBERT from Alsentzer et al., a BERT model initialized from BioBERT and further pretrained on all MIMIC-III clinical notes. Served as a fill-mask endpoint it predicts masked tokens in clinical text and produces clinical embeddings. It is the standard encoder backbone behind many downstream clinical NLP fine-tunes.

€1.00

Biomedical NER (all entities)

huggingface

Token-classification model from d4data that tags 84 biomedical entity types in clinical and medical text, including disease, sign, symptom, medication, dosage, lab value, body part and procedure. Trained on the Maccrobat clinical case corpus on a DistilBERT base, so it runs cheaply for high-volume tagging.

€1.00

Claude Opus 4

Anthropic

Anthropic's most powerful model. Exceptional at complex analysis, agentic tasks, and extended reasoning.

Free

Claude Opus 4.8