Snowflake Arctic Instruct

Custom
Text & Chat

Snowflake's open MoE model: 480B total / 17B active params with dense+MoE hybrid architecture.

Try Snowflake Arctic Instruct now
Send a single prompt and stream a response inline. Hit Cmd+Enter to submit.
Sign in to try this model with €5 free credits.
Sign in
Press Cmd+Enter to send
Response appears here.
TL;DRΒ·Last updated May 16, 2026

Snowflake Arctic Instruct is text & chat AI model from Custom, priced at €0.000 per 1M input tokens with a 4.1K tokens context window.

Try Snowflake Arctic Instruct

0.7

Direct API access coming soon

Pricing

Price per Generation
Per generationFree

API Integration

Use our OpenAI-compatible API to integrate Snowflake Arctic Instruct into your application.

Install
npm install railwail
JavaScript / TypeScript
import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

// Simple β€” just pass a string
const reply = await rw.run("arctic-instruct", "Hello! What can you do?");
console.log(reply);

// With message history
const reply2 = await rw.run("arctic-instruct", [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);

// Full response with usage info
const res = await rw.chat("arctic-instruct", [
  { role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);
Specifications
Context window
4,096 tokens
Max output
4,096 tokens
Developer
Custom
Category
Text & Chat
Supported Formats
text
Tags
snowflake
moe
open-weights
enterprise
pricing-tbd

Deep dive β€” Snowflake AI Research's Snowflake Arctic Instruct

About Snowflake AI Research
Founded 2023 Β· San Mateo, California, USA

Snowflake AI Research is the applied-AI team inside the Snowflake data cloud company, established as a focused unit in 2023 under Yuxiong He (former Microsoft DeepSpeed lead) and Samyam Rajbhandari. Snowflake itself is a public NYSE company (founded 2012) headquartered in Bozeman, Montana with engineering primarily in California. The AI Research team built Arctic to demonstrate that enterprise-targeted LLMs (SQL, code, instruction following) could be trained cheaply by combining many-small-expert MoE designs with carefully curated training data. Arctic Instruct was released April 2024 under Apache 2.0, with the team publicly reporting a training cost of approximately $2M β€” a fraction of comparable Western frontier runs.

Visit Snowflake AI Research β†’
Architecture
Hybrid Dense-MoE Transformer

Snowflake Arctic Instruct is a hybrid dense-MoE transformer released April 2024 under Apache 2.0. The architecture combines a 10B dense transformer with a 128-way MoE component (128 experts of ~3.66B parameters each, top-2 routing) for a total of 480B parameters and 17B active per forward pass. Compared to peer MoEs (Mixtral 8x22B, DBRX) Arctic uses many more, smaller experts and a high expert-count-to-active-param ratio β€” a deliberate choice to maximise specialisation for enterprise SQL and code tasks at low active compute. Training used a three-stage curriculum on roughly 3.5T tokens of web, code, GitHub, StackExchange and Snowflake-curated enterprise SQL data, with the final stage heavily oversampling SQL and structured outputs. Snowflake reported total training compute of approximately $2M on a cluster of around 3,200 H100 GPUs. Post-training was supervised fine-tuning plus DPO on enterprise instruction data; no full RLHF pipeline was reported.

Parameters
480B total, 17B active per token (10B dense + 128 x 3.66B experts, top-2 routing)
Context
4.1K tokens
What it can do
  • Hybrid 10B-dense + 128-expert MoE architecture
  • 17B active parameters out of 480B total β€” cheap inference for an MoE this large
  • Strong text-to-SQL performance (matches or beats Llama 3 70B on Spider, BIRD)
  • Solid enterprise code generation in Python, Java, SQL
  • Instruction following tuned for structured (JSON) outputs
  • Apache 2.0 open weights β€” fully permissive commercial use
  • Reported $2M training cost β€” milestone in cheap-to-train frontier MoE
  • Best for: enterprise SQL generation, structured extraction, self-hosted business assistants.
Training & License

Pretrained on approximately 3.5 trillion tokens in a three-stage curriculum: broad web data, code-and-math-heavy mix, and a final SQL-and-enterprise-instruction-heavy phase. Sources include filtered Common Crawl, GitHub, StackExchange, books, and Snowflake-curated enterprise SQL corpora. Knowledge cutoff is early 2024. Post-training is supervised fine-tuning plus DPO on enterprise instruction data.

License: Apache 2.0 for both base and Instruct weights. Commercial use, redistribution and modification permitted without royalty.

Known limitations
  • Very short 4K context window for a 2024 release
  • Weaker on open-ended creative writing than Llama 3 70B Instruct
  • Multilingual support limited β€” trained primarily on English
  • Reasoning chains shorter than dedicated reasoning models
  • No vision or audio input

Frequently asked questions

Start using Snowflake Arctic Instruct today

Get started with free credits. No credit card required. Access Snowflake Arctic Instruct and 100+ other models through a single API.