TII Falcon 180B Chat
TII's 180B causal decoder chat model fine-tuned on Ultrachat, Platypus and Airoboros.
TII Falcon 180B Chat is a text & chat AI model from Together AI, priced at €0.000 per 1M input tokens with a 2.0K-token context window.
Pricing
API Integration
Use our OpenAI-compatible API to integrate TII Falcon 180B Chat into your application.
npm install railwail

import railwail from "railwail";
const rw = railwail("YOUR_API_KEY");
// Simple: just pass a string
const reply = await rw.run("falcon-180b-chat", "Hello! What can you do?");
console.log(reply);
// With message history
const reply2 = await rw.run("falcon-180b-chat", [
{ role: "system", content: "You are a helpful assistant." },
{ role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);
// Full response with usage info
const res = await rw.chat("falcon-180b-chat", [
{ role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);

Deep dive: Technology Innovation Institute (TII)'s TII Falcon 180B Chat
The Technology Innovation Institute (TII) is the applied-research pillar of Abu Dhabi's Advanced Technology Research Council (ATRC), established in 2020. TII houses dedicated centres for cryptography, autonomy, robotics, biotechnology and AI. The AI Cross-Center Unit released the Falcon 7B and 40B open-weight LLMs in mid-2023, which briefly topped the Hugging Face Open LLM Leaderboard. Falcon 180B, released September 2023, was for several months the largest openly-available LLM and a milestone in non-US frontier model development. The Falcon series, together with TII's broader open-source posture, has positioned the UAE as a serious player in sovereign AI alongside the US, EU and China. The team is led by Hakim Hacid and Ebtesam Almazrouei. Subsequent releases include Falcon 2 (2024) and Falcon 3 (late 2024).
Falcon 180B is a dense decoder-only transformer with 80 layers, hidden size 14848, 232 attention heads, and a 2,048-token context window. It uses multi-query attention (a single KV head shared across all 232 query heads), a memory-efficient design that predates grouped-query attention's popularity, together with parallel attention/MLP layers (GPT-J style), RoPE positional embeddings and FlashAttention kernels. The model was pretrained on 3.5 trillion tokens drawn primarily from RefinedWeb (TII's deduplicated and filtered Common Crawl pipeline) plus curated books, code and conversation data. Training used up to 4,096 NVIDIA A100 40GB GPUs on AWS SageMaker for approximately seven months, consuming around 7 million GPU-hours. The Chat variant is a supervised fine-tune on a mixture of Ultrachat, Platypus and Airoboros, with no RLHF stage. It was released in September 2023 under the TII Falcon License (later replaced by TII Falcon LLM License 2.0 with more permissive terms).
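To make the memory argument for multi-query attention concrete, here is a rough KV-cache estimate built only from the figures quoted above (80 layers, hidden size 14848, 232 query heads, 2,048-token context). The 2-byte cache element and the standard "one key plus one value vector per KV head, per layer, per token" layout are assumptions, and the single KV head follows this page's description rather than a checked model config, so treat the output as an order-of-magnitude sketch.

```javascript
// Rough KV-cache size estimate. Architecture numbers come from the text above;
// the 2-byte (FP16/BF16) element size is an assumption.
const LAYERS = 80;
const HIDDEN_SIZE = 14848;
const QUERY_HEADS = 232;
const CONTEXT_TOKENS = 2048;
const BYTES_PER_VALUE = 2;

// Per-head dimension implied by the hidden size and head count.
const HEAD_DIM = HIDDEN_SIZE / QUERY_HEADS;

// Each layer caches one key and one value vector per KV head for every token.
function kvCacheBytes(kvHeads, tokens = CONTEXT_TOKENS) {
  return 2 * LAYERS * kvHeads * HEAD_DIM * BYTES_PER_VALUE * tokens;
}

const toMiB = (bytes) => bytes / 2 ** 20;

const multiQuery = kvCacheBytes(1);           // one shared KV head, as described above
const multiHead = kvCacheBytes(QUERY_HEADS);  // hypothetical baseline: one KV head per query head

console.log(`head dim: ${HEAD_DIM}`);
console.log(`multi-query cache per sequence: ${toMiB(multiQuery)} MiB`);
console.log(`multi-head cache per sequence:  ${toMiB(multiHead)} MiB`);
console.log(`reduction factor: ${multiHead / multiQuery}x`);
```

Under these assumptions the cache for a full 2,048-token sequence shrinks by a factor equal to the number of query heads, which is why the design helps most when serving many concurrent sequences.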
- Parameters: 180B (dense)
- Context: 2.0K tokens
- 180B dense parameters; largest openly-available LLM at its September 2023 release
- Multi-query attention for memory-efficient inference
- Pretrained on 3.5T tokens of RefinedWeb-filtered data
- Open weights for both base and Chat variants
- Matched LLaMA 2 70B on many English benchmarks at release
- TII Falcon License permits commercial use (with conditions)
- Trained for ~7M GPU-hours on AWS SageMaker
- Best for: research baselines, legacy 2023-era deployments, sovereign AI demonstrations.
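With a 2.0K-token context, longer conversations have to be trimmed before each request. The sketch below keeps system messages and as many of the most recent turns as fit. `trimHistory` and `estimateTokens` are hypothetical helpers, not part of the railwail SDK, and the four-characters-per-token estimate is a crude assumption; a real tokenizer will give different counts.

```javascript
// Hypothetical helpers for staying inside a small context window.
// None of these names come from the railwail SDK.
const CONTEXT_TOKENS = 2048; // context window quoted above
const CHARS_PER_TOKEN = 4;   // crude assumption, not a real tokenizer

function estimateTokens(text) {
  return Math.ceil(text.length / CHARS_PER_TOKEN);
}

// Keep every system message, then add the newest turns until the budget is spent.
function trimHistory(messages, maxReplyTokens = 500, limit = CONTEXT_TOKENS) {
  const budget = limit - maxReplyTokens; // leave room for the model's reply
  const system = messages.filter((m) => m.role === "system");
  const rest = messages.filter((m) => m.role !== "system");

  let used = system.reduce((sum, m) => sum + estimateTokens(m.content), 0);
  const kept = [];
  // Walk backwards so the most recent turns survive.
  for (let i = rest.length - 1; i >= 0; i--) {
    const cost = estimateTokens(rest[i].content);
    if (used + cost > budget) break;
    kept.unshift(rest[i]);
    used += cost;
  }
  return [...system, ...kept];
}

// Usage: the oldest long turn is dropped, the system prompt and newest turns stay.
const history = [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "a".repeat(4000) },
  { role: "assistant", content: "b".repeat(4000) },
  { role: "user", content: "Summarise that in one line." },
];
console.log(trimHistory(history).map((m) => m.role));
```

The trimmed array has the same shape as the message history passed to `rw.run` or `rw.chat` in the integration example, so it could be passed in its place.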
The base model was pretrained on 3.5 trillion tokens. The mix is primarily RefinedWeb (TII's deduplicated and filtered Common Crawl), plus curated books, code, and multilingual data. Knowledge cutoff is February 2023. The Chat variant is supervised-fine-tuned on Ultrachat, Platypus and Airoboros; no RLHF stage was reported.
License: TII Falcon License (later TII Falcon LLM License 2.0). Permits commercial use and redistribution with attribution; the earlier license required royalties above $1M attributable annual revenue (removed in v2.0). Less straightforward than Apache 2.0 or MIT, so review the terms before deploying.
Known limitations
- Very short 2K context window; non-competitive in 2024+
- 180B dense weights need ~400GB GPU memory at FP16
- TII Falcon License is more restrictive than Apache 2.0 or MIT
- Chat fine-tune is weaker than newer open instructs (Llama 3.1, Mixtral, Command R)
- Limited multilingual ability; trained mostly on English
- Largely superseded by Falcon 2, Llama 3.1 405B and DeepSeek V3
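The ~400GB FP16 figure in the list above is close to what parameter count times bytes per parameter gives, plus some headroom. A back-of-the-envelope sketch follows. The non-FP16 precisions are illustrative assumptions, and the estimate covers weights only, ignoring activations, the KV cache and framework overhead.

```javascript
// Weight-only memory estimate for a 180B-parameter dense model.
// Parameter count is from the text; the precision table is an illustrative assumption.
const PARAMETERS = 180e9;
const BYTES_PER_PARAM = { fp32: 4, fp16: 2, int8: 1, int4: 0.5 };

function weightGigabytes(precision) {
  const bytes = BYTES_PER_PARAM[precision];
  if (bytes === undefined) {
    throw new Error(`unknown precision: ${precision}`);
  }
  return (PARAMETERS * bytes) / 1e9; // decimal gigabytes
}

for (const precision of Object.keys(BYTES_PER_PARAM)) {
  console.log(`${precision}: ~${weightGigabytes(precision)} GB for weights alone`);
}
```

At FP16 this comes to 360 GB for the weights, so the ~400GB quoted above leaves roughly 10% for everything else, which is tight for real serving workloads.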
Frequently asked questions
Related Models
Claude Opus 4
Anthropic's most powerful model. Exceptional at complex analysis, agentic tasks, and extended reasoning.
Claude Sonnet 4
Anthropic's most capable model. Excellent for complex analysis, coding, math, and creative writing.
DeepSeek V3.1
DeepSeek's refreshed V3.1 release. 671B MoE / 37B active. Tops open-weights leaderboards on coding and reasoning.
DeepSeek V4 Pro
DeepSeek's April 2026 flagship. 1.6T MoE / 49B active params, 1M context, rivals top closed-source models on STEM and coding at a fraction of the price.
Start using TII Falcon 180B Chat today
Get started with free credits. No credit card required. Access TII Falcon 180B Chat and 100+ other models through a single API.