TII Falcon 180B Chat

Together AI
Text & Chat

TII's 180B causal decoder chat model fine-tuned on Ultrachat, Platypus and Airoboros.

Try TII Falcon 180B Chat now
Send a single prompt and stream a response inline. Hit Cmd+Enter to submit.
Sign in to try this model with €5 free credits.
Sign in
Press Cmd+Enter to send
Response appears here.
TL;DRΒ·Last updated May 16, 2026

TII Falcon 180B Chat is text & chat AI model from Together AI, priced at €0.000 per 1M input tokens with a 2.0K tokens context window.

Try TII Falcon 180B Chat

0.7

Sign in to generate β€” 50 free credits on sign-up

Pricing

Price per Generation
Per generationFree

API Integration

Use our OpenAI-compatible API to integrate TII Falcon 180B Chat into your application.

Install
npm install railwail
JavaScript / TypeScript
import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

// Simple β€” just pass a string
const reply = await rw.run("falcon-180b-chat", "Hello! What can you do?");
console.log(reply);

// With message history
const reply2 = await rw.run("falcon-180b-chat", [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);

// Full response with usage info
const res = await rw.chat("falcon-180b-chat", [
  { role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);
Specifications
Context window
2,048 tokens
Max output
2,048 tokens
Developer
Together AI
Category
Text & Chat
Supported Formats
text
Tags
tii
open-weights
legacy
pricing-tbd

Deep dive β€” Technology Innovation Institute (TII)'s TII Falcon 180B Chat

About Technology Innovation Institute (TII)
Founded 2020 Β· Abu Dhabi, United Arab Emirates

The Technology Innovation Institute (TII) is the applied-research pillar of Abu Dhabi's Advanced Technology Research Council (ATRC), established in 2020. TII houses dedicated centres for cryptography, autonomy, robotics, biotechnology and AI. The AI Cross-Center Unit released the Falcon 7B and 40B open-weight LLMs in mid-2023, which briefly topped the Hugging Face Open LLM Leaderboard. Falcon 180B, released September 2023, was for several months the largest openly-available LLM and a milestone in non-US frontier model development. The Falcon series β€” and TII's broader open-source posture β€” has positioned the UAE as a serious player in sovereign AI alongside the US, EU and China. The team is led by Hakim Hacid and Ebtesam Almazrouei. Subsequent releases include Falcon 2 (2024) and Falcon 3 (late 2024).

Visit Technology Innovation Institute (TII) β†’
Architecture
Decoder-only Transformer

Falcon 180B is a dense decoder-only transformer with 80 layers, hidden size 14848, 232 attention heads, and a 2,048-token context window. It uses multi-query attention (a single KV head shared across all 232 query heads) β€” a memory-efficient design that predates grouped-query attention's popularity β€” together with parallel attention/MLP layers (GPT-J style), RoPE positional embeddings and FlashAttention kernels. The model was pretrained on 3.5 trillion tokens drawn primarily from RefinedWeb (TII's deduplicated and filtered Common Crawl pipeline) plus curated books, code and conversation data. Training used 4,096 NVIDIA A100 40GB GPUs on AWS SageMaker for approximately seven months, consuming around 7 million GPU-hours. The Chat variant is a supervised fine-tune on a mixture of Ultrachat, Platypus and OpenAssistant β€” no RLHF stage. Released September 2023 under the TII Falcon License (later replaced by TII Falcon LLM License 2.0 with more permissive terms).

Parameters
180B (dense)
Context
2.0K tokens
What it can do
  • 180B dense parameters β€” largest openly-available LLM at September 2023 release
  • Multi-query attention for memory-efficient inference
  • Pretrained on 3.5T tokens of RefinedWeb-filtered data
  • Open weights for both base and Chat variants
  • Matched LLaMA 2 70B on many English benchmarks at release
  • TII Falcon License permits commercial use (with conditions)
  • Trained for ~7M GPU-hours on AWS SageMaker
  • Best for: research baselines, legacy 2023-era deployments, sovereign AI demonstrations.
Training & License

Pretrained on 3.5 trillion tokens. The mix is primarily RefinedWeb (TII's deduplicated and filtered Common Crawl), plus curated books, code, and multilingual data. Knowledge cutoff is February 2023. The Chat variant is supervised-fine-tuned on Ultrachat, Platypus and OpenAssistant; no RLHF stage was reported.

License: TII Falcon License (later TII Falcon LLM License 2.0). Permits commercial use and redistribution with attribution; earlier license required royalties above $1M attributable annual revenue (removed in v2.0). Less straightforward than Apache 2.0 or MIT β€” review terms before deploying.

Known limitations
  • Very short 2K context window β€” non-competitive in 2024+
  • 180B dense weights need ~400GB GPU memory at FP16
  • TII Falcon License is more restrictive than Apache 2.0 or MIT
  • Chat fine-tune is weaker than newer open instructs (Llama 3.1, Mixtral, Command R)
  • Limited multilingual ability β€” trained mostly on English
  • Largely superseded by Falcon 2, Llama 3.1 405B and DeepSeek V3

Frequently asked questions

Start using TII Falcon 180B Chat today

Get started with free credits. No credit card required. Access TII Falcon 180B Chat and 100+ other models through a single API.