How much does TII Falcon 180B Chat cost via Railwail?

No monthly minimum, no subscription. Start with €5 free credits.

What is the context window of TII Falcon 180B Chat?

TII Falcon 180B Chat supports a 2.0K tokens context window — enough for short prompts and chat.

How fast is TII Falcon 180B Chat?

Latency depends on prompt length and load — typically 200ms to 2s for short prompts. We measure p50/p95 in real-time on /rankings.

Is TII Falcon 180B Chat better than Claude Opus 4?

It depends on your use case. TII Falcon 180B Chat (Together AI) and Claude Opus 4 (Anthropic) are both strong choices in text & chat. Compare them side-by-side at /compare/falcon-180b-chat-vs-claude-opus-4.

TII Falcon 180B Chat

Name: TII Falcon 180B Chat
Brand: Together AI
SKU: falcon-180b-chat
Availability: InStock

Together AI

Text & Chat

TII's 180B causal decoder chat model fine-tuned on Ultrachat, Platypus and Airoboros.

Try TII Falcon 180B Chat now

Send a single prompt and stream a response inline. Hit Cmd+Enter to submit.

Press Cmd+Enter to send

Response appears here.

TL;DR·Last updated May 16, 2026

TII Falcon 180B Chat is text & chat AI model from Together AI, priced at €0.000 per 1M input tokens with a 2.0K tokens context window.

Try TII Falcon 180B Chat

System Prompt

Message

Temperature

0.7

Max Tokens

Pricing

Price per Generation

Per generationFree

API Integration

Use our OpenAI-compatible API to integrate TII Falcon 180B Chat into your application.

Install

npm install railwail

JavaScript / TypeScript

import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

// Simple — just pass a string
const reply = await rw.run("falcon-180b-chat", "Hello! What can you do?");
console.log(reply);

// With message history
const reply2 = await rw.run("falcon-180b-chat", [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);

// Full response with usage info
const res = await rw.chat("falcon-180b-chat", [
  { role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);

Specifications

Context window

2,048 tokens

Max output

2,048 tokens

Developer

Together AI

Deep dive — Technology Innovation Institute (TII)'s TII Falcon 180B Chat

About Technology Innovation Institute (TII)

Founded 2020 · Abu Dhabi, United Arab Emirates

The Technology Innovation Institute (TII) is the applied-research pillar of Abu Dhabi's Advanced Technology Research Council (ATRC), established in 2020. TII houses dedicated centres for cryptography, autonomy, robotics, biotechnology and AI. The AI Cross-Center Unit released the Falcon 7B and 40B open-weight LLMs in mid-2023, which briefly topped the Hugging Face Open LLM Leaderboard. Falcon 180B, released September 2023, was for several months the largest openly-available LLM and a milestone in non-US frontier model development. The Falcon series — and TII's broader open-source posture — has positioned the UAE as a serious player in sovereign AI alongside the US, EU and China. The team is led by Hakim Hacid and Ebtesam Almazrouei. Subsequent releases include Falcon 2 (2024) and Falcon 3 (late 2024).

Visit Technology Innovation Institute (TII) →

Architecture

Decoder-only Transformer

Falcon 180B is a dense decoder-only transformer with 80 layers, hidden size 14848, 232 attention heads, and a 2,048-token context window. It uses multi-query attention (a single KV head shared across all 232 query heads) — a memory-efficient design that predates grouped-query attention's popularity — together with parallel attention/MLP layers (GPT-J style), RoPE positional embeddings and FlashAttention kernels. The model was pretrained on 3.5 trillion tokens drawn primarily from RefinedWeb (TII's deduplicated and filtered Common Crawl pipeline) plus curated books, code and conversation data. Training used 4,096 NVIDIA A100 40GB GPUs on AWS SageMaker for approximately seven months, consuming around 7 million GPU-hours. The Chat variant is a supervised fine-tune on a mixture of Ultrachat, Platypus and OpenAssistant — no RLHF stage. Released September 2023 under the TII Falcon License (later replaced by TII Falcon LLM License 2.0 with more permissive terms).

Parameters: 180B (dense)
Context: 2.0K tokens

What it can do

180B dense parameters — largest openly-available LLM at September 2023 release
Multi-query attention for memory-efficient inference
Pretrained on 3.5T tokens of RefinedWeb-filtered data
Open weights for both base and Chat variants
Matched LLaMA 2 70B on many English benchmarks at release
TII Falcon License permits commercial use (with conditions)
Trained for ~7M GPU-hours on AWS SageMaker
Best for: research baselines, legacy 2023-era deployments, sovereign AI demonstrations.

Training & License

Pretrained on 3.5 trillion tokens. The mix is primarily RefinedWeb (TII's deduplicated and filtered Common Crawl), plus curated books, code, and multilingual data. Knowledge cutoff is February 2023. The Chat variant is supervised-fine-tuned on Ultrachat, Platypus and OpenAssistant; no RLHF stage was reported.

License: TII Falcon License (later TII Falcon LLM License 2.0). Permits commercial use and redistribution with attribution; earlier license required royalties above $1M attributable annual revenue (removed in v2.0). Less straightforward than Apache 2.0 or MIT — review terms before deploying.

Known limitations

Very short 2K context window — non-competitive in 2024+
180B dense weights need ~400GB GPU memory at FP16
TII Falcon License is more restrictive than Apache 2.0 or MIT
Chat fine-tune is weaker than newer open instructs (Llama 3.1, Mixtral, Command R)
Limited multilingual ability — trained mostly on English
Largely superseded by Falcon 2, Llama 3.1 405B and DeepSeek V3

Research papers

Frequently asked questions

Related Models

View all Text & Chat

Claude Opus 4

Anthropic

Anthropic's most powerful model. Exceptional at complex analysis, agentic tasks, and extended reasoning.

Free

Claude Sonnet 4

Anthropic

Anthropic's most capable model. Excellent for complex analysis, coding, math, and creative writing.

Free

DeepSeek V3.1

DeepSeek

DeepSeek's refreshed V3.1 release. 671B MoE / 37B active. Tops open-weights leaderboards on coding and reasoning.

Free

DeepSeek V4 Pro

DeepSeek

DeepSeek's April 2026 flagship. 1.6T MoE / 49B active params, 1M context, rivals top closed-source models on STEM and coding at a fraction of the price.

Free

Start using TII Falcon 180B Chat today

Get started with free credits. No credit card required. Access TII Falcon 180B Chat and 100+ other models through a single API.

Get Started Free Browse All Models

TII Falcon 180B Chat

Pricing

API Integration

Deep dive — Technology Innovation Institute (TII)'s TII Falcon 180B Chat

Research papers

Frequently asked questions

What is TII Falcon 180B Chat?

How much does TII Falcon 180B Chat cost via Railwail?

What is the context window of TII Falcon 180B Chat?

How fast is TII Falcon 180B Chat?

Is TII Falcon 180B Chat better than Claude Opus 4?

Related Models

Claude Opus 4

Claude Sonnet 4

DeepSeek V3.1

DeepSeek V4 Pro

Start using TII Falcon 180B Chat today