Together AI

San Francisco, USAFounded 2022
9 models

Together AI provides fast inference and fine-tuning for 200+ open-source models including Llama, Qwen, DeepSeek, and Mixtral with one of the highest tokens-per-second throughputs available.

9 models from Together AI on Railwail

Access every Together AI model through Railwail's OpenAI-compatible API.

9 models available

Qwen 3 235B Instruct

Text & ChatAlibaba / Qwen
Popular

Alibaba's Qwen 3 flagship MoE: 235B total / 22B active. Strong reasoning and tool use, open-weights.

Free
qwenalibabamoe

Llama 3.2 90B Vision (multimodal)

MultimodalMeta

Meta's flagship vision-language model. 90B parameters, image understanding + chat, strong VQA performance.

Free
metallamamultimodal

Llama 3.3 70B

Text & ChatMeta

Meta's open-source 70B parameter model. Strong all-around performance with multilingual support.

Free2.5s
open-sourcepopular

Microsoft Phi-3.5 MoE Instruct

Text & ChatMicrosoft

Mixture-of-experts Phi-3.5: 42B total / 6.6B active params. 128k context, multilingual.

Free
microsoftopen-weightsmoe

Nous Hermes 3 405B

Text & ChatTogether AI

Full-parameter fine-tune of Llama 3.1 405B by Nous Research. Steerable, uncensored, strong tool use.

Free
nousopen-weightstools

Nous Hermes 3 70B

Text & ChatTogether AI

Llama-3.1-70B fine-tune from Nous Research with strong tool/agent capabilities and uncensored alignment.

Free
nousopen-weightstools

Qwen 2.5 72B

Text & ChatAlibaba / Qwen

Alibaba's powerful open-source model. Excellent at coding, math, and multilingual tasks.

Free2.5s
open-sourcecodingmultilingual

Qwen2-VL-72B Instruct

MultimodalAlibaba / Qwen

Alibaba's 72B vision-language model with M-RoPE and dynamic resolution. Strong document and video understanding.

Free
qwenalibabamultimodal

TII Falcon 180B Chat

Text & ChatTogether AI

TII's 180B causal decoder chat model fine-tuned on Ultrachat, Platypus and Airoboros.

Free
tiiopen-weightslegacy

Frequently asked questions

How is Together AI pricing handled on Railwail?
Railwail uses transparent per-call or per-token credit pricing for all Together AI models. You pay only for what you use — no monthly minimums, no upfront commitments. Pricing for every individual Together AI model is shown on its detail page.
Are there rate limits when using Together AI via Railwail?
Default rate limits depend on your account tier and the underlying Together AI capacity. Free-tier accounts get sensible defaults for development; paid accounts can request higher limits. Contact support if you need dedicated throughput or burst capacity.
Which regions does Together AI support through Railwail?
Together AI models are served from Railwail's globally distributed edge infrastructure. EU, US, and Asia-Pacific traffic is automatically routed to the nearest available provider region. GDPR-compliant EU-only routing is available on request.
Is there a sandbox or free tier to test Together AI models?
Yes — every new Railwail account receives free credits that work across all providers, including Together AI. No credit card is required to start. You can try every model in the catalog before committing to a paid plan.
Categories Together AI works in

Start building with Together AI today

Free credits on sign-up. No credit card required. Access Together AI and 27+ other providers through a single API.