Migrate from Together AI to Railwail
Migration Guides

Migrate from Together AI to Railwail

Switch from Together AI to Railwail. Same Llama, Mixtral, DeepSeek and Qwen models, same OpenAI-compatible API, EU hosting, EUR billing, plus 200+ other models.

Railwail Teamยท Developer Relations7 min readMay 16, 2026

TL;DR โ€” Switch in Under 5 Minutes

  • Both APIs are OpenAI-compatible โ€” change only the base URL and API key
  • All major Together models mirrored: Llama 3.3, Mixtral, DeepSeek V3, Qwen 2.5
  • EU-hosted endpoint, EUR billing โ€” clean for European entities
  • Plus access to GPT-4o, Claude, Gemini, Flux through the same key
  • Comparable per-token pricing on the open-source models

Why Move Off Together AI?

Together AI is great for hosted open-source LLMs at competitive prices. The trade-offs: US-hosted by default, USD-only billing, and the catalog is limited to open-source models. If you want Claude or GPT-4o in the same codebase, you still need separate accounts. Railwail unifies all of these under one API, one EUR invoice, and EU residency.

Step 1 โ€” Get a Railwail API Key

Sign up at railwail.com and create a key in Dashboard โ†’ API Keys.

Sponsored

Access 100+ AI Models with One API Key

GPT-4o, Claude, Gemini, Llama, Flux, DALL-E and more โ€” all through a single, OpenAI-compatible endpoint. No more juggling multiple providers.

Step 2 โ€” Change the Base URL

TypeScript / JavaScript

Before (Together AI):

import OpenAI from "openai";

const together = new OpenAI({
  apiKey: process.env.TOGETHER_API_KEY,
  baseURL: "https://api.together.xyz/v1",
});

const res = await together.chat.completions.create({
  model: "meta-llama/Llama-3.3-70B-Instruct-Turbo",
  messages: [{ role: "user", content: "Hello" }],
});
After (Railwail):
import OpenAI from "openai";

const client = new OpenAI({
  apiKey: process.env.RAILWAIL_API_KEY,
  baseURL: "https://api.railwail.com/v1",
});

const res = await client.chat.completions.create({
  model: "llama-3.3-70b-instruct",
  messages: [{ role: "user", content: "Hello" }],
});

Python

from openai import OpenAI

client = OpenAI(
    api_key=os.environ["RAILWAIL_API_KEY"],
    base_url="https://api.railwail.com/v1",
)

resp = client.chat.completions.create(
    model="llama-3.3-70b-instruct",
    messages=[{"role": "user", "content": "Hello"}],
)
print(resp.choices[0].message.content)

cURL

curl https://api.railwail.com/v1/chat/completions \
  -H "Authorization: Bearer $RAILWAIL_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llama-3.3-70b-instruct",
    "messages": [{"role": "user", "content": "Hello"}]
  }'

Step 3 โ€” Update Model IDs

Together AI uses fully-qualified Hugging Face style model IDs like meta-llama/Llama-3.3-70B-Instruct-Turbo. Railwail normalises these to short slugs (llama-3.3-70b-instruct). The long IDs are also accepted as aliases for compatibility โ€” your existing Together strings will work, but the short forms are recommended.

API Endpoint Mapping

Together AI endpoint โ†’ Railwail equivalent

Together AIRailwailNotes
POST /v1/chat/completionsPOST /v1/chat/completionsIdentical
POST /v1/completionsPOST /v1/completionsLegacy completions supported
POST /v1/embeddingsPOST /v1/embeddingsIdentical
POST /v1/images/generationsPOST /v1/images/generationsFlux, SDXL
POST /v1/audio/transcriptionsPOST /v1/audio/transcriptionsWhisper
GET /v1/modelsGET /v1/models275+ models, filter by provider
POST /v1/rerankPOST /v1/rerankCohere-compatible rerank API

Sponsored

Test Any AI Model Instantly

Our built-in playground lets you compare models side by side. Find the perfect model for your use case in minutes, not days.

Model Mapping

Together AI model โ†’ Railwail model ID

Together AIRailwailNotes
meta-llama/Llama-3.3-70B-Instruct-Turbollama-3.3-70b-instructFrontier open LLM
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbollama-3.1-405b-instructLargest open
meta-llama/Llama-3.1-8B-Instruct-Turbollama-3.1-8b-instructSmall, fast
mistralai/Mixtral-8x7B-Instruct-v0.1mixtral-8x7b-instructMoE
mistralai/Mixtral-8x22B-Instruct-v0.1mixtral-8x22b-instructLarger MoE
deepseek-ai/DeepSeek-V3deepseek-v3Frontier MoE
deepseek-ai/DeepSeek-R1deepseek-r1Reasoning
Qwen/Qwen2.5-72B-Instruct-Turboqwen2.5-72b-instructAlibaba
WhereIsAI/UAE-Large-V1uae-large-v1Embedding
black-forest-labs/FLUX.1-schnellflux-schnellImage

Pricing Comparison (per 1M tokens, May 2026)

Same open-source model, Railwail in EUR

ModelTogether AI (USD)Railwail (EUR)Notes
llama-3.3-70b-instruct$0.88EUR 0.81Identical
llama-3.1-405b-instruct$3.50EUR 3.22Identical
llama-3.1-8b-instruct$0.18EUR 0.17Identical
mixtral-8x7b-instruct$0.60EUR 0.55Identical
deepseek-v3$1.25EUR 1.15Identical
deepseek-r1$3.00 / $7.00EUR 2.76 / 6.44Input/output
qwen2.5-72b-instruct$1.20EUR 1.10Identical

Why Railwail Over Together AI

  • EU billing in EUR with VAT receipts
  • Frankfurt-region hosting for GDPR-compliant logs
  • Same OpenAI-compatible API โ€” drop-in replacement
  • Access to Claude, GPT-4o, Gemini, Mistral La Plateforme through the same key
  • Built-in playground at railwail.com/models for A/B testing
  • Comparable pricing on Together's lineup, plus per-key spend caps and rate limits

Sponsored

Pay Only for What You Use

Transparent per-token pricing with no monthly minimums. Start with free credits and scale as you grow.

FAQ

Are the Turbo variants (FP8) supported?

Yes. Railwail serves the same FP8-quantised Llama 3.x Turbo variants Together AI offers. Specify -turbo in the model slug to opt into FP8.

What about Together's batch inference?

Use Railwail's POST /v1/batches โ€” same 50% discount on async workloads.

Does the rerank API work the same?

Yes. POST /v1/rerank accepts the same query/documents/top_n payload.

Can I bring my own fine-tuned Llama?

Custom fine-tunes are not yet hosted on Railwail. You can keep them on Together AI and use Railwail for everything else.

How is latency vs Together AI?

First-token latency from EU origins is typically 30-80ms faster via Railwail's Frankfurt edge than via Together's US-default routing.

Next Steps

  • Create your Railwail account at railwail.com
  • Generate an API key in Dashboard โ†’ API Keys
  • Change baseURL to https://api.railwail.com/v1
  • Update model IDs to Railwail short slugs (or keep long Together IDs as aliases)
  • Read the full reference at railwail.com/docs
  • Compare per-token pricing at railwail.com/pricing

Railwail Team

Developer Relations

The Railwail team writes integration guides for developers migrating from single-provider AI APIs to a unified multi-model platform.

Tags:
Together AI
Migration
Llama
Mixtral
Open Source
API