Llama 3.3 70B

Together
Text & Chat

Meta's open-source 70B-parameter model, with strong all-around performance and multilingual support.

Examples

See what Llama 3.3 70B can generate

Comparison Essay

Compare and contrast REST and GraphQL APIs in terms of flexibility and performance

REST APIs use fixed endpoints that return predetermined data structures, which is simple but can lead to over-fetching (getting more data than needed) or under-fetching (requiring multiple requests). GraphQL lets clients request exactly the fields they need in a single query, offering superior flexibility. However, REST benefits from better HTTP caching, simpler error handling, and wider tooling support. GraphQL shines in complex applications with varied data needs, while REST remains ideal for straightforward CRUD operations and public APIs.
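The trade-off above can be sketched concretely. A minimal illustration (the `/users` resource and its fields are hypothetical, invented only for this example): a REST endpoint returns its full predetermined payload, while a GraphQL client names exactly the fields it wants, including related data, in one query.

```javascript
// REST: the server decides the response shape, so the client may over-fetch.
// GET /users/42 might return every field even if only `name` is needed.
const restResponse = {
  id: 42,
  name: "Ada",
  email: "ada@example.com",
  address: { city: "London" },
  createdAt: "2024-01-01",
};
const nameOnly = restResponse.name; // the rest of the payload is wasted

// Under-fetching is the mirror problem: getting Ada's posts would need a
// second request (GET /users/42/posts), i.e. two round trips.

// GraphQL: the client declares exactly the fields it needs, and can pull
// related data in the same round trip.
const graphqlRequest = {
  query: `
    query {
      user(id: 42) {
        name
        posts { title }
      }
    }
  `,
};

console.log(nameOnly);
console.log(graphqlRequest.query.includes("posts"));
```

The REST response is simple to cache by URL, which is part of why REST keeps the edge for straightforward public APIs.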

Explain Like I'm 5

Explain how the internet works to a 5-year-old

Imagine you want to send a drawing to your friend who lives far away. You put your drawing in a magic mailbox, and the mailbox breaks it into tiny puzzle pieces. These pieces zoom through special tunnels underground and even under the ocean! When all the pieces reach your friend's magic mailbox, it puts the puzzle back together. That's basically how the internet works—your computer breaks messages into tiny pieces, sends them through wires, and the other computer puts them back together super fast.


Pricing

Per generation: Free

API Integration

Use our OpenAI-compatible API to integrate Llama 3.3 70B into your application.

Install
npm install railwail
JavaScript / TypeScript
import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

// Simple: just pass a string
const reply = await rw.run("llama-3-3-70b", "Hello! What can you do?");
console.log(reply);

// With message history
const reply2 = await rw.run("llama-3-3-70b", [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);

// Full response with usage info
const res = await rw.chat("llama-3-3-70b", [
  { role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);
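Because the API is OpenAI-compatible, you can also call it with a plain HTTP request instead of the SDK. A sketch, assuming the request body follows the standard OpenAI chat-completions shape; the endpoint URL below is a placeholder, not a documented value, so check your dashboard for the real one.

```javascript
// Build a standard OpenAI-style chat-completions payload.
function buildChatRequest(model, messages, options = {}) {
  return {
    model,
    messages,
    temperature: options.temperature ?? 0.7,
    max_tokens: options.max_tokens ?? 500,
  };
}

const body = buildChatRequest("llama-3-3-70b", [
  { role: "user", content: "Hello!" },
]);

// Sending it looks like any OpenAI-compatible endpoint
// (URL is a placeholder; substitute the one from your dashboard):
//
// const res = await fetch("https://api.example.com/v1/chat/completions", {
//   method: "POST",
//   headers: {
//     "Content-Type": "application/json",
//     Authorization: `Bearer ${process.env.RAILWAIL_API_KEY}`,
//   },
//   body: JSON.stringify(body),
// });
// const data = await res.json();
// console.log(data.choices[0].message.content);

console.log(body.model);
```

This is also the path to reusing existing OpenAI client libraries: most accept a custom base URL and will serialize the same payload shape for you.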
Specifications

Context window: 131,072 tokens
Max output: 4,096 tokens
Avg. latency: 2.5s
Provider: Together AI
Category: Text & Chat
Tags: open-source, popular

Start using Llama 3.3 70B today

Get started with free credits. No credit card required. Access Llama 3.3 70B and 100+ other models through a single API.