Nous Hermes 3 70B

Together AI
Text & Chat

Llama-3.1-70B fine-tune from Nous Research with strong tool/agent capabilities and uncensored alignment.

Try Nous Hermes 3 70B now
Send a single prompt and stream a response inline. Hit Cmd+Enter to submit.
Sign in to try this model with €5 free credits.
Sign in
Press Cmd+Enter to send
Response appears here.
TL;DR·Last updated May 16, 2026

Nous Hermes 3 70B is text & chat AI model from Together AI, priced at €0.000 per 1M input tokens with a 131.1K tokens context window.

Try Nous Hermes 3 70B

0.7

Sign in to generate — 50 free credits on sign-up

Pricing

Price per Generation
Per generationFree

API Integration

Use our OpenAI-compatible API to integrate Nous Hermes 3 70B into your application.

Install
npm install railwail
JavaScript / TypeScript
import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

// Simple — just pass a string
const reply = await rw.run("hermes-3-70b", "Hello! What can you do?");
console.log(reply);

// With message history
const reply2 = await rw.run("hermes-3-70b", [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);

// Full response with usage info
const res = await rw.chat("hermes-3-70b", [
  { role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);
Specifications
Context window
131,072 tokens
Max output
4,096 tokens
Developer
Together AI
Category
Text & Chat
Supported Formats
text
Tags
nous
open-weights
tools
roleplay
pricing-tbd

Deep dive — Nous Research's Nous Hermes 3 70B

About Nous Research
Founded 2023 · San Francisco, USA (distributed)

Nous Research is a community-driven open-source AI collective founded in 2023, co-led by Karan 'Teknium' Malhotra, Jeffrey Quesnelle and Bowen Peng with a distributed contributor base. Nous focuses on uncensored, steerable, character-rich fine-tunes of open base models. The Hermes line (Hermes, OpenHermes, Hermes 2, Hermes 2.5, Hermes 3) is its flagship instruction-following family. Hermes 3 70B is the mid-sized variant of the August 2024 Hermes 3 release, fine-tuned from Llama 3.1 70B for steerable behaviour with native tool-use and scratchpad tags — the sweet spot for organisations that can run a 70B model on a small GPU cluster.

Visit Nous Research →
Architecture
Decoder-only Transformer (Llama 3.1 architecture)

Hermes 3 70B is a full-parameter supervised fine-tune of Meta's Llama 3.1 70B base. The architecture is identical to Llama 3.1 70B: 80 layers, 8,192 hidden size, 64-head grouped-query attention with 8 KV heads, RoPE positional embeddings with Llama 3 scaling (128K context), SwiGLU activations, and the 128,000-token Llama 3 BPE tokeniser. The fine-tune mixes role-play, function calling, code, math, RAG, agent traces and creative writing — largely Nous-curated synthetic data distilled from larger models. The model uses ChatML formatting with native `<tool_call>` JSON-schema tags and `<scratchpad>` chain-of-thought tags. A light DPO preference-optimisation stage was applied (unlike the 405B which is SFT-only). Released August 2024 under the Llama 3.1 Community License.

Parameters
70B (dense)
Context
128K tokens
What it can do
  • Full-parameter fine-tune of Llama 3.1 70B with light DPO
  • Excellent system-prompt steerability for agents and characters
  • Native ChatML `<tool_call>` and `<scratchpad>` tags
  • More tractable to self-host than the 405B (~140GB FP16, ~35GB INT4)
  • Strong instruction following and creative writing
  • 128K context inherited from Llama 3.1
  • Comparable to Llama 3.1 70B Instruct on benchmarks with friendlier tool-use format
  • Best for: self-hosted assistants, tool-using agents, creative platforms, fine-tuning base for community projects.
Training & License

Supervised fine-tuning on ~390M instruction tokens across ~2.5M examples covering role-play, function calling, code, math, RAG, agent traces and creative writing — largely Nous-curated synthetic data. A light DPO preference-optimisation pass was applied. Base knowledge cutoff December 2023.

License: Llama 3.1 Community License. Commercial use permitted, but services with >700M monthly active users require a separate Meta license. Meta's Acceptable Use Policy applies to all derivatives.

Known limitations
  • Reduced safety guardrails versus Meta's Llama 3.1 70B Instruct
  • Less capable than the 405B variant on hard reasoning, math and code
  • No vision modality
  • Knowledge cutoff inherited from Llama 3.1 (December 2023)
  • Llama 3.1 license restricts services with >700M MAU

Frequently asked questions

Start using Nous Hermes 3 70B today

Get started with free credits. No credit card required. Access Nous Hermes 3 70B and 100+ other models through a single API.