Stable Audio 2

Udio
Text-to-Speech

Stability AI's Stable Audio 2.0. Text-to-music up to 3 minutes of full-length, structured tracks at 44.1 kHz.

Speak with Stable Audio 2
Type any text and hear it spoken in a chosen voice.
Sign in to try this model with €5 free credits.
Sign in
Audio player appears here.
TL;DR·Last updated May 16, 2026

Stable Audio 2 is text-to-speech AI model from Udio, priced at €0.000 per 1M input tokens with a unknown context window.

Try Stable Audio 2

1x

Direct API access coming soon

Pricing

Price per Generation
Per generationFree

API Integration

Use our OpenAI-compatible API to integrate Stable Audio 2 into your application.

Install
npm install railwail
JavaScript / TypeScript
import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

// Simple — just pass a string
const reply = await rw.run("stable-audio-2", "Hello! What can you do?");
console.log(reply);

// With message history
const reply2 = await rw.run("stable-audio-2", [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);

// Full response with usage info
const res = await rw.chat("stable-audio-2", [
  { role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);
Specifications
Developer
Udio
Category
Text-to-Speech
Supported Formats
text
Tags
stability
music-generation
pricing-tbd

Frequently asked questions

Start using Stable Audio 2 today

Get started with free credits. No credit card required. Access Stable Audio 2 and 100+ other models through a single API.