Kokoro TTS 82M
Open-weights 82M-parameter TTS. Punches above its size class on naturalness benchmarks at a fraction of the inference cost of larger models.
Kokoro TTS 82M is text-to-speech AI model from Replicate, priced at €0.000 per 1M input tokens with a unknown context window.
1x
Pricing
API Integration
Use our OpenAI-compatible API to integrate Kokoro TTS 82M into your application.
npm install railwailimport railwail from "railwail";
const rw = railwail("YOUR_API_KEY");
// Simple — just pass a string
const reply = await rw.run("kokoro-tts-82m", "Hello! What can you do?");
console.log(reply);
// With message history
const reply2 = await rw.run("kokoro-tts-82m", [
{ role: "system", content: "You are a helpful assistant." },
{ role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);
// Full response with usage info
const res = await rw.chat("kokoro-tts-82m", [
{ role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);Frequently asked questions
Related Models
View all Text-to-SpeechElevenLabs Multilingual V2
ElevenLabs' most natural-sounding TTS model. Supports 29 languages with emotional range.
AudioCraft
Meta's AudioCraft framework wrapping MusicGen, AudioGen and EnCodec. Unified text-to-audio research toolkit for music and sound effects.
AudioLDM 2
Latent-diffusion model for general-purpose text-to-audio. Generates speech, music, and sound effects with a unified prior.
Cartesia Sonic
Cartesia's ultra-low-latency TTS (~90ms TTFB). State-space model with voice cloning support.
Start using Kokoro TTS 82M today
Get started with free credits. No credit card required. Access Kokoro TTS 82M and 100+ other models through a single API.