LLaVA-OneVision 72B


LLaVA-OneVision 72B from LMMs-Lab is an instruction-tuned vision-language model (VLM) that handles single-image, multi-image, and video input in one model, with task transfer across modalities.

TL;DR · Last updated May 16, 2026

LLaVA-OneVision 72B is a multimodal AI model from Replicate with a 32,768-token (32.8K) context window. It is listed at €0.000 per 1M input tokens and billed at €0.02 per generation.


Pricing

Price per Generation
Per generation: €0.02
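
As a rough illustration of flat per-generation billing, the sketch below estimates spend for a batch of requests. It assumes every request costs exactly €0.02 regardless of input size; the helper names `estimateCostCents` and `formatEuros` are hypothetical and not part of the railwail package. Amounts are kept in integer cents to avoid floating-point rounding.

```typescript
// Assumption: a flat price of €0.02 per generation, as listed in the pricing table.
const PRICE_PER_GENERATION_CENTS = 2;

// Hypothetical helper: estimated cost in euro cents for a number of generations.
function estimateCostCents(generations: number): number {
  if (!Number.isInteger(generations) || generations < 0) {
    throw new RangeError("generations must be a non-negative integer");
  }
  return generations * PRICE_PER_GENERATION_CENTS;
}

// Hypothetical helper: format integer cents as a euro amount with two decimals.
function formatEuros(cents: number): string {
  return `€${(cents / 100).toFixed(2)}`;
}

console.log(formatEuros(estimateCostCents(250))); // €5.00
```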

API Integration

Use our OpenAI-compatible API to integrate LLaVA-OneVision 72B into your application.

Install
npm install railwail
JavaScript / TypeScript
import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

// Simple: just pass a string
const reply = await rw.run("llava-onevision-72b", "Hello! What can you do?");
console.log(reply);

// With message history
const reply2 = await rw.run("llava-onevision-72b", [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);

// Full response with usage info
const res = await rw.chat("llava-onevision-72b", [
  { role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);
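
The samples above send text only. Since the model accepts images and the API is described as OpenAI-compatible, an image question might be expressed with OpenAI-style content parts. The sketch below only builds such a message; it makes no network call. Whether `rw.chat` accepts array-valued `content` is an assumption this page does not confirm, and `imageQuestion` and the example URL are hypothetical.

```typescript
// OpenAI-style content parts. Assumption: the endpoint accepts this shape.
type TextPart = { type: "text"; text: string };
type ImagePart = { type: "image_url"; image_url: { url: string } };
type ChatMessage = {
  role: "system" | "user" | "assistant";
  content: string | Array<TextPart | ImagePart>;
};

// Hypothetical helper: one user message holding a question plus one or more images.
function imageQuestion(question: string, imageUrls: string[]): ChatMessage {
  return {
    role: "user",
    content: [
      { type: "text", text: question },
      ...imageUrls.map((url): ImagePart => ({ type: "image_url", image_url: { url } })),
    ],
  };
}

// Placeholder URL for illustration only.
const message = imageQuestion("What is shown in this image?", [
  "https://example.com/cat.png",
]);
console.log(JSON.stringify(message, null, 2));
```

If the endpoint does accept this shape, the message could be passed as `[message]` in place of the text-only message array shown in the samples above.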

Specifications

Price: €0.02
Context window: 32,768 tokens
Max output: 4,096 tokens
Developer: Replicate
Category: Multimodal
Supported formats: text, image, video
Tags: replicate, multimodal, vision-understanding, llava, open-weights
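
The context window and max output limits can be combined into a simple budget check before choosing `max_tokens`. This sketch assumes the prompt and the completion share the 32,768-token window, which the page does not state, and that the caller already has a token count for the prompt. `clampMaxTokens` is a hypothetical helper.

```typescript
// Limits taken from the specifications listed above.
const CONTEXT_WINDOW_TOKENS = 32768;
const MAX_OUTPUT_TOKENS = 4096;

// Hypothetical helper. Assumption: prompt and completion share one context window.
// Returns the largest usable max_tokens value for a prompt of the given size.
function clampMaxTokens(
  promptTokens: number,
  requested: number = MAX_OUTPUT_TOKENS,
): number {
  const remaining = CONTEXT_WINDOW_TOKENS - promptTokens;
  if (remaining <= 0) {
    throw new RangeError("prompt already fills the context window");
  }
  return Math.min(requested, MAX_OUTPUT_TOKENS, remaining);
}

console.log(clampMaxTokens(1000));      // 4096
console.log(clampMaxTokens(30000));     // 2768
console.log(clampMaxTokens(1000, 500)); // 500
```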

