Florence-2 Large

Microsoft
Multimodal

Microsoft Florence-2 Large. Unified prompt-based vision foundation model for captioning, detection, segmentation and OCR with a single 770M-param backbone.

Try Florence-2 Large now
Send a single prompt and stream a response inline. Hit Cmd+Enter to submit.
Sign in to try this model with €5 free credits.
Sign in
Press Cmd+Enter to send
Response appears here.
TL;DRΒ·Last updated May 16, 2026

Florence-2 Large is multimodal AI model from Microsoft, priced at €0.000 per 1M input tokens with a unknown context window.

Try Florence-2 Large

0.7

Sign in to generate β€” 50 free credits on sign-up

Pricing

Price per Generation
Per generation€0.008

API Integration

Use our OpenAI-compatible API to integrate Florence-2 Large into your application.

Install
npm install railwail
JavaScript / TypeScript
import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

// Simple β€” just pass a string
const reply = await rw.run("florence-2-large", "Hello! What can you do?");
console.log(reply);

// With message history
const reply2 = await rw.run("florence-2-large", [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);

// Full response with usage info
const res = await rw.chat("florence-2-large", [
  { role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);
Specifications
Price
€0.008
Developer
Microsoft
Category
Multimodal
Supported Formats
text
image
Tags
replicate
multimodal
vision-understanding
microsoft
open-weights

Frequently asked questions

Start using Florence-2 Large today

Get started with free credits. No credit card required. Access Florence-2 Large and 100+ other models through a single API.