How much does Physical Intelligence π-0 cost via Railwail?

No monthly minimum, no subscription. Start with €5 free credits.

What is the context window of Physical Intelligence π-0?

Physical Intelligence π-0 supports a unknown context window — enough for typical AI workloads.

How fast is Physical Intelligence π-0?

Latency depends on prompt length and load — typically 200ms to 2s for short prompts. We measure p50/p95 in real-time on /rankings.

Is Physical Intelligence π-0 better than Gemini Robotics (2025)?

It depends on your use case. Physical Intelligence π-0 (Physical Intelligence) and Gemini Robotics (2025) (Google DeepMind) are both strong choices in vla / robotics. Compare them side-by-side at /compare/pi-0-pi-vs-gemini-robotics-2025.

Does Physical Intelligence π-0 support image input (vision)?

Yes — Physical Intelligence π-0 accepts image inputs in addition to text. Send images via the standard OpenAI-compatible `messages` array with `image_url` content blocks. Supported formats: image, text.

Physical Intelligence π-0

Name: Physical Intelligence π-0
Brand: Custom
SKU: pi-0-pi
Availability: InStock

Physical Intelligence

VLA / Robotics

Physical Intelligence's flagship VLA flow-matching policy. Generalist robot control, pretrained on 10k+ hrs robot data.

Research-only model

Physical Intelligence π-0 runs on physical robot hardware and is not exposed via the Railwail API yet.

Not API-accessible

Read the research

TL;DR·Last updated June 24, 2026

Physical Intelligence π-0 is vla / robotics AI model from Physical Intelligence, priced at €0.000 per 1M input tokens with a unknown context window.

Try Physical Intelligence π-0

System Prompt

Message

Temperature

0.7

Max Tokens

Direct API access coming soon

Pricing

Price per Generation

Per generationFree

API Integration

Use our OpenAI-compatible API to integrate Physical Intelligence π-0 into your application.

Install

npm install railwail

JavaScript / TypeScript

import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

// Simple — just pass a string
const reply = await rw.run("pi-0-pi", "Hello! What can you do?");
console.log(reply);

// With message history
const reply2 = await rw.run("pi-0-pi", [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);

// Full response with usage info
const res = await rw.chat("pi-0-pi", [
  { role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);

Specifications

Developer

Physical Intelligence

Deep dive — Physical Intelligence (PI)'s Physical Intelligence π-0

About Physical Intelligence (PI)

Founded 2024 · San Francisco, California, USA

Physical Intelligence (PI) is a San Francisco robot-foundation-model startup founded in 2024 by Sergey Levine, Chelsea Finn, Karol Hausman, Brian Ichter, Suraj Nair, Lachy Groom and others, with the explicit mission of building general-purpose foundation models for robots. The company raised $400M in early 2025 (led by Jeff Bezos, Thrive Capital, Lux Capital and others) at a multi-billion-dollar valuation. π-0 ('pi-zero') is PI's flagship VLA model, announced in October 2024 as a generalist robot policy that can drive many different robots through dexterous manipulation tasks - folding laundry, bussing tables, packing boxes - using a single set of weights and natural-language instructions. The model is published with a public technical report, and a subset of weights and code are released under the open-source 'openpi' GitHub repository to support reproducible research.

Visit Physical Intelligence (PI) →

Architecture

Flow-matching Vision-Language-Action policy

π-0 is a Vision-Language-Action model built on a 3B-parameter PaliGemma multimodal backbone (Gemma LLM + SigLIP vision tower), with a dedicated 'action expert' transformer head trained as a flow-matching policy. Inputs are multi-view RGB images, robot proprioception, and a natural-language instruction. The action expert outputs continuous action chunks at high frequency via flow matching - a recently popular alternative to diffusion that produces smoother trajectories with fewer integration steps. π-0 is pretrained on a large dataset of ~10,000 hours of multi-embodiment robot teleoperation collected by PI and partners, plus Open-X-Embodiment data and offline simulation. Post-training fine-tuning specialises the policy for individual robots and tasks. The model handles cross-embodiment action spaces (Aloha bimanual, Franka, mobile manipulators, humanoids) through learned action-space adapters.

Parameters: ~3B (PaliGemma backbone + action expert)
Context: unknown

What it can do

Generalist VLA driving multiple robot embodiments
Flow-matching action head producing smooth continuous actions
3B PaliGemma backbone (Gemma LLM + SigLIP vision)
Natural-language instruction conditioning
Long-horizon dexterous manipulation (laundry folding, bussing tables)
Multi-view + proprioception inputs
Reactive closed-loop control at high frequency
Partially open-sourced via the openpi GitHub repo
Best for: dexterous manipulation research, multi-task generalist policies.

Training & License

Approximately 10,000 hours of curated teleoperation data on a fleet of Physical Intelligence robots, augmented with Open-X-Embodiment cross-embodiment data and offline simulation. Multi-view RGB + proprioception + natural-language instructions.

License: Partially open-source - core π-0 model weights and inference code released via the openpi GitHub repository under permissive terms for research; commercial deployment is governed by Physical Intelligence's own licence and partnerships.

Known limitations