How much does Physical Intelligence Pi-0-FAST cost via Railwail?

No monthly minimum, no subscription. Start with €5 free credits.

What is the context window of Physical Intelligence Pi-0-FAST?

Physical Intelligence Pi-0-FAST supports a unknown context window — enough for typical AI workloads.

How fast is Physical Intelligence Pi-0-FAST?

Latency depends on prompt length and load — typically 200ms to 2s for short prompts. We measure p50/p95 in real-time on /rankings.

Is Physical Intelligence Pi-0-FAST better than Gemini Robotics (2025)?

It depends on your use case. Physical Intelligence Pi-0-FAST (Physical Intelligence) and Gemini Robotics (2025) (Google DeepMind) are both strong choices in vla / robotics. Compare them side-by-side at /compare/pi-0-fast-vs-gemini-robotics-2025.

Does Physical Intelligence Pi-0-FAST support image input (vision)?

Yes — Physical Intelligence Pi-0-FAST accepts image inputs in addition to text. Send images via the standard OpenAI-compatible `messages` array with `image_url` content blocks. Supported formats: image, text.

Physical Intelligence Pi-0-FAST

Name: Physical Intelligence Pi-0-FAST
Brand: Custom
SKU: pi-0-fast
Availability: InStock

Physical Intelligence

VLA / Robotics

Autoregressive π-0 variant using FAST action tokenizer. Faster inference at competitive task success.

Research-only model

Physical Intelligence Pi-0-FAST runs on physical robot hardware and is not exposed via the Railwail API yet.

Not API-accessible

Read the research

TL;DR·Last updated June 24, 2026

Physical Intelligence Pi-0-FAST is vla / robotics AI model from Physical Intelligence, priced at €0.000 per 1M input tokens with a unknown context window.

Try Physical Intelligence Pi-0-FAST

System Prompt

Message

Temperature

0.7

Max Tokens

Direct API access coming soon

Pricing

Price per Generation

Per generationFree

API Integration

Use our OpenAI-compatible API to integrate Physical Intelligence Pi-0-FAST into your application.

Install

npm install railwail

JavaScript / TypeScript

import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

// Simple — just pass a string
const reply = await rw.run("pi-0-fast", "Hello! What can you do?");
console.log(reply);

// With message history
const reply2 = await rw.run("pi-0-fast", [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);

// Full response with usage info
const res = await rw.chat("pi-0-fast", [
  { role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);

Specifications

Developer

Physical Intelligence

Deep dive — Physical Intelligence (PI)'s Physical Intelligence Pi-0-FAST

About Physical Intelligence (PI)

Founded 2024 · San Francisco, California, USA

Physical Intelligence (PI) was founded in 2024 in San Francisco by Sergey Levine, Chelsea Finn, Karol Hausman and other co-founders, with a mission to build foundation models for general-purpose robots. π-0-FAST, released in early 2025, is the autoregressive variant of the π-0 VLA introduced together with the FAST action tokenizer. FAST (Frequency-space Action Sequence Tokenization) uses Discrete Cosine Transform compression to encode entire action chunks as a small number of discrete tokens, allowing a standard autoregressive VLM head to play the role of a robot policy without diffusion or flow-matching sampling. PI publishes π-0-FAST checkpoints and tokenizer code via the openpi GitHub repository alongside the flow-matching π-0 family, giving researchers both autoregressive and flow-matching policy baselines from the same backbone and data.

Visit Physical Intelligence (PI) →

Architecture

Autoregressive Vision-Language-Action policy with FAST action tokenizer

π-0-FAST keeps the PaliGemma 3B backbone (Gemma LLM + SigLIP vision tower) from π-0 but replaces the flow-matching action expert with an autoregressive decoder that emits actions as FAST tokens. FAST encodes a chunk of continuous actions by applying a Discrete Cosine Transform along the time axis and quantising the resulting frequency coefficients, yielding a compact discrete representation that captures both fast and slow motion components efficiently. The VLM is then trained with a standard next-token objective to predict these action tokens given image observations, proprioception and a natural-language instruction. This makes π-0-FAST architecturally similar to OpenVLA / RT-2-X (token-output VLA) but with a much more sample-efficient action codebook. Reported results show π-0-FAST matching or outperforming the flow-matching π-0 on many benchmarks while simplifying inference to a single autoregressive forward pass per action chunk.

Parameters: ~3B (PaliGemma backbone with FAST action head)
Context: unknown

What it can do

Autoregressive VLA variant of π-0 using FAST action tokens
FAST tokenizer compresses action chunks via DCT
Single set of weights for many robot embodiments
Same PaliGemma 3B backbone as π-0 and π-0.5
Matches or exceeds flow-matching π-0 on key benchmarks
Easier integration with standard LLM serving stacks
Open-source code and weights via openpi repository
Compatible with existing autoregressive fine-tuning recipes
Best for: research on autoregressive VLAs and action tokenisation.

Training & License

Trained on the same multi-embodiment teleoperation corpus used for π-0 (~10,000+ hours), plus Open-X-Embodiment data, with the FAST tokenizer providing the discrete action target instead of continuous flow-matching trajectories.

License: Partially open-source via the openpi GitHub repository; research weights and FAST tokenizer published, commercial deployment governed by Physical Intelligence directly.

Known limitations