Physical Intelligence Pi-0-FAST

Physical Intelligence
VLA / Robotics

Autoregressive Ο€-0 variant using FAST action tokenizer. Faster inference at competitive task success.

Research-only model
Physical Intelligence Pi-0-FAST runs on physical robot hardware and is not exposed via the Railwail API yet.
Not API-accessible
Read the research
TL;DRΒ·Last updated May 16, 2026

Physical Intelligence Pi-0-FAST is vla / robotics AI model from Physical Intelligence, priced at €0.000 per 1M input tokens with a unknown context window.

Try Physical Intelligence Pi-0-FAST

0.7

Direct API access coming soon

Pricing

Price per Generation
Per generationFree

API Integration

Use our OpenAI-compatible API to integrate Physical Intelligence Pi-0-FAST into your application.

Install
npm install railwail
JavaScript / TypeScript
import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

// Simple β€” just pass a string
const reply = await rw.run("pi-0-fast", "Hello! What can you do?");
console.log(reply);

// With message history
const reply2 = await rw.run("pi-0-fast", [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);

// Full response with usage info
const res = await rw.chat("pi-0-fast", [
  { role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);
Specifications
Developer
Physical Intelligence
Category
VLA / Robotics
Supported Formats
image
text
Tags
physical-intelligence
vla
robotics
research-only
open-weights
autoregressive
fast

Deep dive β€” Physical Intelligence (PI)'s Physical Intelligence Pi-0-FAST

About Physical Intelligence (PI)
Founded 2024 Β· San Francisco, California, USA

Physical Intelligence (PI) was founded in 2024 in San Francisco by Sergey Levine, Chelsea Finn, Karol Hausman and other co-founders, with a mission to build foundation models for general-purpose robots. Ο€-0-FAST, released in early 2025, is the autoregressive variant of the Ο€-0 VLA introduced together with the FAST action tokenizer. FAST (Frequency-space Action Sequence Tokenization) uses Discrete Cosine Transform compression to encode entire action chunks as a small number of discrete tokens, allowing a standard autoregressive VLM head to play the role of a robot policy without diffusion or flow-matching sampling. PI publishes Ο€-0-FAST checkpoints and tokenizer code via the openpi GitHub repository alongside the flow-matching Ο€-0 family, giving researchers both autoregressive and flow-matching policy baselines from the same backbone and data.

Visit Physical Intelligence (PI) β†’
Architecture
Autoregressive Vision-Language-Action policy with FAST action tokenizer

Ο€-0-FAST keeps the PaliGemma 3B backbone (Gemma LLM + SigLIP vision tower) from Ο€-0 but replaces the flow-matching action expert with an autoregressive decoder that emits actions as FAST tokens. FAST encodes a chunk of continuous actions by applying a Discrete Cosine Transform along the time axis and quantising the resulting frequency coefficients, yielding a compact discrete representation that captures both fast and slow motion components efficiently. The VLM is then trained with a standard next-token objective to predict these action tokens given image observations, proprioception and a natural-language instruction. This makes Ο€-0-FAST architecturally similar to OpenVLA / RT-2-X (token-output VLA) but with a much more sample-efficient action codebook. Reported results show Ο€-0-FAST matching or outperforming the flow-matching Ο€-0 on many benchmarks while simplifying inference to a single autoregressive forward pass per action chunk.

Parameters
~3B (PaliGemma backbone with FAST action head)
Context
unknown
What it can do
  • Autoregressive VLA variant of Ο€-0 using FAST action tokens
  • FAST tokenizer compresses action chunks via DCT
  • Single set of weights for many robot embodiments
  • Same PaliGemma 3B backbone as Ο€-0 and Ο€-0.5
  • Matches or exceeds flow-matching Ο€-0 on key benchmarks
  • Easier integration with standard LLM serving stacks
  • Open-source code and weights via openpi repository
  • Compatible with existing autoregressive fine-tuning recipes
  • Best for: research on autoregressive VLAs and action tokenisation.
Training & License

Trained on the same multi-embodiment teleoperation corpus used for Ο€-0 (~10,000+ hours), plus Open-X-Embodiment data, with the FAST tokenizer providing the discrete action target instead of continuous flow-matching trajectories.

License: Partially open-source via the openpi GitHub repository; research weights and FAST tokenizer published, commercial deployment governed by Physical Intelligence directly.

Known limitations
  • Discrete action tokens can quantise away very fine motion detail
  • Long action chunks still bottlenecked by autoregressive decoding
  • Generalisation outside training distribution still limited
  • Requires the FAST tokenizer for new action spaces
  • Open-source release trails internal newest checkpoint
  • Documentation primarily targets researchers

Frequently asked questions

Start using Physical Intelligence Pi-0-FAST today

Get started with free credits. No credit card required. Access Physical Intelligence Pi-0-FAST and 100+ other models through a single API.