How much does Gemini 3.1 Pro cost via Railwail?

Input: €0.002 per 1M tokens. Output: €0.012 per 1M tokens. No monthly minimum, no subscription. Start with €5 free credits.

What is the context window of Gemini 3.1 Pro?

Gemini 3.1 Pro supports a 2M tokens context window — enough for entire codebases or research papers in one prompt.

How fast is Gemini 3.1 Pro?

Latency depends on prompt length and load — typically 200ms to 2s for short prompts. We measure p50/p95 in real-time on /rankings.

Is Gemini 3.1 Pro better than BLIP?

It depends on your use case. Gemini 3.1 Pro (Google DeepMind) and BLIP (Salesforce) are both strong choices in multimodal. Compare them side-by-side at /compare/gemini-3-1-pro-vs-blip-captioning.

Does Gemini 3.1 Pro support image input (vision)?

Yes — Gemini 3.1 Pro accepts image inputs in addition to text. Send images via the standard OpenAI-compatible `messages` array with `image_url` content blocks. Supported formats: text, image, audio, video.

Gemini 3.1 Pro

Name: Gemini 3.1 Pro
Brand: Google
SKU: gemini-3-1-pro
Price: 2e-6 EUR
Availability: InStock

New

Popular

Google DeepMind

Multimodal

Google DeepMind's February 2026 flagship. 2M-token context, native multimodal (text/image/audio/video), Deep Think reasoning.

Try Gemini 3.1 Pro now

Send a single prompt and stream a response inline. Hit Cmd+Enter to submit.

Press Cmd+Enter to send

Response appears here.

TL;DR·Last updated June 24, 2026

Gemini 3.1 Pro is multimodal AI model from Google DeepMind, priced at €0.002 per 1M input tokens with a 2M tokens context window.

About this model

Released February 19, 2026, Gemini 3.1 Pro is Google DeepMind's flagship frontier model. Native 2M-token context window, fully multimodal (text, image, audio, video tokens in a single pass), built-in Deep Think reasoning with adjustable thinking budgets. Top scores on GPQA Diamond, AIME, Humanity's Last Exam, and strong coding on SWE-bench. Function calling, JSON schema, Search grounding and Code Execution are first-class. Best for: long-document and long-video analysis, scientific reasoning, agentic workflows.

Try Gemini 3.1 Pro

System Prompt

Message

Temperature

0.7

Max Tokens

Pricing

Price per Generation

Per generationFree

API Integration

Use our OpenAI-compatible API to integrate Gemini 3.1 Pro into your application.

Install

npm install railwail

JavaScript / TypeScript

import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

// Simple — just pass a string
const reply = await rw.run("gemini-3-1-pro", "Hello! What can you do?");
console.log(reply);

// With message history
const reply2 = await rw.run("gemini-3-1-pro", [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);

// Full response with usage info
const res = await rw.chat("gemini-3-1-pro", [
  { role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);

Specifications

Context window

2,000,000 tokens

Max output

65,536 tokens

Developer

Google DeepMind

Deep dive — Google DeepMind's Gemini 3.1 Pro

About Google DeepMind

Founded 2010 · Mountain View, USA / London, UK

Google DeepMind is the merged AI research organisation formed in April 2023 by combining Google Brain (founded inside Google in 2011) with DeepMind (founded in London in 2010 by Demis Hassabis, Shane Legg and Mustafa Suleyman, acquired by Google in 2014). Demis Hassabis leads the unit as CEO. DeepMind authored seminal papers including 'Attention Is All You Need' (Google Brain, 2017), AlphaGo (2016), AlphaFold (2018-2021, awarded the 2024 Nobel Prize in Chemistry), AlphaZero, Chinchilla scaling laws and the Gemini Technical Report. The Gemini family launched in December 2023 (Ultra, Pro, Nano), followed by Gemini 1.5 Pro with 1M+ context (early 2024), Gemini 2.0 Flash (December 2024), Gemini 2.5 Pro with Deep Think (March/May 2025), Gemini 3 Pro (late 2025), Gemini 3.1 Pro (February 2026) and Gemini 3 Flash (April 2026). Google DeepMind also ships Imagen, Veo, Lyria and NotebookLM and powers AI features across Google Search, Workspace and Android.

Visit Google DeepMind →

Architecture

Sparse Mixture-of-Experts Transformer (natively multimodal, Deep Think reasoning)

Gemini 3.1 Pro was released February 19, 2026 as Google DeepMind's flagship frontier model. It is a natively multimodal Sparse Mixture-of-Experts Transformer that ingests text, image, audio and video tokens through a shared embedding space, building directly on the Gemini 3 Pro architecture (late 2025) and Gemini 2.5 Pro (early 2025). The 2M-token native context window is double that of 2.5 Pro. Pretraining used Google's TPU v6e infrastructure on a multi-trillion-token corpus mixing web text, code, books, scientific papers, image-text pairs, audio waveforms and video frames, with a knowledge cutoff in late 2025. Post-training combined supervised fine-tuning, RLHF, reinforcement learning against verifiable rewards on math and code, and a refreshed 'Deep Think' reasoning stage that teaches the model to allocate test-time thinking budgets and emit long internal chains-of-thought before its final answer. Tool use, function calling, structured output, Search grounding and the Code Execution tool are first-class. Safety training followed Google's Frontier Safety Framework v2.

Parameters: Undisclosed (sparse MoE, total parameters in the hundreds of billions, active per-token undisclosed)
Context: 2M tokens

What it can do

2,000,000 token context window with native long-video and long-audio support
Built-in Deep Think reasoning mode with adjustable thinking budget
Natively multimodal: text, image, audio and video in a single pass
Top scores on GPQA Diamond, AIME 2026 and Humanity's Last Exam
Strong coding performance on SWE-bench Verified and LiveCodeBench
Function calling, JSON schema and parallel tool calls
Search grounding and Code Execution tools built into the API
Cross-lingual reasoning across 100+ languages
Available via Vertex AI, AI Studio and the Gemini app (paid-only since April 2026)
2x batch pricing reduction for large jobs
Best for: long-document and long-video analysis, scientific reasoning, agentic workflows, complex multimodal extraction.

Training & License

Pretrained on a multi-trillion-token mixture of web text, code, books, scientific papers, licensed third-party text, audio waveforms, image-text pairs and video frames. Knowledge cutoff in late 2025. Post-training uses supervised fine-tuning, RLHF, RL against verifiable rewards and Deep Think reasoning training.

License: Proprietary commercial license via Google AI Studio, Vertex AI and the Gemini app. Paid-only since April 1, 2026.

Known limitations

Deep Think adds significant latency and token cost
Prompts above 200K tokens are billed at 2x input / 1.5x output rates
Long-context recall quality can degrade beyond ~1M tokens for some tasks
Vision occasionally misreads dense tables and handwriting
Region availability varies; not yet generally available in all EU regions

Research papers

Frequently asked questions

Related Models

View all Multimodal

BLIP

Salesforce

Salesforce BLIP. Vision-language model for image captioning and visual question answering. Given an image it writes a short natural-language caption, or answers a question about the image when one is supplied. A widely used baseline for automatic captioning.

€1.00

CLIP Interrogator

Community

pharmapsychotic's CLIP Interrogator. Takes an image and produces a Stable-Diffusion-style text prompt by combining BLIP captioning with CLIP to rank likely subjects, artists, mediums and styles. Commonly used to reverse-engineer a prompt from an existing picture.

€1.00

Claude 3.5 Sonnet (vision)

Anthropic

Anthropic Claude 3.5 Sonnet with image input. 200k context, strong on dense documents, tables, charts and handwriting. Reliable structured extraction from screenshots and scans.

Free

Claude Opus 4.7