Gemini 3.1 Pro

New
Popular
Google DeepMind
Multimodal

Google DeepMind's February 2026 flagship. 2M-token context, native multimodal (text/image/audio/video), Deep Think reasoning.

Try Gemini 3.1 Pro now
Send a single prompt and stream a response inline. Hit Cmd+Enter to submit.
Sign in to try this model with €5 free credits.
Sign in
Press Cmd+Enter to send
Response appears here.
TL;DR·Last updated May 16, 2026

Gemini 3.1 Pro is multimodal AI model from Google DeepMind, priced at €0.002 per 1M input tokens with a 2M tokens context window.

About this model

Released February 19, 2026, Gemini 3.1 Pro is Google DeepMind's flagship frontier model. Native 2M-token context window, fully multimodal (text, image, audio, video tokens in a single pass), built-in Deep Think reasoning with adjustable thinking budgets. Top scores on GPQA Diamond, AIME, Humanity's Last Exam, and strong coding on SWE-bench. Function calling, JSON schema, Search grounding and Code Execution are first-class. Best for: long-document and long-video analysis, scientific reasoning, agentic workflows.
Try Gemini 3.1 Pro

0.7

Sign in to generate — 50 free credits on sign-up

Pricing

Price per Generation
Per generationFree

API Integration

Use our OpenAI-compatible API to integrate Gemini 3.1 Pro into your application.

Install
npm install railwail
JavaScript / TypeScript
import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

// Simple — just pass a string
const reply = await rw.run("gemini-3-1-pro", "Hello! What can you do?");
console.log(reply);

// With message history
const reply2 = await rw.run("gemini-3-1-pro", [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);

// Full response with usage info
const res = await rw.chat("gemini-3-1-pro", [
  { role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);
Specifications
Context window
2,000,000 tokens
Max output
65,536 tokens
Developer
Google DeepMind
Category
Multimodal
Supported Formats
text
image
audio
video
Tags
google
deepmind
flagship
multimodal
deep-think
long-context
2m-context
video-understanding

Deep dive — Google DeepMind's Gemini 3.1 Pro

About Google DeepMind
Founded 2010 · Mountain View, USA / London, UK

Google DeepMind is the merged AI research organisation formed in April 2023 by combining Google Brain (founded inside Google in 2011) with DeepMind (founded in London in 2010 by Demis Hassabis, Shane Legg and Mustafa Suleyman, acquired by Google in 2014). Demis Hassabis leads the unit as CEO. DeepMind authored seminal papers including 'Attention Is All You Need' (Google Brain, 2017), AlphaGo (2016), AlphaFold (2018-2021, awarded the 2024 Nobel Prize in Chemistry), AlphaZero, Chinchilla scaling laws and the Gemini Technical Report. The Gemini family launched in December 2023 (Ultra, Pro, Nano), followed by Gemini 1.5 Pro with 1M+ context (early 2024), Gemini 2.0 Flash (December 2024), Gemini 2.5 Pro with Deep Think (March/May 2025), Gemini 3 Pro (late 2025), Gemini 3.1 Pro (February 2026) and Gemini 3 Flash (April 2026). Google DeepMind also ships Imagen, Veo, Lyria and NotebookLM and powers AI features across Google Search, Workspace and Android.

Visit Google DeepMind →
Architecture
Sparse Mixture-of-Experts Transformer (natively multimodal, Deep Think reasoning)

Gemini 3.1 Pro was released February 19, 2026 as Google DeepMind's flagship frontier model. It is a natively multimodal Sparse Mixture-of-Experts Transformer that ingests text, image, audio and video tokens through a shared embedding space, building directly on the Gemini 3 Pro architecture (late 2025) and Gemini 2.5 Pro (early 2025). The 2M-token native context window is double that of 2.5 Pro. Pretraining used Google's TPU v6e infrastructure on a multi-trillion-token corpus mixing web text, code, books, scientific papers, image-text pairs, audio waveforms and video frames, with a knowledge cutoff in late 2025. Post-training combined supervised fine-tuning, RLHF, reinforcement learning against verifiable rewards on math and code, and a refreshed 'Deep Think' reasoning stage that teaches the model to allocate test-time thinking budgets and emit long internal chains-of-thought before its final answer. Tool use, function calling, structured output, Search grounding and the Code Execution tool are first-class. Safety training followed Google's Frontier Safety Framework v2.

Parameters
Undisclosed (sparse MoE, total parameters in the hundreds of billions, active per-token undisclosed)
Context
2M tokens
What it can do
  • 2,000,000 token context window with native long-video and long-audio support
  • Built-in Deep Think reasoning mode with adjustable thinking budget
  • Natively multimodal: text, image, audio and video in a single pass
  • Top scores on GPQA Diamond, AIME 2026 and Humanity's Last Exam
  • Strong coding performance on SWE-bench Verified and LiveCodeBench
  • Function calling, JSON schema and parallel tool calls
  • Search grounding and Code Execution tools built into the API
  • Cross-lingual reasoning across 100+ languages
  • Available via Vertex AI, AI Studio and the Gemini app (paid-only since April 2026)
  • 2x batch pricing reduction for large jobs
  • Best for: long-document and long-video analysis, scientific reasoning, agentic workflows, complex multimodal extraction.
Training & License

Pretrained on a multi-trillion-token mixture of web text, code, books, scientific papers, licensed third-party text, audio waveforms, image-text pairs and video frames. Knowledge cutoff in late 2025. Post-training uses supervised fine-tuning, RLHF, RL against verifiable rewards and Deep Think reasoning training.

License: Proprietary commercial license via Google AI Studio, Vertex AI and the Gemini app. Paid-only since April 1, 2026.

Known limitations
  • Deep Think adds significant latency and token cost
  • Prompts above 200K tokens are billed at 2x input / 1.5x output rates
  • Long-context recall quality can degrade beyond ~1M tokens for some tasks
  • Vision occasionally misreads dense tables and handwriting
  • Region availability varies; not yet generally available in all EU regions

Frequently asked questions

Start using Gemini 3.1 Pro today

Get started with free credits. No credit card required. Access Gemini 3.1 Pro and 100+ other models through a single API.