Gemini 3.1 Pro
Google DeepMind's February 2026 flagship. 2M-token context, native multimodal (text/image/audio/video), Deep Think reasoning.
Gemini 3.1 Pro is multimodal AI model from Google DeepMind, priced at €0.002 per 1M input tokens with a 2M tokens context window.
About this model
0.7
Pricing
API Integration
Use our OpenAI-compatible API to integrate Gemini 3.1 Pro into your application.
npm install railwailimport railwail from "railwail";
const rw = railwail("YOUR_API_KEY");
// Simple — just pass a string
const reply = await rw.run("gemini-3-1-pro", "Hello! What can you do?");
console.log(reply);
// With message history
const reply2 = await rw.run("gemini-3-1-pro", [
{ role: "system", content: "You are a helpful assistant." },
{ role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);
// Full response with usage info
const res = await rw.chat("gemini-3-1-pro", [
{ role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);Deep dive — Google DeepMind's Gemini 3.1 Pro
Google DeepMind is the merged AI research organisation formed in April 2023 by combining Google Brain (founded inside Google in 2011) with DeepMind (founded in London in 2010 by Demis Hassabis, Shane Legg and Mustafa Suleyman, acquired by Google in 2014). Demis Hassabis leads the unit as CEO. DeepMind authored seminal papers including 'Attention Is All You Need' (Google Brain, 2017), AlphaGo (2016), AlphaFold (2018-2021, awarded the 2024 Nobel Prize in Chemistry), AlphaZero, Chinchilla scaling laws and the Gemini Technical Report. The Gemini family launched in December 2023 (Ultra, Pro, Nano), followed by Gemini 1.5 Pro with 1M+ context (early 2024), Gemini 2.0 Flash (December 2024), Gemini 2.5 Pro with Deep Think (March/May 2025), Gemini 3 Pro (late 2025), Gemini 3.1 Pro (February 2026) and Gemini 3 Flash (April 2026). Google DeepMind also ships Imagen, Veo, Lyria and NotebookLM and powers AI features across Google Search, Workspace and Android.
Visit Google DeepMind →Gemini 3.1 Pro was released February 19, 2026 as Google DeepMind's flagship frontier model. It is a natively multimodal Sparse Mixture-of-Experts Transformer that ingests text, image, audio and video tokens through a shared embedding space, building directly on the Gemini 3 Pro architecture (late 2025) and Gemini 2.5 Pro (early 2025). The 2M-token native context window is double that of 2.5 Pro. Pretraining used Google's TPU v6e infrastructure on a multi-trillion-token corpus mixing web text, code, books, scientific papers, image-text pairs, audio waveforms and video frames, with a knowledge cutoff in late 2025. Post-training combined supervised fine-tuning, RLHF, reinforcement learning against verifiable rewards on math and code, and a refreshed 'Deep Think' reasoning stage that teaches the model to allocate test-time thinking budgets and emit long internal chains-of-thought before its final answer. Tool use, function calling, structured output, Search grounding and the Code Execution tool are first-class. Safety training followed Google's Frontier Safety Framework v2.
- Parameters
- Undisclosed (sparse MoE, total parameters in the hundreds of billions, active per-token undisclosed)
- Context
- 2M tokens
- 2,000,000 token context window with native long-video and long-audio support
- Built-in Deep Think reasoning mode with adjustable thinking budget
- Natively multimodal: text, image, audio and video in a single pass
- Top scores on GPQA Diamond, AIME 2026 and Humanity's Last Exam
- Strong coding performance on SWE-bench Verified and LiveCodeBench
- Function calling, JSON schema and parallel tool calls
- Search grounding and Code Execution tools built into the API
- Cross-lingual reasoning across 100+ languages
- Available via Vertex AI, AI Studio and the Gemini app (paid-only since April 2026)
- 2x batch pricing reduction for large jobs
- Best for: long-document and long-video analysis, scientific reasoning, agentic workflows, complex multimodal extraction.
Pretrained on a multi-trillion-token mixture of web text, code, books, scientific papers, licensed third-party text, audio waveforms, image-text pairs and video frames. Knowledge cutoff in late 2025. Post-training uses supervised fine-tuning, RLHF, RL against verifiable rewards and Deep Think reasoning training.
License: Proprietary commercial license via Google AI Studio, Vertex AI and the Gemini app. Paid-only since April 1, 2026.
Known limitations
- Deep Think adds significant latency and token cost
- Prompts above 200K tokens are billed at 2x input / 1.5x output rates
- Long-context recall quality can degrade beyond ~1M tokens for some tasks
- Vision occasionally misreads dense tables and handwriting
- Region availability varies; not yet generally available in all EU regions
Frequently asked questions
Related Models
View all MultimodalClaude Opus 4.7
Anthropic's April 2026 flagship. 87.6% on SWE-bench Verified, 3x higher image resolution, output self-verification, vision + reasoning.
Claude Sonnet 4.6
Anthropic's balanced mid-tier model from February 2026. Best price/performance for production workloads: 5x cheaper than Opus, near-flagship quality.
Depth Anything v2
Monocular depth-estimation model trained on 595k labeled and 62M unlabeled images. Strong zero-shot generalization in indoor and outdoor scenes.
GPT-5.4
OpenAI's unified flagship combining GPT and o-series reasoning into one model. 1M context, multimodal, top SWE-Bench Pro and OSWorld scores.
Start using Gemini 3.1 Pro today
Get started with free credits. No credit card required. Access Gemini 3.1 Pro and 100+ other models through a single API.