Google Releases Gemini 3.1 Pro with Massive 2 Million Token Context Window
Google's Gemini 3.1 Pro sets a new industry benchmark with a 2 million token context window and 77.1% on ARC-AGI-2, enabling AI to reason over entire codebases and lengthy documents in one pass.
Google has released Gemini 3.1 Pro, its latest large language model and the new frontrunner in long-context AI processing. The model's headline feature is support for a context window of up to 2 million tokens — enough to process an entire novel, hundreds of academic papers, or a complete large-scale software codebase in a single inference call. The release marks a significant leap from the 1 million token context window supported by its predecessor.
In benchmark testing, Gemini 3.1 Pro scored 77.1% on the ARC-AGI-2 reasoning benchmark, a notoriously difficult test designed to measure fluid intelligence in AI systems. That score places it among the top-performing models in the world for complex multi-step reasoning tasks, particularly those involving long-horizon planning across large bodies of information.
The technical underpinnings of the expanded context window rely on a combination of Ring Attention and Grouped-Query Attention mechanisms. These allow the model to divide ultra-long sequences into chunks, distribute them across computing devices for parallel processing, and exchange key information through an efficient communication ring — dramatically reducing memory overhead without sacrificing accuracy. Google also released Gemma 4, an open-weight companion series under the Apache 2.0 license, targeting agentic workflows on lower-cost hardware.
For enterprise developers, the 2 million token context window opens up a new class of AI-powered applications: real-time code review across entire repositories, comprehensive legal document analysis, and deep scientific literature synthesis — all without chunking or retrieval hacks. Gemini 3.1 Pro is available via the Gemini API and Vertex AI, with pricing structured around token usage. Google has positioned this release as a foundational step toward fully autonomous AI agents capable of reasoning over the full context of complex, real-world tasks.