google

Gemini 3 Pro

Google's Gemini 3 Pro is a multimodal powerhouse featuring a 1M token context window, native video processing, and industry-leading reasoning performance.

Multimodal AILong ContextFrontier ModelAGI-Ready
google logogoogleGemini 3November 17, 2025
Context
1.0Mtokens
Max Output
64Ktokens
Input Price
$2.00/ 1M
Output Price
$12.00/ 1M
Modality:TextImageAudioVideo
Capabilities:VisionToolsStreamingReasoning
Benchmarks
GPQA
92%
GPQA: Graduate-Level Science Q&A. A rigorous benchmark with 448 multiple-choice questions in biology, physics, and chemistry created by domain experts. PhD experts only achieve 65-74% accuracy, while non-experts score just 34% even with unlimited web access (hence 'Google-proof'). Gemini 3 Pro scored 92% on this benchmark.
HLE
37%
HLE: High-Level Expertise Reasoning. Tests a model's ability to demonstrate expert-level reasoning across specialized domains. Evaluates deep understanding of complex topics that require professional-level knowledge. Gemini 3 Pro scored 37% on this benchmark.
MMLU
92%
MMLU: Massive Multitask Language Understanding. A comprehensive benchmark with 16,000 multiple-choice questions across 57 academic subjects including math, philosophy, law, and medicine. Tests broad knowledge and reasoning capabilities. Gemini 3 Pro scored 92% on this benchmark.
MMLU Pro
90%
MMLU Pro: MMLU Professional Edition. An enhanced version of MMLU with 12,032 questions using a harder 10-option multiple choice format. Covers Math, Physics, Chemistry, Law, Engineering, Economics, Health, Psychology, Business, Biology, Philosophy, and Computer Science. Gemini 3 Pro scored 90% on this benchmark.
SimpleQA
47%
SimpleQA: Factual Accuracy Benchmark. Tests a model's ability to provide accurate, factual responses to straightforward questions. Measures reliability and reduces hallucinations in knowledge retrieval tasks. Gemini 3 Pro scored 47% on this benchmark.
IFEval
92%
IFEval: Instruction Following Evaluation. Measures how well a model follows specific instructions and constraints. Tests the ability to adhere to formatting rules, length limits, and other explicit requirements. Gemini 3 Pro scored 92% on this benchmark.
AIME 2025
100%
AIME 2025: American Invitational Math Exam. Competition-level mathematics problems from the prestigious AIME exam designed for talented high school students. Tests advanced mathematical problem-solving requiring abstract reasoning, not just pattern matching. Gemini 3 Pro scored 100% on this benchmark.
MATH
92%
MATH: Mathematical Problem Solving. A comprehensive math benchmark testing problem-solving across algebra, geometry, calculus, and other mathematical domains. Requires multi-step reasoning and formal mathematical knowledge. Gemini 3 Pro scored 92% on this benchmark.
GSM8k
99%
GSM8k: Grade School Math 8K. 8,500 grade school-level math word problems requiring multi-step reasoning. Tests basic arithmetic and logical thinking through real-world scenarios like shopping or time calculations. Gemini 3 Pro scored 99% on this benchmark.
MGSM
92%
MGSM: Multilingual Grade School Math. The GSM8k benchmark translated into 10 languages including Spanish, French, German, Russian, Chinese, and Japanese. Tests mathematical reasoning across different languages. Gemini 3 Pro scored 92% on this benchmark.
MathVista
78%
MathVista: Mathematical Visual Reasoning. Tests the ability to solve math problems that involve visual elements like charts, graphs, geometry diagrams, and scientific figures. Combines visual understanding with mathematical reasoning. Gemini 3 Pro scored 78% on this benchmark.
SWE-Bench
76%
SWE-Bench: Software Engineering Benchmark. AI models attempt to resolve real GitHub issues in open-source Python projects with human verification. Tests practical software engineering skills on production codebases. Top models went from 4.4% in 2023 to over 70% in 2024. Gemini 3 Pro scored 76% on this benchmark.
HumanEval
94%
HumanEval: Python Programming Problems. 164 hand-written programming problems where models must generate correct Python function implementations. Each solution is verified against unit tests. Top models now achieve 90%+ accuracy. Gemini 3 Pro scored 94% on this benchmark.
LiveCodeBench
81%
LiveCodeBench: Live Coding Benchmark. Tests coding abilities on continuously updated, real-world programming challenges. Unlike static benchmarks, uses fresh problems to prevent data contamination and measure true coding skills. Gemini 3 Pro scored 81% on this benchmark.
MMMU
81%
MMMU: Multimodal Understanding. Massive Multi-discipline Multimodal Understanding benchmark testing vision-language models on college-level problems across 30 subjects requiring both image understanding and expert knowledge. Gemini 3 Pro scored 81% on this benchmark.
MMMU Pro
81%
MMMU Pro: MMMU Professional Edition. Enhanced version of MMMU with more challenging questions and stricter evaluation. Tests advanced multimodal reasoning at professional and expert levels. Gemini 3 Pro scored 81% on this benchmark.
ChartQA
91%
ChartQA: Chart Question Answering. Tests the ability to understand and reason about information presented in charts and graphs. Requires extracting data, comparing values, and performing calculations from visual data representations. Gemini 3 Pro scored 91% on this benchmark.
DocVQA
95%
DocVQA: Document Visual Q&A. Document Visual Question Answering benchmark testing the ability to extract and reason about information from document images including forms, reports, and scanned text. Gemini 3 Pro scored 95% on this benchmark.
Terminal-Bench
68%
Terminal-Bench: Terminal/CLI Tasks. Tests the ability to perform command-line operations, write shell scripts, and navigate terminal environments. Measures practical system administration and development workflow skills. Gemini 3 Pro scored 68% on this benchmark.
ARC-AGI
31%
ARC-AGI: Abstraction & Reasoning. Abstraction and Reasoning Corpus for AGI - tests fluid intelligence through novel pattern recognition puzzles. Each task requires discovering the underlying rule from examples, measuring general reasoning ability rather than memorization. Gemini 3 Pro scored 31% on this benchmark.

About Gemini 3 Pro

Learn about Gemini 3 Pro's capabilities, features, and how it can help you achieve better results.

Native Multimodal Architecture

Gemini 3 Pro is Google’s primary flagship model, designed to process text, image, audio, and video natively within a single transformer pass. Unlike previous models that relied on separate encoders, this architecture preserves nuanced data across different modalities. It was released in late 2025 to serve as a high-performance alternative to frontier reasoning models, providing a balance between raw intelligence and operational efficiency.

Reasoning and Technical Performance

Technically, the model excels in quantitative fields, having achieved a perfect 100% on the AIME 2025 math exam. It incorporates an internal Deep Think layer, allowing the system to deliberate on complex logical structures before generating a response. This makes it particularly effective for scientific research, expert-level Q&A on GPQA Diamond, and advanced competitive programming where logic verification is critical.

Enterprise-Grade Context Utility

With a massive 1 million token context window, the model is built for large-scale data synthesis. It can ingest entire codebases or hours of high-definition video to extract specific insights without the information loss common in standard RAG architectures. This long-context capability, combined with optimized caching, allows enterprises to run complex autonomous workflows at a significantly lower cost than rival flagship systems.

Gemini 3 Pro

Use Cases

Discover the different ways you can use Gemini 3 Pro to achieve great results.

Autonomous Codebase Engineering

Ingest entire GitHub repositories into the 1M token context window for repo-wide debugging and feature implementation with architectural awareness.

Multimodal Video Intelligence

Analyze hour-long video files natively to extract temporal insights, summarize complex scenes, or identify visual-audio correlations.

PhD-Level Scientific Research

Solve graduate-level problems in physics and chemistry using leading GPQA scores and the ability to parse dense scientific tables.

3D Spatial Planning

Utilize the model's unique 3D reasoning capabilities to plan virtual environments, design UI layouts, or solve spatial puzzles.

Zero-Shot Game Development

Generate functional retro-style games or physics engines in a single prompt by leveraging advanced coding and logic synthesis.

Enterprise Document Synthesis

Process thousands of unstructured pages of financial documentation simultaneously to identify risks and generate structured reports.

Strengths

Limitations

Elite 3D Reasoning: Demonstrates superior ability to solve spatial puzzles and plan 3D environments, outperforming competitors in visual logic.
Verbosity Issues: Community benchmarks frequently categorize the model as very verbose, often using more tokens than necessary for simple tasks.
Massive Context Utility: The 1M token window allows for the ingestion of entire projects or hours of video without the data loss of RAG systems.
Hallucination Variance: While logic is improved, it still maintains a measurable hallucination rate in open-ended evaluations compared to smaller models.
Top-Tier Math Scores: Achieves a perfect 100% on the AIME 2025 math exam, making it a premier choice for quantitative and scientific analysis.
Context Scaling Penalty: The price doubles immediately after 200,000 tokens, which can lead to unexpected billing for large-scale enterprise operations.
Aggressive Pricing: At $2.00 per 1M input tokens, it offers frontier intelligence at a significantly lower cost than flagship alternatives.
Regional Feature Gaps: Some advanced agentic and deep thinking features are initially restricted to specific regions or English-language settings.

API Quick Start

google/gemini-3-pro-preview

View Documentation
google SDK
import { GoogleGenAI } from "@google/genai";

const genAI = new GoogleGenAI(process.env.GOOGLE_API_KEY);
const model = genAI.getGenerativeModel({ 
  model: "gemini-3-pro",
  thinkingConfig: { includeThoughts: true }
});

const prompt = "Explain the architectural implications of this 1M token codebase.";
const result = await model.generateContent(prompt);
console.log(result.response.text());

Install the SDK and start making API calls in minutes.

Community Feedback

See what the community thinks about Gemini 3 Pro

Gemini 3 Pro's 1M context is a game changer for codebase analysis. I finally uploaded my whole project and it didn't hallucinate the structure.
dev_guru_2026
reddit
The Deep Think mode is significantly better at logic than GPT-4o. It actually stops to deliberate rather than just blurting out the first answer.
AIExpertX
twitter
Google finally caught up with the 3.1 release. The benchmarks on ARC-AGI-2 don't lie; this is the reasoning crown for now.
hackernews_reader
hackernews
I love the speed and the multimodal features, but man, it can be too verbose sometimes. It gives you a 10-page report for a simple prompt.
TheTechReviewer
youtube
The math performance is the real story here. 100% on AIME 2025 is effectively solving high school competition math.
logic_king
reddit
Native audio processing makes a huge difference. It picks up on tone and sarcasm that text-only models miss.
prompt_engineer
twitter

Related Videos

Watch tutorials, reviews, and discussions about Gemini 3 Pro

Gemini 3 Pro... genuinely marks a new chapter in the race to true artificial intelligence.

On my own private independent benchmark, Simple Bench, it crushed its rivals.

The model exhibits a form of internal deliberation we haven't seen in previous iterations.

Its ability to understand long-form video content without pre-processing is its most underrated feature.

I think there are virtually no benchmarks left where the average human could perform better than Gemini 3 Pro.

Gemini 3 Pro Deepthink... arguably the smartest LLM in existence that is at least publicly accessible.

It uses advanced parallel reasoning to explore multiple hypotheses simultaneously.

The consistency across the 1M token window is significantly higher than 1.5 Pro.

You can see it correcting its own logical fallacies in the thought trace.

It is one of the only models to actually properly showcase the plane animation falling out of the sky.

The leap in capability from Gemini 2.5 to Gemini 3 Pro is the most significant jump seen since GPT-4.

The pricing is actually insane for what you're getting in terms of reasoning capacity.

Once Karpathy enabled the Google Search tool, the model experienced what it called temporal shock.

It handles TypeScript types better than any other model I've tested this year.

Gemini 3 successfully generated a recognizable game controller... while GPT 5.1 produced a barely recognizable shape.

More than just prompts

Supercharge your workflow with AI Automation

Automatio combines the power of AI agents, web automation, and smart integrations to help you accomplish more in less time.

AI Agents
Web Automation
Smart Workflows

Pro Tips

Expert tips to help you get the most out of Gemini 3 Pro and achieve better results.

Leverage Reasoning Toggles

Use the Deep Think configuration to balance speed and accuracy, reserving the High setting for competitive programming.

Context Caching for ROI

Utilize context caching for long-term projects to reduce costs by up to 90% when querying the same 1M token dataset.

Provide Full Repository Context

When coding, upload the entire file structure rather than snippets to allow the model to maintain architectural consistency.

Temporal Prompting

When analyzing video, reference specific timestamps in your prompt to help the model focus its attention on key visual events.

Testimonials

What Our Users Say

Join thousands of satisfied users who have transformed their workflow

Jonathan Kogan

Jonathan Kogan

Co-Founder/CEO, rpatools.io

Automatio is one of the most used for RPA Tools both internally and externally. It saves us countless hours of work and we realized this could do the same for other startups and so we choose Automatio for most of our automation needs.

Mohammed Ibrahim

Mohammed Ibrahim

CEO, qannas.pro

I have used many tools over the past 5 years, Automatio is the Jack of All trades.. !! it could be your scraping bot in the morning and then it becomes your VA by the noon and in the evening it does your automations.. its amazing!

Ben Bressington

Ben Bressington

CTO, AiChatSolutions

Automatio is fantastic and simple to use to extract data from any website. This allowed me to replace a developer and do tasks myself as they only take a few minutes to setup and forget about it. Automatio is a game changer!

Sarah Chen

Sarah Chen

Head of Growth, ScaleUp Labs

We've tried dozens of automation tools, but Automatio stands out for its flexibility and ease of use. Our team productivity increased by 40% within the first month of adoption.

David Park

David Park

Founder, DataDriven.io

The AI-powered features in Automatio are incredible. It understands context and adapts to changes in websites automatically. No more broken scrapers!

Emily Rodriguez

Emily Rodriguez

Marketing Director, GrowthMetrics

Automatio transformed our lead generation process. What used to take our team days now happens automatically in minutes. The ROI is incredible.

Jonathan Kogan

Jonathan Kogan

Co-Founder/CEO, rpatools.io

Automatio is one of the most used for RPA Tools both internally and externally. It saves us countless hours of work and we realized this could do the same for other startups and so we choose Automatio for most of our automation needs.

Mohammed Ibrahim

Mohammed Ibrahim

CEO, qannas.pro

I have used many tools over the past 5 years, Automatio is the Jack of All trades.. !! it could be your scraping bot in the morning and then it becomes your VA by the noon and in the evening it does your automations.. its amazing!

Ben Bressington

Ben Bressington

CTO, AiChatSolutions

Automatio is fantastic and simple to use to extract data from any website. This allowed me to replace a developer and do tasks myself as they only take a few minutes to setup and forget about it. Automatio is a game changer!

Sarah Chen

Sarah Chen

Head of Growth, ScaleUp Labs

We've tried dozens of automation tools, but Automatio stands out for its flexibility and ease of use. Our team productivity increased by 40% within the first month of adoption.

David Park

David Park

Founder, DataDriven.io

The AI-powered features in Automatio are incredible. It understands context and adapts to changes in websites automatically. No more broken scrapers!

Emily Rodriguez

Emily Rodriguez

Marketing Director, GrowthMetrics

Automatio transformed our lead generation process. What used to take our team days now happens automatically in minutes. The ROI is incredible.

Related AI Models

anthropic

Claude Opus 4.6

Anthropic

Claude Opus 4.6 is Anthropic's flagship model featuring a 1M token context window, Adaptive Thinking, and world-class coding and reasoning performance.

1M context
$5.00/$25.00/1M
openai

GPT-5.2 Pro

OpenAI

GPT-5.2 Pro is OpenAI's 2025 flagship reasoning model featuring Extended Thinking for SOTA performance in mathematics, coding, and expert knowledge work.

400K context
$21.00/$168.00/1M
xai

Grok-3

xAI

Grok-3 is xAI's flagship reasoning model, featuring deep logic deduction, a 128k context window, and real-time integration with X for live research and coding.

1M context
$3.00/$15.00/1M
google

Gemini 3 Flash

Google

Gemini 3 Flash is Google's high-speed multimodal model featuring a 1M token context window, elite 90.4% GPQA reasoning, and autonomous browser automation tools.

1M context
$0.50/$3.00/1M
anthropic

Claude Sonnet 4.6

Anthropic

Claude Sonnet 4.6 offers frontier performance for coding and computer use with a massive 1M token context window for only $3/1M tokens.

1M context
$3.00/$15.00/1M
google

Gemini 3.1 Pro

Google

Gemini 3.1 Pro is Google's elite multimodal model featuring the DeepThink reasoning engine, a 1M+ context window, and industry-leading ARC-AGI logic scores.

1M context
$2.00/$12.00/1M
alibaba

Qwen3.5-397B-A17B

alibaba

Qwen3.5-397B-A17B is Alibaba's flagship open-weight MoE model. It features native multimodal reasoning, a 1M context window, and a 19x decoding throughput...

1M context
$0.40/$2.40/1M
openai

GPT-5.1

OpenAI

GPT-5.1 is OpenAI’s advanced reasoning flagship featuring adaptive thinking, native multimodality, and state-of-the-art performance in math and technical...

400K context
$1.25/$10.00/1M

Frequently Asked Questions

Find answers to common questions about Gemini 3 Pro