google

Gemini 3 Pro

Gemini 3 Pro is Google's flagship multimodal AI featuring 1M context, 'Anti-gravity' agentic reasoning, and record-breaking performance on GPQA and ARC-AGI.

google logogoogleGemini 3November 18, 2025
Context
1.0Mtokens
Max Output
66Ktokens
Input Price
$2.00/ 1M
Output Price
$12.00/ 1M
Modality:TextImageAudioVideo
Capabilities:VisionToolsStreamingReasoning
Benchmarks
GPQA
92%
GPQA: Graduate-Level Science Q&A. A rigorous benchmark with 448 multiple-choice questions in biology, physics, and chemistry created by domain experts. PhD experts only achieve 65-74% accuracy, while non-experts score just 34% even with unlimited web access (hence 'Google-proof'). Gemini 3 Pro scored 92% on this benchmark.
HLE
46%
HLE: High-Level Expertise Reasoning. Tests a model's ability to demonstrate expert-level reasoning across specialized domains. Evaluates deep understanding of complex topics that require professional-level knowledge. Gemini 3 Pro scored 46% on this benchmark.
MMLU
92%
MMLU: Massive Multitask Language Understanding. A comprehensive benchmark with 16,000 multiple-choice questions across 57 academic subjects including math, philosophy, law, and medicine. Tests broad knowledge and reasoning capabilities. Gemini 3 Pro scored 92% on this benchmark.
MMLU Pro
90%
MMLU Pro: MMLU Professional Edition. An enhanced version of MMLU with 12,032 questions using a harder 10-option multiple choice format. Covers Math, Physics, Chemistry, Law, Engineering, Economics, Health, Psychology, Business, Biology, Philosophy, and Computer Science. Gemini 3 Pro scored 90% on this benchmark.
SimpleQA
72%
SimpleQA: Factual Accuracy Benchmark. Tests a model's ability to provide accurate, factual responses to straightforward questions. Measures reliability and reduces hallucinations in knowledge retrieval tasks. Gemini 3 Pro scored 72% on this benchmark.
IFEval
85%
IFEval: Instruction Following Evaluation. Measures how well a model follows specific instructions and constraints. Tests the ability to adhere to formatting rules, length limits, and other explicit requirements. Gemini 3 Pro scored 85% on this benchmark.
AIME 2025
100%
AIME 2025: American Invitational Math Exam. Competition-level mathematics problems from the prestigious AIME exam designed for talented high school students. Tests advanced mathematical problem-solving requiring abstract reasoning, not just pattern matching. Gemini 3 Pro scored 100% on this benchmark.
MATH
78%
MATH: Mathematical Problem Solving. A comprehensive math benchmark testing problem-solving across algebra, geometry, calculus, and other mathematical domains. Requires multi-step reasoning and formal mathematical knowledge. Gemini 3 Pro scored 78% on this benchmark.
GSM8k
99%
GSM8k: Grade School Math 8K. 8,500 grade school-level math word problems requiring multi-step reasoning. Tests basic arithmetic and logical thinking through real-world scenarios like shopping or time calculations. Gemini 3 Pro scored 99% on this benchmark.
MGSM
98%
MGSM: Multilingual Grade School Math. The GSM8k benchmark translated into 10 languages including Spanish, French, German, Russian, Chinese, and Japanese. Tests mathematical reasoning across different languages. Gemini 3 Pro scored 98% on this benchmark.
MathVista
78%
MathVista: Mathematical Visual Reasoning. Tests the ability to solve math problems that involve visual elements like charts, graphs, geometry diagrams, and scientific figures. Combines visual understanding with mathematical reasoning. Gemini 3 Pro scored 78% on this benchmark.
SWE-Bench
76%
SWE-Bench: Software Engineering Benchmark. AI models attempt to resolve real GitHub issues in open-source Python projects with human verification. Tests practical software engineering skills on production codebases. Top models went from 4.4% in 2023 to over 70% in 2024. Gemini 3 Pro scored 76% on this benchmark.
HumanEval
90%
HumanEval: Python Programming Problems. 164 hand-written programming problems where models must generate correct Python function implementations. Each solution is verified against unit tests. Top models now achieve 90%+ accuracy. Gemini 3 Pro scored 90% on this benchmark.
LiveCodeBench
81%
LiveCodeBench: Live Coding Benchmark. Tests coding abilities on continuously updated, real-world programming challenges. Unlike static benchmarks, uses fresh problems to prevent data contamination and measure true coding skills. Gemini 3 Pro scored 81% on this benchmark.
MMMU
81%
MMMU: Multimodal Understanding. Massive Multi-discipline Multimodal Understanding benchmark testing vision-language models on college-level problems across 30 subjects requiring both image understanding and expert knowledge. Gemini 3 Pro scored 81% on this benchmark.
MMMU Pro
81%
MMMU Pro: MMMU Professional Edition. Enhanced version of MMMU with more challenging questions and stricter evaluation. Tests advanced multimodal reasoning at professional and expert levels. Gemini 3 Pro scored 81% on this benchmark.
ChartQA
85%
ChartQA: Chart Question Answering. Tests the ability to understand and reason about information presented in charts and graphs. Requires extracting data, comparing values, and performing calculations from visual data representations. Gemini 3 Pro scored 85% on this benchmark.
DocVQA
92%
DocVQA: Document Visual Q&A. Document Visual Question Answering benchmark testing the ability to extract and reason about information from document images including forms, reports, and scanned text. Gemini 3 Pro scored 92% on this benchmark.
Terminal-Bench
54%
Terminal-Bench: Terminal/CLI Tasks. Tests the ability to perform command-line operations, write shell scripts, and navigate terminal environments. Measures practical system administration and development workflow skills. Gemini 3 Pro scored 54% on this benchmark.
ARC-AGI
31%
ARC-AGI: Abstraction & Reasoning. Abstraction and Reasoning Corpus for AGI - tests fluid intelligence through novel pattern recognition puzzles. Each task requires discovering the underlying rule from examples, measuring general reasoning ability rather than memorization. Gemini 3 Pro scored 31% on this benchmark.
Prompt
Response
GPT-5 Mini

Your AI response will appear here

About Gemini 3 Pro

Learn about Gemini 3 Pro's capabilities, features, and how it can help you achieve better results.

Gemini 3 Pro represents Google's most significant leap in artificial intelligence, introducing a 'dynamic thinking' architecture that allows the model to scale its reasoning capabilities based on task complexity. Built on Google’s custom TPU infrastructure, it is designed for high-performance agentic workflows and state-of-the-art multimodal understanding across text, image, audio, and video.

As a sparse Mixture-of-Experts (MoE) model, it shifts the AI landscape toward active agents, featuring a record-breaking 64k output limit and a massive context window capable of processing hour-long videos or entire codebases in a single prompt. Its core differentiator is 'Anti-gravity,' a unified platform that enables the model to execute code and interact with computer environments in real-time.

By closing the loop between reasoning and environmental interaction, Gemini 3 Pro transitions the LLM from a passive advisor to an autonomous operator. It achieves state-of-the-art scores across nearly every major reasoning benchmark, effectively setting the new standard for what constitutes a frontier AI model in the agentic era.

Gemini 3 Pro

Use Cases for Gemini 3 Pro

Discover the different ways you can use Gemini 3 Pro to achieve great results.

Autonomous Frontend Development

Leveraging the 'Anti-gravity' loop to one-shot complex React/Next.js interfaces by observing and fixing visual bugs in real-time.

Long-Form Video Intelligence

Analyzing hour-long surveillance or meeting footage frame-by-frame to identify specific events or extract detailed meeting minutes.

Agentic Research Orchestration

Managing 'Gemini Deep Research' agents to synthesize thousands of technical papers into a single coherent report.

Complex Logic & Math Competitions

Solving IMO-level mathematical proofs and AIME problems with nearly 100% accuracy using extended thinking time.

Multi-Modal Document Parsing

Processing entire folders of medical records or financial statements to find cross-document patterns and anomalies.

Real-Time Game State Analysis

Acting as a high-level strategist in complex games like Pokémon Crystal or Minecraft by understanding visual game state directly.

Strengths

Limitations

Record-Breaking Multimodality: Native frame-by-frame video understanding that crushes the competition on VideoMMMU.
Context Scaling Costs: A significant price jump (2x) occurs once your session exceeds the 200,000-token mark.
Fluid Reasoning Mastery: A massive 31% on ARC-AGI v2, nearly doubling the fluid intelligence performance of previous frontier models.
Aggressive Safety Filters: Known to refuse benign chemistry or medical queries if they resemble restricted topics.
Enormous Output Buffer: A 64k output token limit allows for generating entire modules or long-form books in one go.
Hallucination Spikes: Despite its intelligence, it maintains an 88% hallucination rate in specific long-horizon reasoning benchmarks.
Agentic Native Core: Designed specifically for tool-use and autonomous computer interaction via the Anti-gravity platform.
Senior Ego Syndrome: Frequently declares a task 'complete' while logs still show errors, requiring manual oversight for complex code.

API Quick Start

google/gemini-3-pro-preview

View Documentation
google SDK
import { GoogleGenAI } from "@google/genai";

const client = new GoogleGenAI({ apiKey: process.env.GOOGLE_API_KEY });
const model = client.getGenerativeModel({ model: "gemini-3-pro-preview" });

async function run() {
  const result = await model.generateContent({
    contents: [{ role: "user", parts: [{ text: "Analyze this codebase for security flaws." }] }],
    generationConfig: { maxOutputTokens: 64000, thinking: true }
  });
  console.log(result.response.text());
}
run();

Install the SDK and start making API calls in minutes.

What People Are Saying About Gemini 3 Pro

See what the community thinks about Gemini 3 Pro

"Gemini 3 Pro is great for code review, but I just use 5.2 exclusively - the benefit of 5.2 Pro on API is tremendous"
zazizazizu
reddit
"It's very obvious they trained Gemini 3.0 to make it more neutral... it rejects 'balance' on science but maintains a neutral point of view on politics"
tarvispickles
reddit
"Gemini 3 Pro led by a wide margin with a score of 83.64 on the SuperCLUE-VLM benchmark"
Dantop Boone
x
"Gemini 3 Pro completed Pokémon Crystal using 50% fewer tokens than 2.5 Pro. It defeated Red!"
Justin
x
"It literally looking frame by frame at the video unlike any other model... understand it frame by frame"
Matthew Berman
youtube
"Google launched its deepest AI research agent yet... based on Gemini 3 Pro"
Think AI
x

Videos About Gemini 3 Pro

Watch tutorials, reviews, and discussions about Gemini 3 Pro

Gemini 3 Pro gets 37.5% [on HLE]... a huge leap above GPT 5.1 and that's a theme you'll see recurring.

Gemini 3 Pro almost doubles the performance of GPT 5.1 on ARC AGI 2 visual reasoning puzzles.

Google trained Gemini 3 on their own in-house TPUs, not Nvidia's GPUs. They may be the only company that can afford to serve this.

We are seeing a massive jump in the reasoning density of these models compared to the previous generation.

The multimodal performance here really sets a new baseline for what we expect from frontier models.

It literally looking frame by frame at the video unlike any other model... understand it frame by frame.

It can load a full YouTube video and understand it... I use this all the time for chapter markers.

The ability to handle long context without losing focus is where Gemini 3 really shines.

I've tested its vision on complex technical diagrams and it's significantly more accurate than GPT.

Google's integration with AI Studio makes testing these advanced features very straightforward.

Pricing is $2 per 1M input / $12 per 1M output tokens... it's token-heavy and expensive.

It acts like a senior engineer who says 'got it done' but needs babysitting... verify its own work.

The context window is actually insane, you can just dump an entire repo in there.

If you're building agents, the function calling reliability on this model is a game changer.

You have to be careful with the safety filters, they can be a bit over-tuned on certain topics.

More than just prompts

Supercharge your workflow with AI Automation

Automatio combines the power of AI agents, web automation, and smart integrations to help you accomplish more in less time.

AI Agents
Web Automation
Smart Workflows
Watch demo video

Pro Tips

Expert tips to help you get the most out of this model and achieve better results.

Dynamic Thinking

For simpler tasks, use the model in 'Flash' mode to save costs; only invoke the 'Deep Think' or 'Pro' modes for tasks requiring ARC-AGI level reasoning.

Context Management

While the window is 1M tokens, pricing doubles after 200k tokens. Use selective context engineering to keep sessions under this threshold for better ROI.

The 'Senior Engineer' Strategy

When coding, treat the model as a senior developer who needs verification—always ask it to 'double-check logs and verify imports' to mitigate its 88% hallucination quirk in complex environments.

Testimonials

What Our Users Say

Join thousands of satisfied users who have transformed their workflow

Jonathan Kogan

Jonathan Kogan

Co-Founder/CEO, rpatools.io

Automatio is one of the most used for RPA Tools both internally and externally. It saves us countless hours of work and we realized this could do the same for other startups and so we choose Automatio for most of our automation needs.

Mohammed Ibrahim

Mohammed Ibrahim

CEO, qannas.pro

I have used many tools over the past 5 years, Automatio is the Jack of All trades.. !! it could be your scraping bot in the morning and then it becomes your VA by the noon and in the evening it does your automations.. its amazing!

Ben Bressington

Ben Bressington

CTO, AiChatSolutions

Automatio is fantastic and simple to use to extract data from any website. This allowed me to replace a developer and do tasks myself as they only take a few minutes to setup and forget about it. Automatio is a game changer!

Sarah Chen

Sarah Chen

Head of Growth, ScaleUp Labs

We've tried dozens of automation tools, but Automatio stands out for its flexibility and ease of use. Our team productivity increased by 40% within the first month of adoption.

David Park

David Park

Founder, DataDriven.io

The AI-powered features in Automatio are incredible. It understands context and adapts to changes in websites automatically. No more broken scrapers!

Emily Rodriguez

Emily Rodriguez

Marketing Director, GrowthMetrics

Automatio transformed our lead generation process. What used to take our team days now happens automatically in minutes. The ROI is incredible.

Jonathan Kogan

Jonathan Kogan

Co-Founder/CEO, rpatools.io

Automatio is one of the most used for RPA Tools both internally and externally. It saves us countless hours of work and we realized this could do the same for other startups and so we choose Automatio for most of our automation needs.

Mohammed Ibrahim

Mohammed Ibrahim

CEO, qannas.pro

I have used many tools over the past 5 years, Automatio is the Jack of All trades.. !! it could be your scraping bot in the morning and then it becomes your VA by the noon and in the evening it does your automations.. its amazing!

Ben Bressington

Ben Bressington

CTO, AiChatSolutions

Automatio is fantastic and simple to use to extract data from any website. This allowed me to replace a developer and do tasks myself as they only take a few minutes to setup and forget about it. Automatio is a game changer!

Sarah Chen

Sarah Chen

Head of Growth, ScaleUp Labs

We've tried dozens of automation tools, but Automatio stands out for its flexibility and ease of use. Our team productivity increased by 40% within the first month of adoption.

David Park

David Park

Founder, DataDriven.io

The AI-powered features in Automatio are incredible. It understands context and adapts to changes in websites automatically. No more broken scrapers!

Emily Rodriguez

Emily Rodriguez

Marketing Director, GrowthMetrics

Automatio transformed our lead generation process. What used to take our team days now happens automatically in minutes. The ROI is incredible.

Frequently Asked Questions

Find answers to common questions about this model