google

Gemini 3 Pro

Google's Gemini 3 Pro is a multimodal powerhouse featuring a 1M token context window, native video processing, and industry-leading reasoning performance.

Multimodal AILong ContextFrontier ModelAGI-Ready
google logogoogleGemini 3November 18, 2025
Context
1.0Mtokens
Max Output
66Ktokens
Input Price
$2.00/ 1M
Output Price
$12.00/ 1M
Modality:TextImageAudioVideo
Capabilities:VisionToolsStreamingReasoning
Benchmarks
GPQA
91.9%
GPQA: Graduate-Level Science Q&A. A rigorous benchmark with 448 multiple-choice questions in biology, physics, and chemistry created by domain experts. PhD experts only achieve 65-74% accuracy, while non-experts score just 34% even with unlimited web access (hence 'Google-proof'). Gemini 3 Pro scored 91.9% on this benchmark.
HLE
45.8%
HLE: High-Level Expertise Reasoning. Tests a model's ability to demonstrate expert-level reasoning across specialized domains. Evaluates deep understanding of complex topics that require professional-level knowledge. Gemini 3 Pro scored 45.8% on this benchmark.
MMLU
91.8%
MMLU: Massive Multitask Language Understanding. A comprehensive benchmark with 16,000 multiple-choice questions across 57 academic subjects including math, philosophy, law, and medicine. Tests broad knowledge and reasoning capabilities. Gemini 3 Pro scored 91.8% on this benchmark.
MMLU Pro
85%
MMLU Pro: MMLU Professional Edition. An enhanced version of MMLU with 12,032 questions using a harder 10-option multiple choice format. Covers Math, Physics, Chemistry, Law, Engineering, Economics, Health, Psychology, Business, Biology, Philosophy, and Computer Science. Gemini 3 Pro scored 85% on this benchmark.
SimpleQA
72.1%
SimpleQA: Factual Accuracy Benchmark. Tests a model's ability to provide accurate, factual responses to straightforward questions. Measures reliability and reduces hallucinations in knowledge retrieval tasks. Gemini 3 Pro scored 72.1% on this benchmark.
IFEval
85%
IFEval: Instruction Following Evaluation. Measures how well a model follows specific instructions and constraints. Tests the ability to adhere to formatting rules, length limits, and other explicit requirements. Gemini 3 Pro scored 85% on this benchmark.
AIME 2025
100%
AIME 2025: American Invitational Math Exam. Competition-level mathematics problems from the prestigious AIME exam designed for talented high school students. Tests advanced mathematical problem-solving requiring abstract reasoning, not just pattern matching. Gemini 3 Pro scored 100% on this benchmark.
MATH
94%
MATH: Mathematical Problem Solving. A comprehensive math benchmark testing problem-solving across algebra, geometry, calculus, and other mathematical domains. Requires multi-step reasoning and formal mathematical knowledge. Gemini 3 Pro scored 94% on this benchmark.
GSM8k
99%
GSM8k: Grade School Math 8K. 8,500 grade school-level math word problems requiring multi-step reasoning. Tests basic arithmetic and logical thinking through real-world scenarios like shopping or time calculations. Gemini 3 Pro scored 99% on this benchmark.
MGSM
93%
MGSM: Multilingual Grade School Math. The GSM8k benchmark translated into 10 languages including Spanish, French, German, Russian, Chinese, and Japanese. Tests mathematical reasoning across different languages. Gemini 3 Pro scored 93% on this benchmark.
MathVista
79%
MathVista: Mathematical Visual Reasoning. Tests the ability to solve math problems that involve visual elements like charts, graphs, geometry diagrams, and scientific figures. Combines visual understanding with mathematical reasoning. Gemini 3 Pro scored 79% on this benchmark.
SWE-Bench
76.2%
SWE-Bench: Software Engineering Benchmark. AI models attempt to resolve real GitHub issues in open-source Python projects with human verification. Tests practical software engineering skills on production codebases. Top models went from 4.4% in 2023 to over 70% in 2024. Gemini 3 Pro scored 76.2% on this benchmark.
HumanEval
93%
HumanEval: Python Programming Problems. 164 hand-written programming problems where models must generate correct Python function implementations. Each solution is verified against unit tests. Top models now achieve 90%+ accuracy. Gemini 3 Pro scored 93% on this benchmark.
LiveCodeBench
81.3%
LiveCodeBench: Live Coding Benchmark. Tests coding abilities on continuously updated, real-world programming challenges. Unlike static benchmarks, uses fresh problems to prevent data contamination and measure true coding skills. Gemini 3 Pro scored 81.3% on this benchmark.
MMMU
81%
MMMU: Multimodal Understanding. Massive Multi-discipline Multimodal Understanding benchmark testing vision-language models on college-level problems across 30 subjects requiring both image understanding and expert knowledge. Gemini 3 Pro scored 81% on this benchmark.
MMMU Pro
81%
MMMU Pro: MMMU Professional Edition. Enhanced version of MMMU with more challenging questions and stricter evaluation. Tests advanced multimodal reasoning at professional and expert levels. Gemini 3 Pro scored 81% on this benchmark.
ChartQA
81.4%
ChartQA: Chart Question Answering. Tests the ability to understand and reason about information presented in charts and graphs. Requires extracting data, comparing values, and performing calculations from visual data representations. Gemini 3 Pro scored 81.4% on this benchmark.
DocVQA
92%
DocVQA: Document Visual Q&A. Document Visual Question Answering benchmark testing the ability to extract and reason about information from document images including forms, reports, and scanned text. Gemini 3 Pro scored 92% on this benchmark.
Terminal-Bench
54.2%
Terminal-Bench: Terminal/CLI Tasks. Tests the ability to perform command-line operations, write shell scripts, and navigate terminal environments. Measures practical system administration and development workflow skills. Gemini 3 Pro scored 54.2% on this benchmark.
ARC-AGI
31.1%
ARC-AGI: Abstraction & Reasoning. Abstraction and Reasoning Corpus for AGI - tests fluid intelligence through novel pattern recognition puzzles. Each task requires discovering the underlying rule from examples, measuring general reasoning ability rather than memorization. Gemini 3 Pro scored 31.1% on this benchmark.

About Gemini 3 Pro

Learn about Gemini 3 Pro's capabilities, features, and how it can help you achieve better results.

A New Frontier in AGI

Gemini 3 Pro represents Google’s definitive leap into the frontier of Artificial General Intelligence, reclaiming the top spot in the global AI landscape upon its late 2025 release. Built on a unified 'native multimodal' architecture, the model does not merely interpret different data types through separate encoders; it perceives text, high-resolution images, professional-grade audio, and hours of video within a single transformer pass.

Unmatched Reasoning and Technical Prowess

Technically, Gemini 3 Pro is a scientific and mathematical juggernaut, achieving a perfect 100% on the AIME 2025 math exam and setting a new gold standard for expert-level knowledge on GPQA Diamond. Its massive 1 million token context window facilitates enterprise-grade workflows like 'Deep Research,' where the model autonomously navigates massive codebases or video libraries to synthesize actionable insights.

Gemini 3 Pro

Use Cases for Gemini 3 Pro

Discover the different ways you can use Gemini 3 Pro to achieve great results.

Scientific Research

Utilizing its 91.9% GPQA score to analyze complex PhD-level scientific papers and formulate novel research hypotheses.

Long-Form Video Analysis

Leveraging the 1M+ context window to natively search and summarize hours of raw video footage for media production.

Advanced Mathematics

Solving olympiad-level mathematical problems with a verified 100% success rate on the AIME 2025 benchmark.

Automated Coding

Generating and debugging entire feature sets in one shot, outperforming competitors in complex 3D simulations.

Agentic Market Simulation

Operating as a virtual product manager to simulate market conditions and test business strategies against competitive pressures.

Interactive UI Generation

Creating 'Generative Interfaces' that build mini-web pages and interactive sliders dynamically in response to user queries.

Strengths

Limitations

Perfect Math Performance: Achieved a 100% score on the AIME 2025 benchmark with internal tool-use and code execution.
Increased Context Latency: Processing the full 1M context window can lead to high Time-To-First-Token compared to Flash variants.
Unified Multimodal Architecture: Processes audio, video, and text in a single stream, capturing nuanced temporal cues.
Tiered Pricing Jump: Costs double from $2/$12 to $4/$18 per 1M tokens once a prompt exceeds 200,000 tokens of context.
Highest LMArena Elo: Reclaimed the #1 global spot with a launch Elo of 1,501, ahead of GPT-5.1 and Claude 3.7.
Hallucination Persistence: Despite knowledge gains, it maintains an 88% hallucination rate in specific factuality evaluations.
Agentic Computer Control: Exceptional grounding in professional environments, scoring 72.7% on ScreenSpot Pro.
Rotary Encoding Bias: Long-context conversations with rapid topic shifts can cause the model to glitch or ignore recent prompts.

API Quick Start

google/gemini-3-pro-preview

View Documentation
google SDK
import { GoogleGenAI } from "@google/genai";

const genAI = new GoogleGenAI("YOUR_API_KEY");
const model = genAI.getGenerativeModel({ model: "gemini-3-pro" });

async function run() {
  const prompt = "Synthesize the architectural differences in Gemini 3 Pro.";
  const result = await model.generateContent(prompt);
  const response = await result.response;
  console.log(response.text());
}

run();

Install the SDK and start making API calls in minutes.

What People Are Saying About Gemini 3 Pro

See what the community thinks about Gemini 3 Pro

"The 'vibe' of an LLM matters as much as reasoning; Gemini is the only one telling me to breathe and think."
Kargichauhan_
x
"Gemini 3 Pro is the new leader. Google has the leading language model for the first time."
Artificial Analysis
x
"The video feature is really nice; it is able to very easily identify what is in front of it."
MartonPiller012
x
"Gemini 3 models have made a significant 2X SOTA jump on ARC-AGI-2."
ARC Prize
x
"Gemini 3 Pro hitting 1500+ Elo on day one is insane. Google is back."
AI_Enthusiast_99
reddit
"The native audio understanding is night and day compared to Whisper + LLM pipelines."
DevGuru
hackernews

Videos About Gemini 3 Pro

Watch tutorials, reviews, and discussions about Gemini 3 Pro

Marks a new chapter in the race to true artificial intelligence.

Gemini 3 Pro sets a record 92% almost on GPQA Diamond.

Google trained Gemini 3 on their own in-house TPUs, not Nvidia's GPUs.

The leap in reasoning here is the largest we've seen since GPT-4.

This model is essentially a super-computer for logic.

Gemini 3 Pro is available across all of the Gemini tiers. Take note of that, OpenAI.

Calling it the best model in the world for multimodal understanding.

These agents are actually able to open up a web browser and check on their own work.

Google is finally using their scale to their advantage.

The temporal understanding of video is actually insane compared to prior models.

It beats Sonnet and GPT 5.1 in almost all benchmarks.

SVG Panda holding a burger... Even X58 wasn't this good.

Passes both the mathematics questions in the first try... Kingbench 2.0 has retired.

The context retention after 500k tokens is surprisingly solid.

Coding agents built on this are just on another level.

More than just prompts

Supercharge your workflow with AI Automation

Automatio combines the power of AI agents, web automation, and smart integrations to help you accomplish more in less time.

AI Agents
Web Automation
Smart Workflows
Watch demo video

Pro Tips for Gemini 3 Pro

Expert tips to help you get the most out of Gemini 3 Pro and achieve better results.

Leverage Prompt Caching

For repeating long-context tasks, use Google's prompt caching to reduce Time-To-First-Token and lower costs by up to 90%.

Native Multimodal Inputs

Avoid transcribing media before inputting; feed raw audio and video files directly to take advantage of native understanding.

Dynamic Thinking Mode

Use specific system instructions to trigger 'Deep Think' for math and logic tasks while maintaining standard speed for creative writing.

Context Instance Management

In very long conversations, start new instances for major topic shifts to ensure the model doesn't lose track of recent instructions.

Testimonials

What Our Users Say

Join thousands of satisfied users who have transformed their workflow

Jonathan Kogan

Jonathan Kogan

Co-Founder/CEO, rpatools.io

Automatio is one of the most used for RPA Tools both internally and externally. It saves us countless hours of work and we realized this could do the same for other startups and so we choose Automatio for most of our automation needs.

Mohammed Ibrahim

Mohammed Ibrahim

CEO, qannas.pro

I have used many tools over the past 5 years, Automatio is the Jack of All trades.. !! it could be your scraping bot in the morning and then it becomes your VA by the noon and in the evening it does your automations.. its amazing!

Ben Bressington

Ben Bressington

CTO, AiChatSolutions

Automatio is fantastic and simple to use to extract data from any website. This allowed me to replace a developer and do tasks myself as they only take a few minutes to setup and forget about it. Automatio is a game changer!

Sarah Chen

Sarah Chen

Head of Growth, ScaleUp Labs

We've tried dozens of automation tools, but Automatio stands out for its flexibility and ease of use. Our team productivity increased by 40% within the first month of adoption.

David Park

David Park

Founder, DataDriven.io

The AI-powered features in Automatio are incredible. It understands context and adapts to changes in websites automatically. No more broken scrapers!

Emily Rodriguez

Emily Rodriguez

Marketing Director, GrowthMetrics

Automatio transformed our lead generation process. What used to take our team days now happens automatically in minutes. The ROI is incredible.

Jonathan Kogan

Jonathan Kogan

Co-Founder/CEO, rpatools.io

Automatio is one of the most used for RPA Tools both internally and externally. It saves us countless hours of work and we realized this could do the same for other startups and so we choose Automatio for most of our automation needs.

Mohammed Ibrahim

Mohammed Ibrahim

CEO, qannas.pro

I have used many tools over the past 5 years, Automatio is the Jack of All trades.. !! it could be your scraping bot in the morning and then it becomes your VA by the noon and in the evening it does your automations.. its amazing!

Ben Bressington

Ben Bressington

CTO, AiChatSolutions

Automatio is fantastic and simple to use to extract data from any website. This allowed me to replace a developer and do tasks myself as they only take a few minutes to setup and forget about it. Automatio is a game changer!

Sarah Chen

Sarah Chen

Head of Growth, ScaleUp Labs

We've tried dozens of automation tools, but Automatio stands out for its flexibility and ease of use. Our team productivity increased by 40% within the first month of adoption.

David Park

David Park

Founder, DataDriven.io

The AI-powered features in Automatio are incredible. It understands context and adapts to changes in websites automatically. No more broken scrapers!

Emily Rodriguez

Emily Rodriguez

Marketing Director, GrowthMetrics

Automatio transformed our lead generation process. What used to take our team days now happens automatically in minutes. The ROI is incredible.

Related AI Models

deepseek

DeepSeek-V3.2-Speciale

deepseek

DeepSeek-V3.2-Speciale is a reasoning-first LLM featuring gold-medal math performance, DeepSeek Sparse Attention, and a 131K context window. Rivaling GPT-5...

131K context
$0.28/$0.42/1M
moonshot

Kimi K2 Thinking

moonshot

Kimi K2 Thinking is Moonshot AI's trillion-parameter reasoning model. It outperforms GPT-5 on HLE and supports 300 sequential tool calls autonomously for...

256K context
$0.15/1M
openai

GPT-5.2

openai

GPT-5.2 is OpenAI's flagship model for professional tasks, featuring a 400K context window, elite coding, and deep multi-step reasoning capabilities.

400K context
$1.75/$14.00/1M
openai

GPT-5.2 Pro

openai

GPT-5.2 Pro is OpenAI's 2025 flagship reasoning model featuring Extended Thinking for SOTA performance in mathematics, coding, and expert knowledge work.

400K context
$21.00/$168.00/1M
google

Gemini 3 Flash

google

Gemini 3 Flash is Google's high-speed multimodal model featuring a 1M token context window, elite 90.4% GPQA reasoning, and autonomous browser automation tools.

1M context
$0.50/$3.00/1M
openai

GPT-5.1

openai

GPT-5.1 is OpenAI’s advanced reasoning flagship featuring adaptive thinking, native multimodality, and state-of-the-art performance in math and technical...

400K context
$1.25/$10.00/1M
xai

Grok-4

xai

Grok-4 by xAI is a frontier model featuring a 2M token context window, real-time X platform integration, and world-record reasoning capabilities.

2M context
$3.00/$15.00/1M
anthropic

Claude Opus 4.5

anthropic

Claude Opus 4.5 is Anthropic's most powerful frontier model, delivering record-breaking 80.9% SWE-bench performance and advanced autonomous agency for coding.

200K context
$5.00/$25.00/1M

Frequently Asked Questions About Gemini 3 Pro

Find answers to common questions about Gemini 3 Pro