google

Gemini 3 Flash

Gemini 3 Flash is Google's high-speed multimodal model featuring a 1M token context window, elite 90.4% GPQA reasoning, and autonomous browser automation tools.

google logogoogleGemini 32025-12-17
Context
1.0Mtokens
Max Output
66Ktokens
Input Price
$0.50/ 1M
Output Price
$3.00/ 1M
Modality:TextImageAudioVideo
Capabilities:VisionToolsStreamingReasoning
Benchmarks
GPQA
90.4%
GPQA: Graduate-Level Science Q&A. A rigorous benchmark with 448 multiple-choice questions in biology, physics, and chemistry created by domain experts. PhD experts only achieve 65-74% accuracy, while non-experts score just 34% even with unlimited web access (hence 'Google-proof'). Gemini 3 Flash scored 90.4% on this benchmark.
HLE
33.7%
HLE: High-Level Expertise Reasoning. Tests a model's ability to demonstrate expert-level reasoning across specialized domains. Evaluates deep understanding of complex topics that require professional-level knowledge. Gemini 3 Flash scored 33.7% on this benchmark.
MMLU
88.2%
MMLU: Massive Multitask Language Understanding. A comprehensive benchmark with 16,000 multiple-choice questions across 57 academic subjects including math, philosophy, law, and medicine. Tests broad knowledge and reasoning capabilities. Gemini 3 Flash scored 88.2% on this benchmark.
MMLU Pro
88.6%
MMLU Pro: MMLU Professional Edition. An enhanced version of MMLU with 12,032 questions using a harder 10-option multiple choice format. Covers Math, Physics, Chemistry, Law, Engineering, Economics, Health, Psychology, Business, Biology, Philosophy, and Computer Science. Gemini 3 Flash scored 88.6% on this benchmark.
SimpleQA
58%
SimpleQA: Factual Accuracy Benchmark. Tests a model's ability to provide accurate, factual responses to straightforward questions. Measures reliability and reduces hallucinations in knowledge retrieval tasks. Gemini 3 Flash scored 58% on this benchmark.
AIME 2025
99.7%
AIME 2025: American Invitational Math Exam. Competition-level mathematics problems from the prestigious AIME exam designed for talented high school students. Tests advanced mathematical problem-solving requiring abstract reasoning, not just pattern matching. Gemini 3 Flash scored 99.7% on this benchmark.
GSM8k
94%
GSM8k: Grade School Math 8K. 8,500 grade school-level math word problems requiring multi-step reasoning. Tests basic arithmetic and logical thinking through real-world scenarios like shopping or time calculations. Gemini 3 Flash scored 94% on this benchmark.
SWE-Bench
78%
SWE-Bench: Software Engineering Benchmark. AI models attempt to resolve real GitHub issues in open-source Python projects with human verification. Tests practical software engineering skills on production codebases. Top models went from 4.4% in 2023 to over 70% in 2024. Gemini 3 Flash scored 78% on this benchmark.
HumanEval
92%
HumanEval: Python Programming Problems. 164 hand-written programming problems where models must generate correct Python function implementations. Each solution is verified against unit tests. Top models now achieve 90%+ accuracy. Gemini 3 Flash scored 92% on this benchmark.
MMMU
82.5%
MMMU: Multimodal Understanding. Massive Multi-discipline Multimodal Understanding benchmark testing vision-language models on college-level problems across 30 subjects requiring both image understanding and expert knowledge. Gemini 3 Flash scored 82.5% on this benchmark.
MMMU Pro
52%
MMMU Pro: MMMU Professional Edition. Enhanced version of MMMU with more challenging questions and stricter evaluation. Tests advanced multimodal reasoning at professional and expert levels. Gemini 3 Flash scored 52% on this benchmark.
ChartQA
89%
ChartQA: Chart Question Answering. Tests the ability to understand and reason about information presented in charts and graphs. Requires extracting data, comparing values, and performing calculations from visual data representations. Gemini 3 Flash scored 89% on this benchmark.
DocVQA
94%
DocVQA: Document Visual Q&A. Document Visual Question Answering benchmark testing the ability to extract and reason about information from document images including forms, reports, and scanned text. Gemini 3 Flash scored 94% on this benchmark.
Terminal-Bench
55%
Terminal-Bench: Terminal/CLI Tasks. Tests the ability to perform command-line operations, write shell scripts, and navigate terminal environments. Measures practical system administration and development workflow skills. Gemini 3 Flash scored 55% on this benchmark.
ARC-AGI
84.6%
ARC-AGI: Abstraction & Reasoning. Abstraction and Reasoning Corpus for AGI - tests fluid intelligence through novel pattern recognition puzzles. Each task requires discovering the underlying rule from examples, measuring general reasoning ability rather than memorization. Gemini 3 Flash scored 84.6% on this benchmark.

About Gemini 3 Flash

Learn about Gemini 3 Flash's capabilities, features, and how it can help you achieve better results.

The Performance Powerhouse of Gemini 3

Gemini 3 Flash is Google's frontier-class multimodal model optimized for extreme speed and massive scalability. Developed by Google DeepMind, it serves as the efficiency-first workhorse of the Gemini 3 ecosystem, delivering high-quality reasoning and native multimodal processing across text, code, images, and audio. It is specifically designed for high-volume enterprise workloads where low latency and cost-effectiveness are paramount.

Unprecedented Context and Agency

The model features a massive 1-million-token context window, allowing it to process entire code repositories, hours of video, or thousands of pages of documentation in a single prompt. More than just a chatbot, it is engineered for agency. Integrated with Google's Stagehand and Nano Browser APIs, it can autonomously navigate the web, execute multi-step digital tasks, and interact with live web elements as a human would.

Elite Scientific Reasoning

While optimized for speed, Gemini 3 Flash does not sacrifice intelligence. Through the specialized Deep Think activation protocol, the model can trigger internal chain-of-thought processes to solve PhD-level problems in math, science, and logic. This dual nature allows it to switch between rapid data extraction and sophisticated, expert-level analysis with simple system instructions.

Gemini 3 Flash

Use Cases

Discover the different ways you can use Gemini 3 Flash to achieve great results.

Autonomous Web Navigation

Execute multi-step web tasks like booking travel or competitor research using the Nano Browser API.

Large-scale Code Refactoring

Ingest and analyze entire software repositories using the 1-million-token window to map dependency logic.

Multimodal Content Auditing

Analyze hours of video or hundreds of technical PDFs to extract specific visual patterns and structured data.

Real-time Customer Support

Power responsive chatbots that handle complex multimodal queries with sub-second response times.

Scientific Research Synthesis

Analyze PhD-level papers and datasets to propose experimental designs using the Deep Think protocol.

Interactive Tutoring

Provide step-by-step tutoring for advanced mathematics with internal chain-of-thought explanations.

Strengths

Limitations

Unrivaled Spatial Reasoning: Achieves top-tier results in visual understanding, excelling at precise SVG generation and screen analysis.
High Hallucination Rate: Measured at a 91% tendency to fabricate plausible responses rather than admitting a lack of specific information.
Elite Coding Efficiency: Scores 78% on SWE-bench Verified, making it faster and more accurate for software engineering than many Pro models.
Reasoning Token Overhead: Deep Think mode generates high output token volume, which can significantly increase the total cost per request.
Massive 1M Context Window: The huge token capacity allows the model to process hours of video or entire project directories without data loss.
Instruction Following Gaps: Occasionally struggles with negative constraints, such as including unwanted UI elements when specifically told to avoid them.
High Inference Speed: Optimized for sub-second latency, making it the fastest frontier-class model currently available in the Gemini family.
Unstable API Experience: Developer endpoints are noted for frequent breaking changes and inconsistent documentation compared to competitors.

API Quick Start

google/gemini-3-flash

View Documentation
google SDK
import { GoogleGenAI } from "@google/genai";

const genAI = new GoogleGenAI(process.env.GOOGLE_API_KEY);
const model = genAI.getGenerativeModel({ 
  model: "gemini-3-flash",
  thinkingMode: true 
});

const prompt = "Analyze the spatial layout of this UI screenshot for accessibility.";
const result = await model.generateContent(prompt);
console.log(result.response.text());

Install the SDK and start making API calls in minutes.

Community Feedback

See what the community thinks about Gemini 3 Flash

Gemini 3 Flash slaughtered the Pelican SVG test, the best results I've seen from any model to date.
Simon Willison
twitter
Gemini 3's thought process is wild. It actually wrestles with its own identity and system constraints in real-time.
rutan668
reddit
The knowledge density is incredible, but the hallucination rate makes it dangerous for unattended tasks.
anonymous_engineer
hackernews
Finally, a model that allows me to control the compute budget. Standard mode is lightning fast, thinking mode is brilliant.
AI_Insights_Daily
twitter
Flash 3 is the first time I felt like a 'small' model could actually replace a 'pro' model for 90% of my coding workflow.
CodeMasterV
reddit
The spatial reasoning is on another level. It understood my messy whiteboard drawing perfectly on the first try.
DesignFlow
twitter

Related Videos

Watch tutorials, reviews, and discussions about Gemini 3 Flash

It actually beats Gemini 3 Pro at coding.

MMU Pro is the number one model out of everything.

It is basically the frontier of intelligence at a fraction of the cost.

The speed at which it generates complex reasoning is just unmatched.

Google is really pushing the limits of what a 'flash' model can do.

Created a full flock of birds simulation using only 3,000 tokens while Gemini 3 Pro is still building.

One of the worst models tested on hallucinations. It will just make it up.

The context window is the real star here, handling entire repos easily.

Don't trust it for factual history or niche technical data without RAG.

It's the ultimate tool for visual analysis of logs and dashboards.

Its understanding of spatial reasoning is best in class.

91% of the time it doesn't know, it will lie and make up an answer.

Screen understanding crushes the scores from 2.5 Flash.

The ability to parse visual UI and turn it into code is flawless.

The pricing makes it a no-brainer for high-volume agent tasks.

More than just prompts

Supercharge your workflow with AI Automation

Automatio combines the power of AI agents, web automation, and smart integrations to help you accomplish more in less time.

AI Agents
Web Automation
Smart Workflows

Pro Tips

Expert tips to help you get the most out of Gemini 3 Flash and achieve better results.

Utilize Thinking Mode

Enable 'thinkingMode' specifically for logic-heavy tasks or math problems to improve accuracy significantly.

Batch Processing for Cost

Use the Batch API for non-urgent tasks to receive a 50% discount on standard token pricing.

Optimize via MCP

Use the Model Context Protocol to integrate third-party tools seamlessly into the model's agentic workflows.

Fact-Check Critical Output

Implement verification layers for factual queries, as the model has a high hallucination rate on unknown data.

Testimonials

What Our Users Say

Join thousands of satisfied users who have transformed their workflow

Jonathan Kogan

Jonathan Kogan

Co-Founder/CEO, rpatools.io

Automatio is one of the most used for RPA Tools both internally and externally. It saves us countless hours of work and we realized this could do the same for other startups and so we choose Automatio for most of our automation needs.

Mohammed Ibrahim

Mohammed Ibrahim

CEO, qannas.pro

I have used many tools over the past 5 years, Automatio is the Jack of All trades.. !! it could be your scraping bot in the morning and then it becomes your VA by the noon and in the evening it does your automations.. its amazing!

Ben Bressington

Ben Bressington

CTO, AiChatSolutions

Automatio is fantastic and simple to use to extract data from any website. This allowed me to replace a developer and do tasks myself as they only take a few minutes to setup and forget about it. Automatio is a game changer!

Sarah Chen

Sarah Chen

Head of Growth, ScaleUp Labs

We've tried dozens of automation tools, but Automatio stands out for its flexibility and ease of use. Our team productivity increased by 40% within the first month of adoption.

David Park

David Park

Founder, DataDriven.io

The AI-powered features in Automatio are incredible. It understands context and adapts to changes in websites automatically. No more broken scrapers!

Emily Rodriguez

Emily Rodriguez

Marketing Director, GrowthMetrics

Automatio transformed our lead generation process. What used to take our team days now happens automatically in minutes. The ROI is incredible.

Jonathan Kogan

Jonathan Kogan

Co-Founder/CEO, rpatools.io

Automatio is one of the most used for RPA Tools both internally and externally. It saves us countless hours of work and we realized this could do the same for other startups and so we choose Automatio for most of our automation needs.

Mohammed Ibrahim

Mohammed Ibrahim

CEO, qannas.pro

I have used many tools over the past 5 years, Automatio is the Jack of All trades.. !! it could be your scraping bot in the morning and then it becomes your VA by the noon and in the evening it does your automations.. its amazing!

Ben Bressington

Ben Bressington

CTO, AiChatSolutions

Automatio is fantastic and simple to use to extract data from any website. This allowed me to replace a developer and do tasks myself as they only take a few minutes to setup and forget about it. Automatio is a game changer!

Sarah Chen

Sarah Chen

Head of Growth, ScaleUp Labs

We've tried dozens of automation tools, but Automatio stands out for its flexibility and ease of use. Our team productivity increased by 40% within the first month of adoption.

David Park

David Park

Founder, DataDriven.io

The AI-powered features in Automatio are incredible. It understands context and adapts to changes in websites automatically. No more broken scrapers!

Emily Rodriguez

Emily Rodriguez

Marketing Director, GrowthMetrics

Automatio transformed our lead generation process. What used to take our team days now happens automatically in minutes. The ROI is incredible.

Related AI Models

anthropic

Claude Sonnet 4.6

Anthropic

Claude Sonnet 4.6 offers frontier performance for coding and computer use with a massive 1M token context window for only $3/1M tokens.

1M context
$3.00/$15.00/1M
anthropic

Claude Opus 4.6

Anthropic

Claude Opus 4.6 is Anthropic's flagship model featuring a 1M token context window, Adaptive Thinking, and world-class coding and reasoning performance.

1M context
$5.00/$25.00/1M
google

Gemini 3 Pro

Google

Google's Gemini 3 Pro is a multimodal powerhouse featuring a 1M token context window, native video processing, and industry-leading reasoning performance.

1M context
$2.00/$12.00/1M
alibaba

Qwen3.5-397B-A17B

alibaba

Qwen3.5-397B-A17B is Alibaba's flagship open-weight MoE model. It features native multimodal reasoning, a 1M context window, and a 19x decoding throughput...

1M context
$0.40/$2.40/1M
openai

GPT-5.1

OpenAI

GPT-5.1 is OpenAI’s advanced reasoning flagship featuring adaptive thinking, native multimodality, and state-of-the-art performance in math and technical...

400K context
$1.25/$10.00/1M
moonshot

Kimi K2.5

Moonshot

Discover Moonshot AI's Kimi K2.5, a 1T-parameter open-source agentic model featuring native multimodal capabilities, a 262K context window, and SOTA reasoning.

256K context
$0.60/$3.00/1M
openai

GPT-5.2 Pro

OpenAI

GPT-5.2 Pro is OpenAI's 2025 flagship reasoning model featuring Extended Thinking for SOTA performance in mathematics, coding, and expert knowledge work.

400K context
$21.00/$168.00/1M
xai

Grok-4

xAI

Grok-4 by xAI is a frontier model featuring a 2M token context window, real-time X platform integration, and world-record reasoning capabilities.

2M context
$3.00/$15.00/1M

Frequently Asked Questions

Find answers to common questions about Gemini 3 Flash