Gemini 3.5 Flash

Gemini 3.5 Flash is Google's high-speed multimodal model with a 1M context window, optimized for sub-second agentic loops and complex coding tasks.

Tags: Multimodal AI, Agentic Workflows, 1M Context, High-Speed LLM
Google · Gemini · May 19, 2026
Context: 1.0M tokens
Max Output: 66K tokens
Input Price: $1.50 / 1M tokens
Output Price: $9.00 / 1M tokens
Modality: Text, Image, Audio, Video
Capabilities: Vision, Tools, Streaming, Reasoning
Benchmarks
GPQA
74%
GPQA: Graduate-Level Science Q&A. A rigorous benchmark with 448 multiple-choice questions in biology, physics, and chemistry created by domain experts. PhD experts only achieve 65-74% accuracy, while non-experts score just 34% even with unlimited web access (hence 'Google-proof'). Gemini 3.5 Flash scored 74% on this benchmark.
HLE
34%
HLE: Humanity's Last Exam. Tests a model's ability to demonstrate expert-level reasoning across specialized domains. Evaluates deep understanding of complex topics that require professional-level knowledge. Gemini 3.5 Flash scored 34% on this benchmark.
MMLU
89%
MMLU: Massive Multitask Language Understanding. A comprehensive benchmark with 16,000 multiple-choice questions across 57 academic subjects including math, philosophy, law, and medicine. Tests broad knowledge and reasoning capabilities. Gemini 3.5 Flash scored 89% on this benchmark.
MMLU Pro
83%
MMLU Pro: MMLU Professional Edition. An enhanced version of MMLU with 12,032 questions using a harder 10-option multiple choice format. Covers Math, Physics, Chemistry, Law, Engineering, Economics, Health, Psychology, Business, Biology, Philosophy, and Computer Science. Gemini 3.5 Flash scored 83% on this benchmark.
SimpleQA
76.7%
SimpleQA: Factual Accuracy Benchmark. Tests a model's ability to provide accurate, factual responses to straightforward questions. Measures reliability and reduces hallucinations in knowledge retrieval tasks. Gemini 3.5 Flash scored 76.7% on this benchmark.
IFEval
88%
IFEval: Instruction Following Evaluation. Measures how well a model follows specific instructions and constraints. Tests the ability to adhere to formatting rules, length limits, and other explicit requirements. Gemini 3.5 Flash scored 88% on this benchmark.
AIME 2025
68%
AIME 2025: American Invitational Math Exam. Competition-level mathematics problems from the prestigious AIME exam designed for talented high school students. Tests advanced mathematical problem-solving requiring abstract reasoning, not just pattern matching. Gemini 3.5 Flash scored 68% on this benchmark.
MATH
88%
MATH: Mathematical Problem Solving. A comprehensive math benchmark testing problem-solving across algebra, geometry, calculus, and other mathematical domains. Requires multi-step reasoning and formal mathematical knowledge. Gemini 3.5 Flash scored 88% on this benchmark.
GSM8k
97%
GSM8k: Grade School Math 8K. 8,500 grade school-level math word problems requiring multi-step reasoning. Tests basic arithmetic and logical thinking through real-world scenarios like shopping or time calculations. Gemini 3.5 Flash scored 97% on this benchmark.
MGSM
92%
MGSM: Multilingual Grade School Math. The GSM8k benchmark translated into 10 languages including Spanish, French, German, Russian, Chinese, and Japanese. Tests mathematical reasoning across different languages. Gemini 3.5 Flash scored 92% on this benchmark.
MathVista
74%
MathVista: Mathematical Visual Reasoning. Tests the ability to solve math problems that involve visual elements like charts, graphs, geometry diagrams, and scientific figures. Combines visual understanding with mathematical reasoning. Gemini 3.5 Flash scored 74% on this benchmark.
SWE-Bench
55.1%
SWE-Bench: Software Engineering Benchmark. AI models attempt to resolve real GitHub issues in open-source Python projects with human verification. Tests practical software engineering skills on production codebases. Top models went from 4.4% in 2023 to over 70% in 2024. Gemini 3.5 Flash scored 55.1% on this benchmark.
HumanEval
92%
HumanEval: Python Programming Problems. 164 hand-written programming problems where models must generate correct Python function implementations. Each solution is verified against unit tests. Top models now achieve 90%+ accuracy. Gemini 3.5 Flash scored 92% on this benchmark.
LiveCodeBench
56%
LiveCodeBench: Live Coding Benchmark. Tests coding abilities on continuously updated, real-world programming challenges. Unlike static benchmarks, uses fresh problems to prevent data contamination and measure true coding skills. Gemini 3.5 Flash scored 56% on this benchmark.
MMMU
84%
MMMU: Multimodal Understanding. Massive Multi-discipline Multimodal Understanding benchmark testing vision-language models on college-level problems across 30 subjects requiring both image understanding and expert knowledge. Gemini 3.5 Flash scored 84% on this benchmark.
MMMU Pro
88.3%
MMMU Pro: MMMU Professional Edition. Enhanced version of MMMU with more challenging questions and stricter evaluation. Tests advanced multimodal reasoning at professional and expert levels. Gemini 3.5 Flash scored 88.3% on this benchmark.
ChartQA
89%
ChartQA: Chart Question Answering. Tests the ability to understand and reason about information presented in charts and graphs. Requires extracting data, comparing values, and performing calculations from visual data representations. Gemini 3.5 Flash scored 89% on this benchmark.
DocVQA
94%
DocVQA: Document Visual Q&A. Document Visual Question Answering benchmark testing the ability to extract and reason about information from document images including forms, reports, and scanned text. Gemini 3.5 Flash scored 94% on this benchmark.
Terminal-Bench
76.2%
Terminal-Bench: Terminal/CLI Tasks. Tests the ability to perform command-line operations, write shell scripts, and navigate terminal environments. Measures practical system administration and development workflow skills. Gemini 3.5 Flash scored 76.2% on this benchmark.
ARC-AGI
12%
ARC-AGI: Abstraction & Reasoning. Abstraction and Reasoning Corpus for AGI - tests fluid intelligence through novel pattern recognition puzzles. Each task requires discovering the underlying rule from examples, measuring general reasoning ability rather than memorization. Gemini 3.5 Flash scored 12% on this benchmark.

About Gemini 3.5 Flash

Learn about Gemini 3.5 Flash's capabilities, features, and how it can help you achieve better results.

High-Efficiency Agentic Performance

Gemini 3.5 Flash is a multimodal model designed for speed and complex reasoning. It supports a 1-million-token context window, enabling users to process massive data sets including hour-long videos and entire code repositories in a single prompt. The architecture is optimized for sub-second latency, targeting developers building interactive AI agents and automated workflows.

Native Multimodality and Reasoning

This model introduces a Thinking mode for advanced chain-of-thought logic. It natively processes text, images, audio, video, and PDFs, which removes the need for separate preprocessing pipelines. Benchmarks indicate that it outperforms the previous Gemini 3.1 Pro in coding and tool-use tasks while maintaining the efficiency of the Flash tier.

Production-Ready Scaling

At a cost of $1.50 per million input tokens, it provides a cost-effective path for high-volume applications. It is specifically tuned for function calling and terminal-based tasks, achieving high scores on agentic benchmarks like SWE-bench and Terminal-Bench. This makes it a primary choice for real-time coding assistants and data curation systems.
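
The listed prices translate into a per-request cost with simple arithmetic. The following is a minimal sketch, assuming the rates quoted on this page ($1.50 per million input tokens, $9.00 per million output tokens) apply uniformly to every token; `estimateCostUsd` is a hypothetical helper, not part of any SDK.

```javascript
// Listed prices in USD per 1M tokens (from this page; verify against current pricing).
const INPUT_PRICE_PER_M = 1.5;
const OUTPUT_PRICE_PER_M = 9.0;

// Hypothetical helper: estimate the cost of one request in USD.
function estimateCostUsd(inputTokens, outputTokens) {
  if (inputTokens < 0 || outputTokens < 0) {
    throw new RangeError("token counts must be non-negative");
  }
  const inputCost = (inputTokens / 1_000_000) * INPUT_PRICE_PER_M;
  const outputCost = (outputTokens / 1_000_000) * OUTPUT_PRICE_PER_M;
  return inputCost + outputCost;
}

// Example: a 200K-token repository prompt with an 8K-token answer.
console.log(estimateCostUsd(200_000, 8_000).toFixed(3)); // "0.372"
```

Output tokens cost six times as much as input tokens at these rates, so long generations can dominate the bill even when the prompt is large.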

Use Cases

Discover the different ways you can use Gemini 3.5 Flash to achieve great results.

Automated Newsroom Curation

Scanning thousands of RSS feeds and social threads to score and rank stories based on specific editorial profiles in real-time.

High-Volume Document Analysis

Processing massive archives like legal case histories to extract structured summaries and actionable insights without losing context.

Real-time Music Synthesis

Generating interactive audio tools and musical interfaces using native understanding of music theory and audio waveforms.

Interactive Browser OS Generation

Creating fully functional operating system simulations and complex UI dashboards from natural language prompts.

Rapid Code Refactoring

Executing logic updates across large codebases without consuming the higher credits required by flagship models.

Agentic Terminal Automation

Performing multi-step system tasks and coding iterations using a terminal harness to orchestrate development environments.

Strengths

Massive 1M Token Context: Supports deep analysis of long-form data including full-length videos and entire software repositories.
Exceptional Synthesis Logic: Leading performance in generating complex interactive audio tools and modern browser-based operating system simulations.
Sub-Second Latency: Optimized for extreme throughput, reaching output speeds up to 1500 tokens per second in production environments.
Agentic Performance Gains: Outperforms many larger flagship models on real-world coding tasks and terminal-based agentic benchmarks.

Limitations

Increased Pricing: Token costs have tripled compared to previous Flash preview models, moving to $1.50 input and $9 output per million tokens.
Arithmetic Inaccuracy: Occasionally struggles with basic mathematical operations, failing simple prompts that specialized reasoning models solve easily.
Context Window Degradation: Users report that retrieval reliability can diminish slightly as the context window approaches the 1-million-token limit.
3D Lighting Inconsistencies: Can produce overly dark or poorly lit environments in complex 3D simulations, requiring iterative prompting to correct.
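
Because retrieval reliability is reported to dip near the 1-million-token limit, one cautious approach is to keep each request well below the full window. The sketch below only illustrates the arithmetic: `planChunks` is a hypothetical helper, and the 80% safety fraction is an arbitrary assumption, not a documented threshold.

```javascript
const CONTEXT_WINDOW_TOKENS = 1_000_000; // context size listed for the model

// Hypothetical helper: split a corpus of `totalTokens` into evenly sized chunks
// that each stay under `safetyFraction` of the context window.
function planChunks(totalTokens, safetyFraction = 0.8) {
  if (!(safetyFraction > 0 && safetyFraction <= 1)) {
    throw new RangeError("safetyFraction must be in (0, 1]");
  }
  if (totalTokens <= 0) return { chunks: 0, tokensPerChunk: 0 };
  const budget = Math.floor(CONTEXT_WINDOW_TOKENS * safetyFraction);
  const chunks = Math.ceil(totalTokens / budget);
  const tokensPerChunk = Math.ceil(totalTokens / chunks);
  return { chunks, tokensPerChunk };
}

console.log(planChunks(2_500_000)); // { chunks: 4, tokensPerChunk: 625000 }
```

Splitting evenly, rather than filling each chunk to the budget, avoids a small trailing chunk that carries little context.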

API Quick Start

google/gemini-3.5-flash

google SDK
import { GoogleGenAI } from "@google/genai";

// The @google/genai client takes an options object, not a bare key string.
const ai = new GoogleGenAI({ apiKey: process.env.GOOGLE_API_KEY });

async function run() {
  const prompt = "Build a fully interactive 3D synthwave landscape using Three.js.";
  // The model name and generation settings are passed with each request.
  const response = await ai.models.generateContent({
    model: "gemini-3.5-flash",
    contents: prompt,
    config: { maxOutputTokens: 65536 },
  });
  console.log(response.text);
}

run().catch(console.error);

Install the SDK and start making API calls in minutes.

Community Feedback

See what the community thinks about Gemini 3.5 Flash

"Gemini 3.5 Flash is the clear leader on the Intelligence vs Speed Pareto frontier and makes large gains on real-world agentic tasks."
Artificial Analysis (Twitter)

"Gemini 3 is brilliant for UK business use. It captures nuanced politeness levels and UK-specific tax assumptions better than US-centric models."
Efficient_Degree9569 (Reddit)

"This model is so like it loves music stuff. It is very, very fast and the audio synthesizer it generated had me completely sold."
Bjaman (YouTube)

"Gemini 3.5 Flash is definitely outperforming the previous Pro model on coding related things, which is huge for agentic developers."
DevGuru99 (Reddit)

"Google just released Gemini 3.5 Flash. The interesting part is not just that it’s faster. Google is positioning this as the agentic king."
TestingCatalog (Twitter)

"Gemini 3.5 Flash is super strong model for its class. Beats Gemini 3.1 Pro on so many benchmarks."
AI_Expert (Twitter)

Related Videos

Watch tutorials, reviews, and discussions about Gemini 3.5 Flash

Gemini 3.5 Flash has been released from Google. And this is hypothetically a pretty big leap in performance.

This is the best one that I've seen, period. Even if it doesn't work, this model is so like it loves music stuff.

I noticed it is very, very fast and it really does seem to like music.

The reasoning capabilities for such a small, fast model are genuinely impressive during these code tests.

It managed to create a working three-dimensional synth environment from a single prompt without errors.

Gemini 3.5 Flash completely shocked me. Not only was it insanely fast but it actually completed the task better than Opus.

Gemini 3.5 Flash finished this task within a minute. This is really insane. The speed of Gemini 3.5 Plus is insane.

Gemini 3.5 Flash did it in just $0.36, whereas Claude Opus did it in almost double price.

The multi-modal understanding here is clearly a step up from the previous Flash version.

You are getting near flagship intelligence for a fraction of the token cost.

This is a model that's positioned as Google's strongest agentic coding model yet, above the Gemini 3.1 Pro.

The quality jump is very noticeable. Hallucination rates have reportedly reduced from 91% to 61% which is remarkable.

Gemini 3.5 Flash excels obviously for its price and quality SVG art as well as working in 3JS.

It natively processes video and audio, allowing for much more accurate extraction of temporal data.

The new thinking mode helps developers audit exactly how the model plans its tool usage.

Pro Tips

Expert tips to help you get the most out of Gemini 3.5 Flash and achieve better results.

Enable Thinking Mode

Toggle the thinking setting in the API or Google AI Studio to activate advanced chain-of-thought reasoning for engineering problems.
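
As a rough illustration of toggling thinking through the API, the sketch below builds the argument object that would be passed to `ai.models.generateContent()` in the `@google/genai` SDK. `buildRequest` is a hypothetical helper. The `thinkingConfig` fields (`thinkingBudget`, `includeThoughts`) exist in that SDK and are documented for earlier Gemini models; whether Gemini 3.5 Flash honors them in the same way is an assumption.

```javascript
// Hypothetical helper: build the request object for ai.models.generateContent().
// ASSUMPTION: gemini-3.5-flash accepts the same thinkingConfig fields as earlier models.
function buildRequest(prompt, { thinking = false } = {}) {
  const request = {
    model: "gemini-3.5-flash",
    contents: prompt,
    config: {},
  };
  if (thinking) {
    request.config.thinkingConfig = {
      thinkingBudget: -1,    // -1 requests a dynamic budget on earlier Gemini models
      includeThoughts: true, // ask for thought summaries alongside the answer
    };
  }
  return request;
}

// Inspect the payload before sending it.
console.log(JSON.stringify(buildRequest("Plan a database migration.", { thinking: true }), null, 2));
```

Leaving `thinking` off sends an empty config, so the model falls back to its default behavior.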

Leverage Native Multimodality

Upload raw audio or video files directly for analysis to preserve temporal and tonal data instead of using external transcripts.

Specify Constraints Verbatim

The model follows negative constraints strictly. Use instructions like 'No explanations' for raw code output to minimize latency.

Apply The High-Low Strategy

Use Flash for high-volume tasks like UI drafting and only use Pro models for final architectural verification.
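
The high-low split can be expressed as a small routing function. This is a minimal sketch under stated assumptions: the task categories are invented for illustration, `pickModel` is a hypothetical helper, and the Pro model identifier is a placeholder to replace with whichever model you actually use.

```javascript
// The Flash identifier matches the API quick start; the Pro identifier is a placeholder.
const FLASH_MODEL = "gemini-3.5-flash";
const PRO_MODEL = "your-pro-model-id";

// Hypothetical task categories that justify the more expensive tier.
const HIGH_STAKES_TASKS = new Set(["architecture-review", "final-verification"]);

// Route high-volume work to Flash and reserve Pro for final checks.
function pickModel(taskType) {
  return HIGH_STAKES_TASKS.has(taskType) ? PRO_MODEL : FLASH_MODEL;
}

console.log(pickModel("ui-draft"));            // gemini-3.5-flash
console.log(pickModel("architecture-review")); // your-pro-model-id
```

Keeping the routing rule in one place makes it easy to adjust which tasks are escalated as costs or quality requirements change.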

Related AI Models

Claude 3.7 Sonnet (Anthropic)

Claude 3.7 Sonnet is Anthropic's first hybrid reasoning model, delivering state-of-the-art coding capabilities, a 200k context window, and visible thinking.

200K context
$3.00/$15.00 per 1M tokens

Claude 4.5 Sonnet (Anthropic)

Anthropic's Claude Sonnet 4.5 delivers world-leading coding (77.2% SWE-bench) and a 200K context window, optimized for the next generation of autonomous agents.

200K context
$3.00/$15.00 per 1M tokens

GPT-5.3 Codex (OpenAI)

GPT-5.3 Codex is OpenAI's 2026 frontier coding agent, featuring a 400K context window, 77.3% Terminal-Bench score, and superior logic for complex software...

400K context
$1.75/$14.00 per 1M tokens

Qwen3.5-Omni (Alibaba)

Qwen3.5-Omni is a natively omnimodal AI by Alibaba Cloud, offering seamless audio-visual reasoning, real-time voice chat, and 256k context for low-latency apps.

256K context
$0.40/$4.80 per 1M tokens

GPT-5.4 (OpenAI)

GPT-5.4 is OpenAI's frontier model featuring a 1.05M context window and Extreme Reasoning. It excels at autonomous UI interaction and long-form data analysis.

1M context
$2.50/$15.00 per 1M tokens

Kimi K2 Thinking (Moonshot)

Kimi K2 Thinking is Moonshot AI's trillion-parameter reasoning model. It outperforms GPT-5 on HLE and supports 300 sequential tool calls autonomously for...

256K context
$0.60/$2.50 per 1M tokens

GPT-5.2 (OpenAI)

GPT-5.2 is OpenAI's flagship model for professional tasks, featuring a 400K context window, elite coding, and deep multi-step reasoning capabilities.

400K context
$1.75/$14.00 per 1M tokens

Qwen3.6-Max-Preview (Alibaba)

Qwen3.6-Max-Preview is Alibaba's flagship MoE model featuring 1M context, a native thinking mode, and SOTA scores in agentic coding and reasoning.

1M context
$1.25/$10.00 per 1M tokens

Frequently Asked Questions

Find answers to common questions about Gemini 3.5 Flash