
GPT-5.4

GPT-5.4 is OpenAI's frontier model featuring a 1.05M context window and Extreme Reasoning. It excels at autonomous UI interaction and long-form data analysis.

OpenAI · GPT-5 series · 1M Context · Reasoning · Multimodal
Released March 4, 2026
Context: 1.05M tokens
Max Output: 128K tokens
Input Price: $2.50 / 1M tokens
Output Price: $15.00 / 1M tokens
Modality: Text, Image
Capabilities: Vision, Tools, Streaming, Reasoning
Benchmarks
GPQA
84.2%
GPQA: Graduate-Level Science Q&A. A rigorous benchmark with 448 multiple-choice questions in biology, physics, and chemistry created by domain experts. PhD experts only achieve 65-74% accuracy, while non-experts score just 34% even with unlimited web access (hence 'Google-proof'). GPT-5.4 scored 84.2% on this benchmark.
HLE
42%
HLE: Humanity's Last Exam. A frontier benchmark of expert-written questions spanning dozens of specialized academic domains, designed to remain challenging after models saturated earlier exams. Tests professional-level knowledge and multi-step reasoning. GPT-5.4 scored 42% on this benchmark.
MMLU
91%
MMLU: Massive Multitask Language Understanding. A comprehensive benchmark with 16,000 multiple-choice questions across 57 academic subjects including math, philosophy, law, and medicine. Tests broad knowledge and reasoning capabilities. GPT-5.4 scored 91% on this benchmark.
MMLU Pro
76%
MMLU Pro: MMLU Professional Edition. An enhanced version of MMLU with 12,032 questions using a harder 10-option multiple choice format. Covers Math, Physics, Chemistry, Law, Engineering, Economics, Health, Psychology, Business, Biology, Philosophy, and Computer Science. GPT-5.4 scored 76% on this benchmark.
SimpleQA
56.7%
SimpleQA: Factual Accuracy Benchmark. Tests a model's ability to provide accurate, factual responses to straightforward questions. Measures reliability and reduces hallucinations in knowledge retrieval tasks. GPT-5.4 scored 56.7% on this benchmark.
IFEval
92%
IFEval: Instruction Following Evaluation. Measures how well a model follows specific instructions and constraints. Tests the ability to adhere to formatting rules, length limits, and other explicit requirements. GPT-5.4 scored 92% on this benchmark.
AIME 2025
100%
AIME 2025: American Invitational Math Exam. Competition-level mathematics problems from the prestigious AIME exam designed for talented high school students. Tests advanced mathematical problem-solving requiring abstract reasoning, not just pattern matching. GPT-5.4 scored 100% on this benchmark.
MATH
88.6%
MATH: Mathematical Problem Solving. A comprehensive math benchmark testing problem-solving across algebra, geometry, calculus, and other mathematical domains. Requires multi-step reasoning and formal mathematical knowledge. GPT-5.4 scored 88.6% on this benchmark.
GSM8k
99%
GSM8k: Grade School Math 8K. 8,500 grade school-level math word problems requiring multi-step reasoning. Tests basic arithmetic and logical thinking through real-world scenarios like shopping or time calculations. GPT-5.4 scored 99% on this benchmark.
MGSM
96%
MGSM: Multilingual Grade School Math. The GSM8k benchmark translated into 10 languages including Spanish, French, German, Russian, Chinese, and Japanese. Tests mathematical reasoning across different languages. GPT-5.4 scored 96% on this benchmark.
MathVista
74%
MathVista: Mathematical Visual Reasoning. Tests the ability to solve math problems that involve visual elements like charts, graphs, geometry diagrams, and scientific figures. Combines visual understanding with mathematical reasoning. GPT-5.4 scored 74% on this benchmark.
SWE-Bench
52.8%
SWE-Bench: Software Engineering Benchmark. AI models attempt to resolve real GitHub issues in open-source Python projects with human verification. Tests practical software engineering skills on production codebases. Top models went from 4.4% in 2023 to over 70% in 2024. GPT-5.4 scored 52.8% on this benchmark.
HumanEval
85.1%
HumanEval: Python Programming Problems. 164 hand-written programming problems where models must generate correct Python function implementations. Each solution is verified against unit tests. Top models now achieve 90%+ accuracy. GPT-5.4 scored 85.1% on this benchmark.
LiveCodeBench
72.5%
LiveCodeBench: Live Coding Benchmark. Tests coding abilities on continuously updated, real-world programming challenges. Unlike static benchmarks, uses fresh problems to prevent data contamination and measure true coding skills. GPT-5.4 scored 72.5% on this benchmark.
MMMU
84.2%
MMMU: Multimodal Understanding. Massive Multi-discipline Multimodal Understanding benchmark testing vision-language models on college-level problems across 30 subjects requiring both image understanding and expert knowledge. GPT-5.4 scored 84.2% on this benchmark.
MMMU Pro
61%
MMMU Pro: MMMU Professional Edition. Enhanced version of MMMU with more challenging questions and stricter evaluation. Tests advanced multimodal reasoning at professional and expert levels. GPT-5.4 scored 61% on this benchmark.
ChartQA
89%
ChartQA: Chart Question Answering. Tests the ability to understand and reason about information presented in charts and graphs. Requires extracting data, comparing values, and performing calculations from visual data representations. GPT-5.4 scored 89% on this benchmark.
DocVQA
94%
DocVQA: Document Visual Q&A. Document Visual Question Answering benchmark testing the ability to extract and reason about information from document images including forms, reports, and scanned text. GPT-5.4 scored 94% on this benchmark.
Terminal-Bench
55%
Terminal-Bench: Terminal/CLI Tasks. Tests the ability to perform command-line operations, write shell scripts, and navigate terminal environments. Measures practical system administration and development workflow skills. GPT-5.4 scored 55% on this benchmark.
ARC-AGI
52.9%
ARC-AGI: Abstraction & Reasoning. Abstraction and Reasoning Corpus for AGI - tests fluid intelligence through novel pattern recognition puzzles. Each task requires discovering the underlying rule from examples, measuring general reasoning ability rather than memorization. GPT-5.4 scored 52.9% on this benchmark.

About GPT-5.4

Learn about GPT-5.4's capabilities, features, and how it can help you achieve better results.

The Frontier of Long-Context Reasoning

GPT-5.4 represents the high-performance evolution of the GPT-5 series, characterized by its industry-leading 1.05-million-token context window. This model is specifically engineered to handle expansive datasets, such as massive code repositories or multi-year historical logs, without losing the ability to perform high-fidelity reasoning. A standout feature is the interactive "Mid-Response Steering," which allows users to visually monitor and adjust the model's thinking plan in real-time, ensuring the output perfectly aligns with complex, multi-step intents.

Unified Intelligence and Autonomous Action

Technically, GPT-5.4 unifies the world-class coding strengths of the previous Codex-specific branches with the creative nuances of the standard GPT-5 series. It features a specialized "Thinking" mode with adjustable effort levels (Standard, Extended, and Heavy) that utilizes reinforced chain-of-thought processing to solve PhD-level science and logic problems. Beyond text, GPT-5.4 introduces native computer use capabilities, achieving a 75% score on OSWorld-Verified tasks by interpreting high-fidelity visual screenshots and executing coordinate-based clicks.

Efficiency and Reliability

OpenAI reports a significant 33% decrease in claim-level errors compared to its predecessors, making GPT-5.4 a premier choice for autonomous agents and high-stakes decision support. Despite its power, it is engineered for token and energy efficiency, allowing for cheaper long-context processing than previous iterations. Whether managing an entire enterprise codebase or acting as an autonomous scheduling agent, GPT-5.4 sets a new standard for reliability and agentic performance in the generative AI landscape.


Use Cases for GPT-5.4

Discover the different ways you can use GPT-5.4 to achieve great results.

Large Codebase Refactoring

Ingesting and analyzing hundreds of source files simultaneously to ensure cross-module consistency and identify deep semantic bugs across entire repositories.

Autonomous Agentic Scheduling

Interacting with email and calendars via visual grounding to autonomously coordinate complex event schedules and send follow-up communications.

High-Fidelity Architectural Design

Generating intricate 3D scenes and structural plans, such as functional subway stations, using over 1,000 lines of precise, simulation-ready code.

Long-Horizon Scientific Planning

Utilizing Extreme Reasoning to solve PhD-level scientific problems and perform multi-step analysis requiring hours of consistent state management.

Cybersecurity Incident Investigation

Processing vast quantities of raw log data within a single 1.05M context session to autonomously identify, investigate, and report security breaches.
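Fitting raw logs into a single session still means respecting the context budget. The sketch below packs log files into one prompt until a token budget is reached; the 4-characters-per-token estimate is a rough heuristic (not OpenAI's tokenizer), and the `packLogs` helper and file shape are illustrative assumptions.

```javascript
// Rough heuristic: ~4 characters per token. Not the real tokenizer.
const estimateTokens = (text) => Math.ceil(text.length / 4);

// Pack log files into one prompt without overflowing the token budget.
function packLogs(logFiles, budgetTokens) {
  const included = [];
  let used = 0;
  for (const file of logFiles) {
    const cost = estimateTokens(file.content);
    if (used + cost > budgetTokens) break; // stop before exceeding the window
    included.push(`--- ${file.name} ---\n${file.content}`);
    used += cost;
  }
  return { prompt: included.join('\n'), tokensUsed: used };
}

// Reserve headroom below the 1.05M window for the question and the reply.
const BUDGET = 1_050_000 - 130_000;
```

The resulting `prompt` string would then be sent as the user message in a single long-context request.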

Interactive Mid-Response Steering

Correcting the model's course during the internal 'thinking' phase to adjust architectural choices or logic paths without needing to restart the prompt.

Strengths

Frontier 1.05M Context Window: Provides industry-leading capacity to reason over massive datasets and codebases in a single prompt without immediate loss of coherence.
Extreme Reasoning Accuracy: Achieves PhD-level science knowledge (84.2% on GPQA) and perfect math scores (100% on AIME 2025) using its high-effort reasoning mode.
Autonomous UI Interaction: State-of-the-art visual grounding allows the model to interact with software and browsers with 75% accuracy on the OSWorld benchmark.
Token and Energy Efficiency: Engineered as OpenAI's most efficient frontier model yet, reducing the energy cost required for complex reasoning compared to the GPT-5.2 release.

Limitations

Long Context Degradation: Performance on high-complexity reasoning tasks is noted to drop significantly once the context window exceeds the 256K token mark.
Confusing Versioning Scheme: The complex lineup of 5.1, 5.2 Thinking, 5.3 Codex, and 5.4 variants creates significant cognitive load for API developers and Chat users.
High Latency in Heavy Mode: The highest reasoning effort modes can take over 8 minutes to process internal CoT, making them unsuitable for real-time interactive tasks.
Neurotic Alignment: Aggressive safety fine-tuning can lead to contrarian behavior where the model unnecessarily contradicts the user on harmless factual topics.

API Quick Start

openai/gpt-5.4

View Documentation
OpenAI SDK (Node.js)
import OpenAI from 'openai';

const openai = new OpenAI(); // reads OPENAI_API_KEY from the environment

async function main() {
  // Stream a chat completion at the highest reasoning effort level.
  const completion = await openai.chat.completions.create({
    model: "gpt-5.4",
    messages: [{ role: "user", content: "Analyze this 1.05M token log file for security threats." }],
    reasoning_effort: "heavy",
    stream: true,
  });

  // Print tokens as they arrive.
  for await (const chunk of completion) {
    process.stdout.write(chunk.choices[0]?.delta?.content || '');
  }
}

main();

Install the SDK and start making API calls in minutes.

What People Are Saying About GPT-5.4

See what the community thinks about GPT-5.4

GPT-5 is making a brutally-crushing comeback... every single line of code it generated was fully working.
immortalsol
reddit
The marquee feature is obviously the 1M context window, compared to the ~200k other models support.
Developer
hackernews
Wow, GPT 5.4 is insanely good. It should be a step bump 6.0. Hard to believe Codex has come this far.
Rahul Sood
twitter
GPT-5.4 extra high scores 94.0 on NYT Connections. It just gets stuff right, first try.
senko
hackernews
GPT-5.4 is now on the Artificial Analysis Intelligence Index... Tied with Gemini 3.1 Pro.
AiBattle
twitter
The reasoning depth is finally at the level where it can handle enterprise-scale architectural problems.
CloudArchitect99
reddit

Videos About GPT-5.4

Watch tutorials, reviews, and discussions about GPT-5.4

A 1,050,000-token context window... this is a very long context window.

In 5 minutes and 22 seconds of thinking, we then received our result... it did test this more in an agentic manner.

Updating the ability of this to look at high-fidelity images... up to 10.24 million total pixels.

The model actually performs research across the web to verify its own logic.

This is a massive leap for agentic workflows where state needs to persist.

GPT 5.4 has everything... they basically said, okay, 5.2 and GPT 5.3 Codex. Go ahead, have a baby.

The coding capabilities are ridiculous. It's essentially flawless.

Front-end taste is far behind Opus 4.6 and Gemini 3.1 Pro.

It feels like it has a much better understanding of nuanced developer intent.

The price point is competitive considering the 1M token window size.

It's clearly putting pressure on OpenAI to respond with a model that matches that 1 million context capability.

In a single shot, the fact that this model is able to create this Minecraft clone is just remarkable.

We are seeing a 33 percent reduction in factual hallucination rates.

The reasoning modes are categorized into Standard, Extended, and Heavy levels.

The visual grounding on the OSWorld benchmark is just industry leading right now.


Pro Tips for GPT-5.4

Expert tips to help you get the most out of GPT-5.4 and achieve better results.

Toggle Reasoning Effort

Use Standard, Extended, or Heavy reasoning efforts depending on the task's complexity to balance computational cost and output quality.
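One way to apply this tip programmatically is a small dispatcher that scores task complexity and picks an effort level. The scoring heuristic and the `pickReasoningEffort` helper below are illustrative assumptions; only the three effort names come from the model's documented modes.

```javascript
// Sketch: choose a reasoning effort per request to balance cost and quality.
// The complexity scoring here is an illustrative heuristic, not an API feature.
function pickReasoningEffort(task) {
  const score =
    (task.steps ?? 1) +            // how many reasoning steps the task needs
    (task.crossFileRefs ? 2 : 0) + // multi-file / cross-module analysis
    (task.phdLevel ? 3 : 0);       // expert-level science or math
  if (score <= 2) return "standard"; // lookups, formatting, short answers
  if (score <= 4) return "extended"; // multi-step analysis
  return "heavy";                    // deep proofs, large refactors
}
```

The chosen value would be passed as `reasoning_effort` in the request shown in the Quick Start above.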

Monitor the Upfront Plan

When using the Thinking variant, watch the upfront plan; you can intervene mid-generation if the model's proposed logic path seems flawed.

Strategic Prompt Caching

Place large, static context blocks at the beginning of your prompt to take advantage of OpenAI's automatic prompt caching for cost savings.
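A minimal sketch of that ordering principle: keep the large static block at the front so the prefix is byte-identical across calls, and append only the dynamic part at the end. The `buildCachedMessages` helper is a hypothetical name; only the ordering rule comes from the tip above.

```javascript
// Sketch: order messages so the static prefix stays identical across calls,
// which is what makes automatic prompt caching effective.
function buildCachedMessages(staticContext, question) {
  return [
    // Large, unchanging prefix — eligible for prefix-based caching.
    { role: "system", content: staticContext },
    // Dynamic part last, so it never invalidates the cached prefix.
    { role: "user", content: question },
  ];
}
```

Each new question reuses the same `staticContext` string unmodified, so repeated calls share the cached prefix.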

Manage Context Stability

While the 1.05M window is robust, performance is reported to be most stable within the first 256K tokens; keep critical summaries near the prompt end.
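The sketch below applies this tip: it places the critical summary near the end of the prompt and flags requests that exceed the reported 256K stability range. The chars/4 token estimate is a rough heuristic, and `assemblePrompt` is a hypothetical helper, not an SDK function.

```javascript
// Threshold taken from the stability note above.
const STABLE_TOKENS = 256_000;
// Rough heuristic: ~4 characters per token. Not the real tokenizer.
const roughTokens = (s) => Math.ceil(s.length / 4);

// Assemble a long-context prompt with the critical summary near the end,
// and flag prompts that exceed the reported stable range.
function assemblePrompt(bulkContext, criticalSummary, question) {
  const total =
    roughTokens(bulkContext) + roughTokens(criticalSummary) + roughTokens(question);
  return {
    prompt: `${bulkContext}\n\n${criticalSummary}\n\n${question}`,
    beyondStableRange: total > STABLE_TOKENS, // degraded-accuracy risk flag
  };
}
```

When `beyondStableRange` is true, a caller might summarize or trim the bulk context before sending the request.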

Testimonials

What Our Users Say

Join thousands of satisfied users who have transformed their workflow

Jonathan Kogan

Co-Founder/CEO, rpatools.io

Automatio is one of the most used for RPA Tools both internally and externally. It saves us countless hours of work and we realized this could do the same for other startups and so we choose Automatio for most of our automation needs.

Mohammed Ibrahim

CEO, qannas.pro

I have used many tools over the past 5 years, Automatio is the Jack of All trades.. !! it could be your scraping bot in the morning and then it becomes your VA by the noon and in the evening it does your automations.. its amazing!

Ben Bressington

CTO, AiChatSolutions

Automatio is fantastic and simple to use to extract data from any website. This allowed me to replace a developer and do tasks myself as they only take a few minutes to setup and forget about it. Automatio is a game changer!

Sarah Chen

Head of Growth, ScaleUp Labs

We've tried dozens of automation tools, but Automatio stands out for its flexibility and ease of use. Our team productivity increased by 40% within the first month of adoption.

David Park

Founder, DataDriven.io

The AI-powered features in Automatio are incredible. It understands context and adapts to changes in websites automatically. No more broken scrapers!

Emily Rodriguez

Marketing Director, GrowthMetrics

Automatio transformed our lead generation process. What used to take our team days now happens automatically in minutes. The ROI is incredible.


Related AI Models

xai

Grok-3

xAI

Grok-3 is xAI's flagship reasoning model, featuring deep logic deduction, a 128k context window, and real-time integration with X for live research and coding.

128K context
$3.00/$15.00/1M
anthropic

Claude 3.7 Sonnet

Anthropic

Claude 3.7 Sonnet is Anthropic's first hybrid reasoning model, delivering state-of-the-art coding capabilities, a 200k context window, and visible thinking.

200K context
$3.00/$15.00/1M
anthropic

Claude Sonnet 4.5

Anthropic

Anthropic's Claude Sonnet 4.5 delivers world-leading coding (77.2% SWE-bench) and a 200K context window, optimized for the next generation of autonomous agents.

200K context
$3.00/$15.00/1M
zhipu

GLM-4.7

Zhipu (GLM)

GLM-4.7 by Zhipu AI is a flagship 358B MoE model featuring a 200K context window, elite 73.8% SWE-bench performance, and native Deep Thinking for agentic...

200K context
$0.60/$2.20/1M
google

Gemini 3.1 Flash-Lite

Google

Gemini 3.1 Flash-Lite is Google's fastest, most cost-efficient model. Features 1M context, native multimodality, and 363 tokens/sec speed for scale.

1M context
$0.25/$1.50/1M
anthropic

Claude Opus 4.5

Anthropic

Claude Opus 4.5 is Anthropic's most powerful frontier model, delivering record-breaking 80.9% SWE-bench performance and advanced autonomous agency for coding.

200K context
$5.00/$25.00/1M
openai

GPT-5.3 Codex

OpenAI

GPT-5.3 Codex is OpenAI's 2026 frontier coding agent, featuring a 400K context window, 77.3% Terminal-Bench score, and superior logic for complex software...

400K context
$1.75/$14.00/1M
xai

Grok-4

xAI

Grok-4 by xAI is a frontier model featuring a 2M token context window, real-time X platform integration, and world-record reasoning capabilities.

2M context
$3.00/$15.00/1M

Frequently Asked Questions About GPT-5.4

Find answers to common questions about GPT-5.4