
GPT-5.4
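GPT-5.4 is OpenAI's frontier model, with a 1.05M-token context window and deep reasoning capabilities. It excels at autonomous UI interaction and long-form data analysis.
About GPT-5.4
Learn about GPT-5.4's capabilities and features, and how it can help you get better results.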
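The Frontier of Long-Context Reasoning
GPT-5.4 is a high-performance evolution of the GPT-5 series, distinguished by an industry-leading context window of 1.05 million tokens. The model is designed to process massive datasets, such as large codebases or multi-year historical logs, without losing the ability to perform high-fidelity reasoning. A standout feature is interactive “Mid-Response Steering”, which lets users visually monitor and adjust the model's thinking plan in real time so that the output matches complex, multi-step intent.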
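Unified Intelligence and Autonomous Action
Technically, GPT-5.4 unifies the world-class coding capabilities of the earlier Codex-specific branch with the creative nuance of the standard GPT-5 series. It has a dedicated “Thinking” mode with adjustable intensity levels (Standard, Extended, and Heavy) that uses reinforced chain-of-thought processing to solve PhD-level scientific and logic problems. Beyond text, GPT-5.4 introduces native computer use: it interprets high-fidelity screenshots and executes coordinate-based clicks, scoring 75% on OSWorld-Verified tasks.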
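Coordinate-based clicking implies mapping a point reported on a screenshot back to real screen pixels, since the screenshot sent to a model is often a resized copy of the display. The following is a minimal sketch of that mapping under the assumption of a uniform resize. `toScreenPoint` and the size objects are hypothetical names chosen for illustration and are not part of any OpenAI SDK.

```javascript
// Hypothetical helper: map a click point from screenshot space to screen space.
// Assumes the screenshot is a uniformly resized copy of the full screen.
function toScreenPoint(point, screenshotSize, screenSize) {
  if (screenshotSize.width <= 0 || screenshotSize.height <= 0) {
    throw new RangeError("screenshot dimensions must be positive");
  }
  const scaleX = screenSize.width / screenshotSize.width;
  const scaleY = screenSize.height / screenshotSize.height;
  // Round to whole pixels and clamp so the click always lands on screen.
  const clamp = (value, max) => Math.min(Math.max(value, 0), max);
  return {
    x: clamp(Math.round(point.x * scaleX), screenSize.width - 1),
    y: clamp(Math.round(point.y * scaleY), screenSize.height - 1),
  };
}

// A point at the center of a 1280x720 screenshot maps to the center of a 2560x1440 screen.
const click = toScreenPoint(
  { x: 640, y: 360 },
  { width: 1280, height: 720 },
  { width: 2560, height: 1440 }
);
console.log(click); // { x: 1280, y: 720 }
```

Clamping is a defensive choice here: a point slightly outside the screenshot bounds becomes an edge click instead of an invalid coordinate.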
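Efficiency and Reliability
OpenAI reports a 33% reduction in claim-level errors compared with its predecessor, which makes GPT-5.4 a leading choice for autonomous agentic work and high-stakes decision support. Despite its capabilities, it is designed for token and energy efficiency, and it can process long contexts at lower cost than previous iterations. Whether managing an entire enterprise codebase or acting as an autonomous scheduling agent, GPT-5.4 sets a new standard for reliability and agentic performance in generative AI.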

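Use Cases for GPT-5.4
Discover different ways to get great results with GPT-5.4.
Large Codebase Refactoring
Ingest and analyze hundreds of source files at once to ensure cross-module consistency and identify deep semantic bugs across the entire codebase.
Autonomous Agentic Scheduling
Interact with email and calendars through visual grounding to autonomously coordinate complex scheduling and send follow-up communications.
High-Fidelity Architectural Design
Generate complex 3D scenes and structural plans (such as a functional subway station) using more than 1,000 lines of precise, simulation-ready code.
Long-Horizon Scientific Planning
Use deep reasoning to solve PhD-level scientific problems and carry out step-by-step analyses that require hours of sustained state management.
Cybersecurity Incident Investigation
Process massive volumes of raw log data in a single 1.05M-token context session to autonomously identify, investigate, and report security breaches.
Interactive Mid-Response Steering
Correct the model's direction during its internal “thinking” phase, adjusting architectural choices or logic paths without restarting the prompt.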
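For the long-context use cases above, it can help to check whether the input is likely to fit the window before sending it. The sketch below groups log lines into batches under a token budget. It assumes a rough heuristic of four characters per token, which is only an approximation and not the model's actual tokenizer. `estimateTokens` and `batchLines` are hypothetical helper names, and the window size is taken from the 1.05M figure quoted on this page.

```javascript
const CONTEXT_WINDOW_TOKENS = 1050000; // the 1.05M figure quoted above
const CHARS_PER_TOKEN = 4; // rough assumption; real tokenizers vary by content

// Approximate the token count of a string from its length.
function estimateTokens(text) {
  return Math.ceil(text.length / CHARS_PER_TOKEN);
}

// Group lines into batches whose estimated size stays within budgetTokens.
function batchLines(lines, budgetTokens) {
  const batches = [];
  let current = [];
  let used = 0;
  for (const line of lines) {
    const cost = estimateTokens(line) + 1; // +1 as a margin for the newline
    if (cost > budgetTokens) {
      throw new RangeError("a single line exceeds the token budget");
    }
    // Start a new batch when adding this line would overflow the current one.
    if (used + cost > budgetTokens && current.length > 0) {
      batches.push(current);
      current = [];
      used = 0;
    }
    current.push(line);
    used += cost;
  }
  if (current.length > 0) batches.push(current);
  return batches;
}

// Three 40-character lines cost 11 estimated tokens each, so a budget of 25 holds two per batch.
const demoBatches = batchLines(["a".repeat(40), "b".repeat(40), "c".repeat(40)], 25);
console.log(demoBatches.length); // 2
```

In practice the budget passed to `batchLines` would be `CONTEXT_WINDOW_TOKENS` minus whatever is reserved for instructions and the model's output.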
API Quick Start
openai/gpt-5.4
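import OpenAI from 'openai';

// The client reads the API key from the OPENAI_API_KEY environment variable.
const openai = new OpenAI();

async function main() {
  // Request a streamed completion so output prints as it arrives.
  const completion = await openai.chat.completions.create({
    model: "gpt-5.4",
    messages: [{ role: "user", content: "Analyze this 1.05M token log file for security threats." }],
    // Assumption: "high" is used as the nearest API effort level to the Heavy mode
    // described above; check the API reference for the values this model accepts.
    reasoning_effort: "high",
    stream: true,
  });
  // Each chunk carries an incremental delta; some chunks have no content.
  for await (const chunk of completion) {
    process.stdout.write(chunk.choices[0]?.delta?.content || '');
  }
}

main().catch((error) => {
  console.error(error);
  process.exitCode = 1;
});

Install the SDK and start making API calls in minutes.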
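What People Are Saying About GPT-5.4
See what the community thinks of GPT-5.4
“GPT-5 is making a strong comeback... every line of code it generates just runs.”
“The headline feature is clearly the 1M context window, while other models only support around 200k.”
“Wow, GPT-5.4 is insanely good. It should have been a generational 6.0 release. It's hard to believe how far Codex has come.”
“GPT-5.4 scored a very high 94.0 on the NYT Connections challenge. It consistently gets it right on the first try.”
“GPT-5.4 is now listed on the Artificial Analysis Intelligence Index... tied for first place with Gemini 3.1 Pro.”
“The reasoning depth has finally reached the point where it can handle enterprise-grade architecture problems.”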
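Videos About GPT-5.4
Watch tutorials, reviews, and discussions about GPT-5.4
“A context window of 1.05 million tokens... that is a very large window.”
“After 5 minutes and 22 seconds of thinking, we got the result... it really did test things in a more agentic way.”
“Updated ability to view high-fidelity images... up to 10.24 million total pixels.”
“The model actually researches on the web to verify its own logic.”
“For agentic workflows that need persistent state, this is a huge leap.”
“GPT-5.4 has it all... you can basically think of it as the perfect combination of GPT-5.2 and GPT-5.3 Codex.”
“Its coding ability is insanely good. Practically flawless.”
“On front-end aesthetics, it still trails Opus 4.6 and Gemini 3.1 Pro.”
“It feels much better at understanding the subtle intent of developers.”
“Given the 1 million token window, this price point is very competitive.”
“This clearly put pressure on OpenAI to release a model that could match the 1 million context capability.”
“In a single shot, the model was able to create a Minecraft clone, which is remarkable.”
“We are seeing a 33% reduction in the factual hallucination rate.”
“The reasoning mode comes in three levels: Standard, Extended, and Heavy.”
“Visual grounding on the OSWorld benchmark is currently industry-leading.”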
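GPT-5.4 Pro Tips
Expert tips to help you get the most out of GPT-5.4.
Switch Reasoning Intensity
Choose Standard, Extended, or Heavy reasoning intensity according to task complexity to balance compute cost against output quality.
Monitor the Upfront Plan
When using the Thinking variant, watch its upfront plan; if the logic path the model proposes looks wrong, you can intervene mid-generation.
Strategic Prompt Caching
Place large static context blocks at the beginning of the prompt to take advantage of OpenAI's automatic prompt caching and save cost.
Manage Context Stability
Although the 1.05M window is powerful, performance is reportedly most stable within the first 256K tokens; place key summaries near the end of the prompt.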
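The caching and context-stability tips above both come down to where content sits in the prompt. The following sketch orders the parts accordingly: the large static block first, the request-specific question next, and the key summary last. `buildMessages` and its field names are hypothetical and chosen for illustration. The message shape mirrors the quick start example, and whether a given prefix is actually cached depends on the provider's caching rules.

```javascript
// Hypothetical helper: assemble one user message with parts ordered per the tips above.
function buildMessages({ staticContext, question, keySummary }) {
  const parts = [
    staticContext, // large, unchanging block first so a shared prefix can be reused
    question,      // the part that varies between requests
    keySummary,    // critical reminders last, near the end of the prompt
  ].filter((part) => typeof part === "string" && part.trim() !== "");
  return [{ role: "user", content: parts.join("\n\n") }];
}

const messages = buildMessages({
  staticContext: "<contents of the shared reference document>",
  question: "List the modules that import the deprecated logger.",
  keySummary: "Reminder: report file paths only.",
});
console.log(messages[0].content.split("\n\n").length); // 3
```

Keeping the static block byte-for-byte identical across requests matters for this layout, since any change at the start of the prompt alters the shared prefix.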
Related AI Models
Grok-3
xAI
Grok-3 is xAI's flagship reasoning model, featuring deep logic deduction, a 128k context window, and real-time integration with X for live research and coding.
Claude 3.7 Sonnet
Anthropic
Claude 3.7 Sonnet is Anthropic's first hybrid reasoning model, delivering state-of-the-art coding capabilities, a 200k context window, and visible thinking.
Claude Sonnet 4.5
Anthropic
Anthropic's Claude Sonnet 4.5 delivers world-leading coding (77.2% SWE-bench) and a 200K context window, optimized for the next generation of autonomous agents.
GLM-4.7
Zhipu (GLM)
GLM-4.7 by Zhipu AI is a flagship 358B MoE model featuring a 200K context window, elite 73.8% SWE-bench performance, and native Deep Thinking for agentic...
Gemini 3.1 Flash-Lite
Google
Gemini 3.1 Flash-Lite is Google's fastest, most cost-efficient model. Features 1M context, native multimodality, and 363 tokens/sec speed for scale.
Claude Opus 4.5
Anthropic
Claude Opus 4.5 is Anthropic's most powerful frontier model, delivering record-breaking 80.9% SWE-bench performance and advanced autonomous agency for coding.
GPT-5.3 Codex
OpenAI
GPT-5.3 Codex is OpenAI's 2026 frontier coding agent, featuring a 400K context window, 77.3% Terminal-Bench score, and superior logic for complex software...
Grok-4
xAI
Grok-4 by xAI is a frontier model featuring a 2M token context window, real-time X platform integration, and world-record reasoning capabilities.
Frequently Asked Questions About GPT-5.4
Find answers to frequently asked questions about GPT-5.4