
GLM-5
GLM-5 是智谱 AI 推出的 744B 参数 open-weight 强力模型,在长时程 agentic 任务、编程和准确性方面表现优异,并拥有 200k context window。
关于 GLM-5
了解 GLM-5 的功能、特性以及它如何帮助您获得更好的效果。
GLM-5 是智谱 AI 的 flagship 基础模型,专为自主 agentic 工作流和复杂系统工程而设计。它利用了海量的 7440 亿参数 Mixture-of-Experts (MoE) 架构,在 inference 期间激活 400 亿参数,以平衡性能和速度。该模型是首个在软件工程任务中证明能与 proprietary frontier model 媲美的 open-weight 系统,在 SWE-bench Verified 上得分为 77.8%。
该模型在 10 万张华为昇腾芯片组成的国产集群上,使用 28.5 万亿 tokens 进行训练。它集成了多头潜在注意力 (MLA) 和 DeepSeek 稀疏注意力 (DSA) 等专门机制,以在 200,000 token context window 中保持逻辑一致性。这种技术栈使得 GLM-5 能够处理长时程规划和资源管理,而不会出现此类大型密集模型通常所伴随的高延迟。
智谱 AI 以 MIT 协议发布了 GLM-5,使企业用户能够将权重部署在本地,以进行敏感数据处理。其输入成本仅为每百万 tokens 1.00 美元,相比 Claude 4.5 等竞品拥有 6 倍的价格优势。该模型包含专门的 Thinking Mode,显著降低了幻觉率。

GLM-5 的使用案例
发现使用 GLM-5 获得出色效果的不同方式。
自主软件工程
利用其在 SWE-bench Verified 上 77.8% 的得分,解决复杂的 GitHub issue 并执行跨仓库的代码重构。
企业级工具编排
跨内部 API 执行多步骤的 agentic 工作流,以处理金融和法律行业的后台自动化任务。
长 context 仓库分析
利用 200,000 token 的窗口,单次输入即可导入并分析整个文档集或多文件代码库。
个人 AI 协作助手
驱动 OpenClaw 等 open-source agent,全天候高可靠地管理电子邮件、日历及后台任务。
本地化私有智能
根据 MIT 协议在本地部署该 open-weight 模型,确保企业敏感数据的完全隐私。
高性价比 Agent 扩展
以比 frontier model 低 6-8 倍的成本运行大规模 agentic 会话,且不牺牲 reasoning 深度。
优势
局限性
API快速入门
zai/glm-5
import OpenAI from "openai";
const client = new OpenAI({
apiKey: process.env.ZHIPU_API_KEY,
baseURL: "https://open.bigmodel.cn/api/paas/v4/",
});
const response = await client.chat.completions.create({
model: "glm-5",
messages: [{ role: "user", content: "Analyze this repo structure and refactor to GraphQL." }],
stream: true,
});
for await (const chunk of response) {
process.stdout.write(chunk.choices[0]?.delta?.content || "");
}安装SDK并在几分钟内开始进行API调用。
人们对 GLM-5 的评价
看看社区对 GLM-5 的看法
“GLM-5 是一款 open-source 的 744B 参数模型,在编程方面表现接近 Claude Opus 水平……但价格差异非常关键。”
“200,000 的 token 窗口改变了我的工作流:单次分析 20+ 个文件进行重构,或一次性评审复杂的 PR diff。”
“使用 GLM-5 后,我的 Claude API 调用费用从每月 ~$90 降到了 $15 以下,而且没感觉到质量有明显的下降。”
“它的幻觉率在 30% 左右,相比之下 Gemini 3 Pro 为 88%。”
“GLM-5 在我完成 4.7 测试前就发布了,在日常编程中,其 reasoning 的提升确实非常明显。”
“智谱 AI 以 MIT 协议发布此模型,这对本地 LLM 社区来说是一个巨大的举措。”
关于 GLM-5 的视频
观看关于 GLM-5 的教程、评测和讨论
“它与 5.2 编码模型和 Opus 4.5 不相上下。”
“这是我第一个成功运行超过一小时任务且没有问题的 open-weight 模型。”
“它的幻觉率在 30% 左右,相比之下 Gemini 3 Pro 为 88%。”
“其 reasoning 密度显著高于 GLM-4。”
“它基本上取代了我的内部编码任务中的 Claude 3.5 Sonnet。”
“他们字面上将参数数量翻了一番……一直增加到 744。”
“尽管规模大得多,但它的运行速度几乎与旧模型一样快,甚至更快。”
“自我纠正。不要居高临下。把它当作一个有效的问题来对待。”
“稀疏注意力机制为这样的大型模型保持了较低的内存使用量。”
“Open-weight 的可用性使其成为本地托管的新冠军。”
“他们创建了名为 Slime 的自有 RL 引擎。”
“200,000 的 context window 改变了企业级 AI 的定义。”
“它在 SWE-bench verified 上达到 77.8,击败了 76.2 的 Gemini 3 Pro。”
“智谱 AI 证明了国产硬件可以训练出世界级的模型。”
“这里的主要焦点是 agentic 工程,而不仅仅是简单的聊天。”
GLM-5专业提示
专家提示助您充分利用GLM-5。
激活 Agentic 模式
在 prompt 中定义多步骤计划,因为 GLM-5 针对自主工程任务进行了优化,而非简单的聊天回复。
本地硬件分配
确保拥有充足的 RAM/VRAM,或使用配备 MindSpore 框架的华为昇腾硬件,以获得最佳的 throughput。
实现回退链 (Fallback Chains)
配置 GLM-5 作为您的主要 reasoning 模型,并以 GLM-4.7-Flash 作为处理简单指令的低成本回退模型。
使用结构化输出
当给定明确的交付物 schema 要求时,GLM-5 在生成精确的 .docx 和 .xlsx 格式方面表现卓越。
用户评价
用户怎么说
加入数千名已改变工作流程的满意用户
Jonathan Kogan
Co-Founder/CEO, rpatools.io
Automatio is one of the most used for RPA Tools both internally and externally. It saves us countless hours of work and we realized this could do the same for other startups and so we choose Automatio for most of our automation needs.
Mohammed Ibrahim
CEO, qannas.pro
I have used many tools over the past 5 years, Automatio is the Jack of All trades.. !! it could be your scraping bot in the morning and then it becomes your VA by the noon and in the evening it does your automations.. its amazing!
Ben Bressington
CTO, AiChatSolutions
Automatio is fantastic and simple to use to extract data from any website. This allowed me to replace a developer and do tasks myself as they only take a few minutes to setup and forget about it. Automatio is a game changer!
Sarah Chen
Head of Growth, ScaleUp Labs
We've tried dozens of automation tools, but Automatio stands out for its flexibility and ease of use. Our team productivity increased by 40% within the first month of adoption.
David Park
Founder, DataDriven.io
The AI-powered features in Automatio are incredible. It understands context and adapts to changes in websites automatically. No more broken scrapers!
Emily Rodriguez
Marketing Director, GrowthMetrics
Automatio transformed our lead generation process. What used to take our team days now happens automatically in minutes. The ROI is incredible.
Jonathan Kogan
Co-Founder/CEO, rpatools.io
Automatio is one of the most used for RPA Tools both internally and externally. It saves us countless hours of work and we realized this could do the same for other startups and so we choose Automatio for most of our automation needs.
Mohammed Ibrahim
CEO, qannas.pro
I have used many tools over the past 5 years, Automatio is the Jack of All trades.. !! it could be your scraping bot in the morning and then it becomes your VA by the noon and in the evening it does your automations.. its amazing!
Ben Bressington
CTO, AiChatSolutions
Automatio is fantastic and simple to use to extract data from any website. This allowed me to replace a developer and do tasks myself as they only take a few minutes to setup and forget about it. Automatio is a game changer!
Sarah Chen
Head of Growth, ScaleUp Labs
We've tried dozens of automation tools, but Automatio stands out for its flexibility and ease of use. Our team productivity increased by 40% within the first month of adoption.
David Park
Founder, DataDriven.io
The AI-powered features in Automatio are incredible. It understands context and adapts to changes in websites automatically. No more broken scrapers!
Emily Rodriguez
Marketing Director, GrowthMetrics
Automatio transformed our lead generation process. What used to take our team days now happens automatically in minutes. The ROI is incredible.
相关 AI Models
Claude 3.7 Sonnet
Anthropic
Claude 3.7 Sonnet is Anthropic's first hybrid reasoning model, delivering state-of-the-art coding capabilities, a 200k context window, and visible thinking.
DeepSeek-V3.2-Speciale
DeepSeek
DeepSeek-V3.2-Speciale is a reasoning-first LLM featuring gold-medal math performance, DeepSeek Sparse Attention, and a 131K context window. Rivaling GPT-5...
Claude 4.5 Sonnet
Anthropic
Anthropic's Claude Sonnet 4.5 delivers world-leading coding (77.2% SWE-bench) and a 200K context window, optimized for the next generation of autonomous agents.
GPT-5.3 Codex
OpenAI
GPT-5.3 Codex is OpenAI's 2026 frontier coding agent, featuring a 400K context window, 77.3% Terminal-Bench score, and superior logic for complex software...
GPT-4o mini
OpenAI
OpenAI's most cost-efficient small model, GPT-4o mini offers multimodal intelligence and high-speed performance at a significantly lower price point.
GPT-5.4
OpenAI
GPT-5.4 is OpenAI's frontier model featuring a 1.05M context window and Extreme Reasoning. It excels at autonomous UI interaction and long-form data analysis.
Gemini 3.1 Flash-Lite
Gemini 3.1 Flash-Lite is Google's fastest, most cost-efficient model. Features 1M context, native multimodality, and 363 tokens/sec speed for scale.
GPT-5.3 Instant
OpenAI
Explore GPT-5.3 Instant, OpenAI's "Anti-Cringe" model. Features a 128K context window, 26.8% fewer hallucinations, and a natural, helpful tone for everyday...
关于GLM-5的常见问题
查找关于GLM-5的常见问题答案