GLM-4.7

GLM-4.7 by Zhipu AI is a flagship 358B MoE model featuring a 200K context window, elite 73.8% SWE-bench performance, and native Deep Thinking for agentic workflows.

Zhipu AI · GLM family · December 22, 2025
Context
200K tokens
Max Output
131K tokens
Input Price
$0.60 / 1M tokens
Output Price
$2.20 / 1M tokens
Modality: Text
Capabilities: Tools, Streaming, Reasoning
Benchmarks
GPQA
42%
GPQA: Graduate-Level Science Q&A. A rigorous benchmark with 448 multiple-choice questions in biology, physics, and chemistry created by domain experts. PhD experts only achieve 65-74% accuracy, while non-experts score just 34% even with unlimited web access (hence 'Google-proof'). GLM-4.7 scored 42% on this benchmark.
HLE
43%
HLE: Humanity's Last Exam. A benchmark of expert-written questions spanning more than a hundred specialized subjects, designed to sit at the frontier of human knowledge and resist simple retrieval. GLM-4.7 scored 43% on this benchmark.
MMLU
88%
MMLU: Massive Multitask Language Understanding. A comprehensive benchmark with 16,000 multiple-choice questions across 57 academic subjects including math, philosophy, law, and medicine. Tests broad knowledge and reasoning capabilities. GLM-4.7 scored 88% on this benchmark.
MMLU Pro
76%
MMLU Pro: MMLU Professional Edition. An enhanced version of MMLU with 12,032 questions using a harder 10-option multiple choice format. Covers Math, Physics, Chemistry, Law, Engineering, Economics, Health, Psychology, Business, Biology, Philosophy, and Computer Science. GLM-4.7 scored 76% on this benchmark.
SimpleQA
45%
SimpleQA: Factual Accuracy Benchmark. Tests a model's ability to provide accurate, factual responses to straightforward questions. Measures reliability and reduces hallucinations in knowledge retrieval tasks. GLM-4.7 scored 45% on this benchmark.
IFEval
89%
IFEval: Instruction Following Evaluation. Measures how well a model follows specific instructions and constraints. Tests the ability to adhere to formatting rules, length limits, and other explicit requirements. GLM-4.7 scored 89% on this benchmark.
AIME 2025
96%
AIME 2025: American Invitational Math Exam. Competition-level mathematics problems from the prestigious AIME exam designed for talented high school students. Tests advanced mathematical problem-solving requiring abstract reasoning, not just pattern matching. GLM-4.7 scored 96% on this benchmark.
MATH
96%
MATH: Mathematical Problem Solving. A comprehensive math benchmark testing problem-solving across algebra, geometry, calculus, and other mathematical domains. Requires multi-step reasoning and formal mathematical knowledge. GLM-4.7 scored 96% on this benchmark.
GSM8k
97%
GSM8k: Grade School Math 8K. 8,500 grade school-level math word problems requiring multi-step reasoning. Tests basic arithmetic and logical thinking through real-world scenarios like shopping or time calculations. GLM-4.7 scored 97% on this benchmark.
MGSM
94%
MGSM: Multilingual Grade School Math. The GSM8k benchmark translated into 10 languages including Spanish, French, German, Russian, Chinese, and Japanese. Tests mathematical reasoning across different languages. GLM-4.7 scored 94% on this benchmark.
MathVista
68%
MathVista: Mathematical Visual Reasoning. Tests the ability to solve math problems that involve visual elements like charts, graphs, geometry diagrams, and scientific figures. Combines visual understanding with mathematical reasoning. GLM-4.7 scored 68% on this benchmark.
SWE-Bench
74%
SWE-Bench: Software Engineering Benchmark. AI models attempt to resolve real GitHub issues in open-source Python projects with human verification. Tests practical software engineering skills on production codebases. Top models went from 4.4% in 2023 to over 70% in 2024. GLM-4.7 scored 74% on this benchmark.
HumanEval
92%
HumanEval: Python Programming Problems. 164 hand-written programming problems where models must generate correct Python function implementations. Each solution is verified against unit tests. Top models now achieve 90%+ accuracy. GLM-4.7 scored 92% on this benchmark.
LiveCodeBench
85%
LiveCodeBench: Live Coding Benchmark. Tests coding abilities on continuously updated, real-world programming challenges. Unlike static benchmarks, uses fresh problems to prevent data contamination and measure true coding skills. GLM-4.7 scored 85% on this benchmark.
MMMU
70%
MMMU: Multimodal Understanding. Massive Multi-discipline Multimodal Understanding benchmark testing vision-language models on college-level problems across 30 subjects requiring both image understanding and expert knowledge. GLM-4.7 scored 70% on this benchmark.
MMMU Pro
55%
MMMU Pro: MMMU Professional Edition. Enhanced version of MMMU with more challenging questions and stricter evaluation. Tests advanced multimodal reasoning at professional and expert levels. GLM-4.7 scored 55% on this benchmark.
Terminal-Bench
41%
Terminal-Bench: Terminal/CLI Tasks. Tests the ability to perform command-line operations, write shell scripts, and navigate terminal environments. Measures practical system administration and development workflow skills. GLM-4.7 scored 41% on this benchmark.
ARC-AGI
12%
ARC-AGI: Abstraction & Reasoning. Abstraction and Reasoning Corpus for AGI - tests fluid intelligence through novel pattern recognition puzzles. Each task requires discovering the underlying rule from examples, measuring general reasoning ability rather than memorization. GLM-4.7 scored 12% on this benchmark.

About GLM-4.7

Learn about GLM-4.7's capabilities, features, and how it can help you achieve better results.

Model Overview

GLM-4.7 is a flagship large language model developed by Zhipu AI. It utilizes a Mixture-of-Experts (MoE) architecture with 358 billion total parameters. The model is specifically designed to handle complex agentic tasks and long-context reasoning through its unique Preserved Thinking and Interleaved Thinking capabilities. These features allow the model to maintain stable logic and intermediate reasoning states across multi-turn sessions, addressing the context decay common in autonomous workflows.

Performance and Architecture

The model offers an expansive 200,000-token context window combined with a massive 131,072-token output capacity. This makes it suitable for generating entire applications or analyzing extensive documentation in a single pass. Released under the MIT license as an open-weight model, it provides high-performance coding and reasoning at a fraction of the cost of proprietary alternatives.

Integration and Use

It is fully compatible with the OpenAI API format, simplifying integration into existing software ecosystems. Developers use it for high-stakes software engineering tasks, where it achieves a 73.8% score on SWE-bench Verified. Its ability to process and analyze high volumes of technical documentation across English and Chinese with native-level linguistic nuance makes it a versatile tool for international development teams.

GLM-4.7 Use Cases

Discover the different ways you can use GLM-4.7 to achieve great results.

Autonomous Software Engineering

Utilizing the 73.8% SWE-bench capability to autonomously debug, refactor, and implement new features across complex repositories.

High-Capacity Document Synthesis

Leveraging the 131k output limit to generate comprehensive technical manuals or entire book chapters from large datasets.

Long-Horizon Agentic Workflows

Deploying agents that use Preserved Thinking to maintain consistency and logic over hundreds of sequential tasks without losing context.

Bilingual Business Intelligence

Processing and analyzing high volumes of technical documentation across English and Chinese with native-level linguistic nuance.

Automated UI/UX Code Generation

Generating complete React or Next.js front-end architectures with advanced animations and production-ready styling in a single shot.

Competition-Level Mathematical Solving

Solving complex Olympiad-level math problems and symbolic logic puzzles using the dedicated reasoning-heavy thinking mode.

Strengths

Elite Coding Performance: Scores 73.8% on SWE-bench Verified, outperforming almost every open-source model and matching top-tier proprietary APIs.
Massive Output Ceiling: The 131,072-token output limit is one of the highest in the industry, enabling the generation of entire applications in one turn.
Agent-First Architecture: Features Preserved Thinking to maintain logical consistency across long-horizon tasks, solving context decay in autonomous agents.
High Economic Value: Provides frontier-level intelligence at roughly 4 to 7 times lower cost than Western competitors like OpenAI or Anthropic.

Limitations

Text-Only Modality: Unlike Gemini or GPT-4o, GLM-4.7 lacks native vision or audio processing, requiring external models for multimodal tasks.
Massive Local Requirements: At 358B parameters, running the model locally requires significant hardware (approx. 710GB VRAM), making it inaccessible to consumer GPUs.
Occasional Latency Spikes: Users on the personal API tier report periodic slowdowns during peak hours compared to the infrastructure of larger providers.
Instruction Adherence Quirks: While strong at reasoning, the model sometimes ignores specific file-structure constraints in highly complex coding sessions.

API Quick Start

zai/glm-4.7

View Documentation
zhipu SDK
import OpenAI from 'openai';

// The Z.ai endpoint is OpenAI-compatible, so the official SDK works unchanged.
const client = new OpenAI({
  apiKey: 'YOUR_ZAI_API_KEY',
  baseURL: 'https://api.z.ai/api/paas/v4/',
});

async function main() {
  const response = await client.chat.completions.create({
    model: 'glm-4.7',
    messages: [{ role: 'user', content: 'Design a scalable React architecture.' }],
    // `thinking` is a Z.ai-specific extension that enables reasoning traces.
    thinking: { type: 'enabled' }
  });
  console.log(response.choices[0].message.content);
}

main();

Install the SDK and start making API calls in minutes.
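Streaming is listed among the model's capabilities, and since the endpoint is OpenAI-compatible, a streamed call should follow the usual SDK pattern. A minimal sketch, assuming the endpoint mirrors OpenAI's `stream: true` chunk format (`client` constructed as in the snippet above):

```javascript
// Hedged sketch: streaming GLM-4.7 output via the OpenAI-compatible endpoint.
// Assumes Z.ai mirrors OpenAI's streaming chunk shape; verify against the docs.
function buildStreamingRequest(prompt) {
  return {
    model: 'glm-4.7',
    messages: [{ role: 'user', content: prompt }],
    stream: true, // deltas arrive chunk by chunk instead of one final message
  };
}

async function streamCompletion(client, prompt) {
  const stream = await client.chat.completions.create(buildStreamingRequest(prompt));
  for await (const chunk of stream) {
    // Each chunk carries an incremental content delta (may be empty).
    process.stdout.write(chunk.choices[0]?.delta?.content ?? '');
  }
}
```

Streaming is especially useful with this model's very large output ceiling, since waiting for a full 131K-token response in one shot can take minutes.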

Community Feedback

See what the community thinks about GLM-4.7

GLM-4.7 handles large codebases reliably with its 128k context. It's been surprisingly useful for subagent tasks to save on primary API costs.
IulianHI
reddit
Zhipu AI's GLM-4.7 matches proprietary frontier models like GPT-5.1 High in coding. The Preserved Thinking feature is a huge win for autonomous agents.
Etienne Noumen
youtube
GLM-4.7 continues to be the most intelligent open weights model in the Intelligence Index v4.0, placing ahead of DeepSeek V3.2.
Artificial Analysis
twitter
Chinese models are closing the gap fast in coding utility. This 73% SWE-bench score is no joke for an open weight release.
Epoch AI
hackernews
The reasoning speed is actually quite decent for a model of this size. It handles the complex logic much better than previous iterations.
Bijan Bowen
youtube
GLM-4.7 lands #6 on the AI Index, surpassing Kimi K2. Discover why this $2 model is replacing GPT-5.2 in coding workflows.
TowardsAI
twitter

Related Videos

Watch tutorials, reviews, and discussions about GLM-4.7

The context length here is 200k and the maximum output tokens is 128k which is quite beefy actually.

All right, that is really quite impressive. None of them put in a special feature with that level of complexity.

The reasoning speed is actually quite decent for a model of this size.

It handles the complex logic much better than previous iterations.

This model is a significant step up in terms of logical consistency.

The GLM model actually implemented a better architecture by placing all the mock data in one file.

This one is definitely a huge leap. Those benchmarks are justified by the testing I've done.

It understood the context of the entire project without me needing to remind it.

The coding capability is arguably on par with the best models out there.

You are getting high-end reasoning at a fraction of the cost.

It scored 73.8 percent on SWE-bench Verified, which is absolutely incredible for an open-source model.

You can actually see that it functions and actually works. Whereas the Gemini 3 Pro generation doesn't work at all.

The speed of generation for this level of intelligence is remarkable.

It is clearly designed for developers who need reliable code output.

Zhipu AI has really outdone themselves with the MoE architecture tuning here.

More than just prompts

Supercharge your workflow with AI Automation

Automatio combines the power of AI agents, web automation, and smart integrations to help you accomplish more in less time.

AI Agents
Web Automation
Smart Workflows

Pro Tips

Expert tips to help you get the most out of GLM-4.7 and achieve better results.

Enable Thinking Mode for Logic

Set the thinking parameter to enabled for coding or math tasks to utilize the model's internal reasoning traces and improve accuracy.
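As a sketch of that toggle (the `thinking` parameter is a Z.ai extension rather than part of the standard OpenAI schema; its shape here is assumed from the quick-start example above):

```javascript
// Hedged sketch: toggle GLM-4.7's thinking mode per request.
// `thinking` is assumed to take { type: 'enabled' | 'disabled' }.
function buildChatRequest(prompt, { think = false } = {}) {
  return {
    model: 'glm-4.7',
    messages: [{ role: 'user', content: prompt }],
    // Enable for coding/math where reasoning traces improve accuracy;
    // leave disabled for simple lookups to cut latency and token cost.
    thinking: { type: think ? 'enabled' : 'disabled' },
  };
}
```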

Use OpenAI-Compatible SDKs

Integrate GLM-4.7 into existing workflows by using the OpenAI SDK and changing the base URL to the Z.ai endpoint.

Maximize the 131K Output

When generating long-form content, provide a detailed outline first to help the model maintain structural coherence over the massive token limit.
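The outline-first pattern can be sketched as a two-pass call, where the first response is fed back into the second prompt. The prompt wording below is illustrative, not an official recipe:

```javascript
// Hedged sketch: outline-first, two-pass long-form generation.
// Pass 1 asks for a structural outline; pass 2 feeds the outline back
// so the model keeps its structure over a very long draft.
function buildDraftMessages(topic, outlineText) {
  return [
    {
      role: 'user',
      content: `Write the full document on "${topic}", following this outline exactly:\n${outlineText}`,
    },
  ];
}

async function generateLongForm(client, topic) {
  // Pass 1: get a section-by-section outline.
  const outline = await client.chat.completions.create({
    model: 'glm-4.7',
    messages: [{ role: 'user', content: `Write a detailed section-by-section outline for: ${topic}` }],
  });
  const outlineText = outline.choices[0].message.content;

  // Pass 2: generate the full draft against the outline.
  const draft = await client.chat.completions.create({
    model: 'glm-4.7',
    messages: buildDraftMessages(topic, outlineText),
  });
  return draft.choices[0].message.content;
}
```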

Optimize System Prompts for Agents

Define the Preserved Thinking requirements in the system message to ensure the model reuses reasoning states across multi-turn sessions.
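One way to phrase such a system message (the exact behavior of Preserved Thinking is assumed from the vendor's description; this prompt is an illustrative starting point, not an official recipe):

```javascript
// Hedged sketch: a system message nudging GLM-4.7 to carry reasoning state
// across turns in a long-horizon agent loop.
const agentMessages = [
  {
    role: 'system',
    content: [
      'You are a long-horizon coding agent.',
      'Carry forward your plan and open questions from previous turns;',
      'do not restart your analysis from scratch on each message.',
    ].join(' '),
  },
  { role: 'user', content: 'Task 1 of 200: audit the repository layout.' },
];
```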

Testimonials

What Our Users Say

Join thousands of satisfied users who have transformed their workflow

Jonathan Kogan

Co-Founder/CEO, rpatools.io

Automatio is one of the most used for RPA Tools both internally and externally. It saves us countless hours of work and we realized this could do the same for other startups and so we choose Automatio for most of our automation needs.

Mohammed Ibrahim

CEO, qannas.pro

I have used many tools over the past 5 years, Automatio is the Jack of All trades.. !! it could be your scraping bot in the morning and then it becomes your VA by the noon and in the evening it does your automations.. its amazing!

Ben Bressington

CTO, AiChatSolutions

Automatio is fantastic and simple to use to extract data from any website. This allowed me to replace a developer and do tasks myself as they only take a few minutes to setup and forget about it. Automatio is a game changer!

Sarah Chen

Head of Growth, ScaleUp Labs

We've tried dozens of automation tools, but Automatio stands out for its flexibility and ease of use. Our team productivity increased by 40% within the first month of adoption.

David Park

Founder, DataDriven.io

The AI-powered features in Automatio are incredible. It understands context and adapts to changes in websites automatically. No more broken scrapers!

Emily Rodriguez

Marketing Director, GrowthMetrics

Automatio transformed our lead generation process. What used to take our team days now happens automatically in minutes. The ROI is incredible.


Related AI Models

Qwen3-Coder-Next

alibaba

Qwen3-Coder-Next is Alibaba Cloud's elite Apache 2.0 coding model, featuring an 80B MoE architecture and 256k context window for advanced local development.

262K context
$0.12/$0.75/1M
GPT-4o mini

OpenAI

OpenAI's most cost-efficient small model, GPT-4o mini offers multimodal intelligence and high-speed performance at a significantly lower price point.

128K context
$0.15/$0.60/1M
MiniMax M2.5

minimax

MiniMax M2.5 is a SOTA MoE model featuring a 1M context window and elite agentic coding capabilities at disruptive pricing for autonomous agents.

1M context
$0.15/$1.20/1M
Gemini 3.1 Flash Live Preview

Google

Gemini 3.1 Flash Live Preview is Google's ultra-low-latency, audio-to-audio model featuring a 131K context window, high-fidelity multimodal reasoning, and...

131K context
$0.75/$4.50/1M
GPT-5.4

OpenAI

GPT-5.4 is OpenAI's frontier model featuring a 1.05M context window and Extreme Reasoning. It excels at autonomous UI interaction and long-form data analysis.

1M context
$2.50/$15.00/1M
Gemini 3.1 Flash-Lite

Google

Gemini 3.1 Flash-Lite is Google's fastest, most cost-efficient model. Features 1M context, native multimodality, and 363 tokens/sec speed for scale.

1M context
$0.25/$1.50/1M
GPT-5.3 Instant

OpenAI

Explore GPT-5.3 Instant, OpenAI's "Anti-Cringe" model. Features a 128K context window, 26.8% fewer hallucinations, and a natural, helpful tone for everyday...

128K context
$1.75/$14.00/1M
Gemini 3.1 Pro

Google

Gemini 3.1 Pro is Google's elite multimodal model featuring the DeepThink reasoning engine, a 1M+ context window, and industry-leading ARC-AGI logic scores.

1M context
$2.00/$12.00/1M

Frequently Asked Questions

Find answers to common questions about GLM-4.7