Qwen3.6-Max-Preview

Qwen3.6-Max-Preview is Alibaba's flagship MoE model featuring 1M context, a native thinking mode, and SOTA scores in agentic coding and reasoning.

MoE · Agentic Coding · 1M Context · Frontier Model · Alibaba Qwen
Alibaba · Qwen 3.6 · April 20, 2026
Context
1.0M tokens
Max Output
8K tokens
Input Price
$1.25 / 1M tokens
Output Price
$10.00 / 1M tokens
Modality: Text, Image, Video
Capabilities: Vision, Tools, Streaming, Reasoning
Benchmarks
GPQA
86%
GPQA: Graduate-Level Science Q&A. A rigorous benchmark with 448 multiple-choice questions in biology, physics, and chemistry created by domain experts. PhD experts only achieve 65-74% accuracy, while non-experts score just 34% even with unlimited web access (hence 'Google-proof'). Qwen3.6-Max-Preview scored 86% on this benchmark.
HLE
51%
HLE: Humanity's Last Exam. A benchmark of expert-written questions spanning dozens of specialized academic and professional domains, designed to probe frontier models at the edge of human knowledge. Qwen3.6-Max-Preview scored 51% on this benchmark.
MMLU
83%
MMLU: Massive Multitask Language Understanding. A comprehensive benchmark with 16,000 multiple-choice questions across 57 academic subjects including math, philosophy, law, and medicine. Tests broad knowledge and reasoning capabilities. Qwen3.6-Max-Preview scored 83% on this benchmark.
MMLU Pro
79%
MMLU Pro: MMLU Professional Edition. An enhanced version of MMLU with 12,032 questions using a harder 10-option multiple choice format. Covers Math, Physics, Chemistry, Law, Engineering, Economics, Health, Psychology, Business, Biology, Philosophy, and Computer Science. Qwen3.6-Max-Preview scored 79% on this benchmark.
SimpleQA
52%
SimpleQA: Factual Accuracy Benchmark. Tests a model's ability to provide accurate, factual responses to straightforward questions. Measures reliability and reduces hallucinations in knowledge retrieval tasks. Qwen3.6-Max-Preview scored 52% on this benchmark.
IFEval
75%
IFEval: Instruction Following Evaluation. Measures how well a model follows specific instructions and constraints. Tests the ability to adhere to formatting rules, length limits, and other explicit requirements. Qwen3.6-Max-Preview scored 75% on this benchmark.
AIME 2025
93%
AIME 2025: American Invitational Math Exam. Competition-level mathematics problems from the prestigious AIME exam designed for talented high school students. Tests advanced mathematical problem-solving requiring abstract reasoning, not just pattern matching. Qwen3.6-Max-Preview scored 93% on this benchmark.
MATH
95%
MATH: Mathematical Problem Solving. A comprehensive math benchmark testing problem-solving across algebra, geometry, calculus, and other mathematical domains. Requires multi-step reasoning and formal mathematical knowledge. Qwen3.6-Max-Preview scored 95% on this benchmark.
GSM8k
98%
GSM8k: Grade School Math 8K. 8,500 grade school-level math word problems requiring multi-step reasoning. Tests basic arithmetic and logical thinking through real-world scenarios like shopping or time calculations. Qwen3.6-Max-Preview scored 98% on this benchmark.
MGSM
92%
MGSM: Multilingual Grade School Math. The GSM8k benchmark translated into 10 languages including Spanish, French, German, Russian, Chinese, and Japanese. Tests mathematical reasoning across different languages. Qwen3.6-Max-Preview scored 92% on this benchmark.
MathVista
86%
MathVista: Mathematical Visual Reasoning. Tests the ability to solve math problems that involve visual elements like charts, graphs, geometry diagrams, and scientific figures. Combines visual understanding with mathematical reasoning. Qwen3.6-Max-Preview scored 86% on this benchmark.
SWE-Bench
73%
SWE-Bench: Software Engineering Benchmark. AI models attempt to resolve real GitHub issues in open-source Python projects with human verification. Tests practical software engineering skills on production codebases. Top models went from 4.4% in 2023 to over 70% in 2024. Qwen3.6-Max-Preview scored 73% on this benchmark.
HumanEval
91%
HumanEval: Python Programming Problems. 164 hand-written programming problems where models must generate correct Python function implementations. Each solution is verified against unit tests. Top models now achieve 90%+ accuracy. Qwen3.6-Max-Preview scored 91% on this benchmark.
LiveCodeBench
79%
LiveCodeBench: Live Coding Benchmark. Tests coding abilities on continuously updated, real-world programming challenges. Unlike static benchmarks, uses fresh problems to prevent data contamination and measure true coding skills. Qwen3.6-Max-Preview scored 79% on this benchmark.
MMMU
82%
MMMU: Multimodal Understanding. Massive Multi-discipline Multimodal Understanding benchmark testing vision-language models on college-level problems across 30 subjects requiring both image understanding and expert knowledge. Qwen3.6-Max-Preview scored 82% on this benchmark.
MMMU Pro
75%
MMMU Pro: MMMU Professional Edition. Enhanced version of MMMU with more challenging questions and stricter evaluation. Tests advanced multimodal reasoning at professional and expert levels. Qwen3.6-Max-Preview scored 75% on this benchmark.
ChartQA
85%
ChartQA: Chart Question Answering. Tests the ability to understand and reason about information presented in charts and graphs. Requires extracting data, comparing values, and performing calculations from visual data representations. Qwen3.6-Max-Preview scored 85% on this benchmark.
DocVQA
89%
DocVQA: Document Visual Q&A. Document Visual Question Answering benchmark testing the ability to extract and reason about information from document images including forms, reports, and scanned text. Qwen3.6-Max-Preview scored 89% on this benchmark.
Terminal-Bench
65%
Terminal-Bench: Terminal/CLI Tasks. Tests the ability to perform command-line operations, write shell scripts, and navigate terminal environments. Measures practical system administration and development workflow skills. Qwen3.6-Max-Preview scored 65% on this benchmark.
ARC-AGI
14%
ARC-AGI: Abstraction & Reasoning. Abstraction and Reasoning Corpus for AGI - tests fluid intelligence through novel pattern recognition puzzles. Each task requires discovering the underlying rule from examples, measuring general reasoning ability rather than memorization. Qwen3.6-Max-Preview scored 14% on this benchmark.

About Qwen3.6-Max-Preview

Learn about Qwen3.6-Max-Preview's capabilities, features, and how it can help you achieve better results.

Qwen3.6-Max-Preview is the flagship proprietary large language model from Alibaba, representing the next step in their high-performance AI series. Utilizing a sparse Mixture-of-Experts (MoE) architecture, the model achieves the reasoning depth of a trillion-parameter system while maintaining significant operational efficiency. It is specifically optimized for agentic coding, world knowledge, and complex instruction following.

The model's standout feature is its native Thinking Mode, which allows the system to generate a visible internal chain-of-thought before delivering a final response. This transparency is particularly valuable for developers building autonomous agents, as it provides a clear window into logical planning and error-correction steps. Combined with a massive 1-million-token context window, the model can ingest entire project repositories or extensive documentation libraries in a single pass.

Hosted on Alibaba Cloud Model Studio, Qwen3.6-Max-Preview supports industry-standard protocols and is compatible with OpenAI-style API specifications. It is designed to be the primary choice for enterprises requiring frontier-level AI capabilities for multimodal data analysis and robust agentic workflows, offering a high-performance alternative to Western closed-source models.

Qwen3.6-Max-Preview

Use Cases

Discover the different ways you can use Qwen3.6-Max-Preview to achieve great results.

Autonomous Software Engineering

Deploy the model as a coding agent capable of navigating entire codebases, planning architectural changes, and fixing bugs across multiple files.

Large-Scale Technical Analysis

Utilize the 1M token context window to ingest complete documentation sets or legal frameworks for deep-dive analysis without RAG limitations.

Complex Reasoning and Planning

Leverage the native Thinking Mode to solve high-level mathematical problems where a multi-step internal plan is required for accuracy.

Multimodal Content Understanding

Analyze both static imagery and complex video sequences to extract data and summarize dynamic visual events.

Interactive Terminal Operations

Build tools that allow the AI to interact directly with shells and CLI environments, benefiting from its optimized Terminal-Bench performance.

Enterprise Agentic Workflows

Integrate the model into complex business pipelines where high instructional reliability and sophisticated tool-calling are required for automation.
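Since the model exposes tool-calling through the OpenAI-compatible specification, a workflow tool is declared with the standard `tools` schema. A minimal sketch follows; the `lookup_order` tool name and its parameters are hypothetical examples for illustration, not part of any Qwen API.

```javascript
// Builds an OpenAI-style tool definition for agentic workflows.
// The tool name and parameters below are hypothetical; only the
// surrounding schema (type/function/parameters) follows the standard
// OpenAI-compatible `tools` format.
function buildTool(name, description, properties, required) {
  return {
    type: 'function',
    function: {
      name,
      description,
      parameters: { type: 'object', properties, required },
    },
  };
}

const lookupOrder = buildTool(
  'lookup_order',
  'Fetch the status of a customer order by ID.',
  { order_id: { type: 'string', description: 'Internal order identifier' } },
  ['order_id'],
);

// The resulting object would be passed as `tools: [lookupOrder]` in a
// chat.completions.create call against the compatible-mode endpoint.
```

When the model decides to call the tool, the response carries a `tool_calls` entry whose arguments your pipeline executes before returning the result as a `tool` role message.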

Strengths

World-Leading Coding Ability: Achieves a 57.3% score on SWE-bench Pro, surpassing major frontier models like Claude 4.5 Opus for autonomous software tasks.
Enormous 1M Token Context: Handles massive datasets and full technical libraries within a single prompt without the typical context degradation of older architectures.
Transparent Native Reasoning: The built-in Thinking Mode exposes internal logic, allowing for higher reliability in complex problem-solving and easier debugging.
Aggressive Price Positioning: At $1.25 per million input tokens, it offers frontier-level performance at a fraction of the cost of Western proprietary equivalents.

Limitations

Closed Source Restriction: Unlike the Medium versions of Qwen 3.6, the Max-Preview is proprietary and cannot be self-hosted on local hardware.
High Output Token Premium: The $10.00/1M output pricing is an 8x markup over the input price, making long reasoning chains more expensive than ingestion.
Knowledge Cutoff Constraints: As a static preview model, it lacks real-time awareness of events or library updates beyond its early 2026 training cutoff.
Regional API Latency: Depending on the deployment region, international users may face higher latency compared to highly optimized local variants.

API Quick Start

alibaba/qwen3.6-max-preview

View Documentation
JavaScript (OpenAI-compatible SDK)
import OpenAI from 'openai';

const client = new OpenAI({
  apiKey: process.env.DASHSCOPE_API_KEY,
  baseURL: 'https://dashscope-intl.aliyuncs.com/compatible-mode/v1',
});

async function main() {
  const completion = await client.chat.completions.create({
    model: 'qwen3.6-max-preview',
    messages: [{ role: 'user', content: 'Design a system architecture for a real-time AI agent.' }],
    // DashScope-specific parameter, passed through by the compatible-mode endpoint
    enable_thinking: true,
    stream: true,
  });

  for await (const chunk of completion) {
    process.stdout.write(chunk.choices[0]?.delta?.content || '');
  }
}

main();

Install the SDK and start making API calls in minutes.

Community Feedback

See what the community thinks about Qwen3.6-Max-Preview

The kind of performance you'd expect from a model running on a massive server farm is now sitting on your desktop.
softtechhubus
reddit
Qwen3.6-Max-Preview just beat Claude Opus 4.5 on SWE-Bench Pro. China is catching up fast.
BridgeMind
twitter
At $1.25 per million tokens, Qwen is significantly cheaper than Claude for large scale data ingestion.
TechReviewer2026
reddit
The fact that Thinking Mode is baked in as the default state is a meaningful design choice for agentic reliability.
DevGuru
twitter
Qwen has launched Qwen 3.6 Max Preview as a new top-end proprietary flagship model.
AICodeKing
youtube
It shows improved agentic coding and better real-world agent reliability compared to the Plus model.
Codedigipt
youtube

Related Videos

Watch tutorials, reviews, and discussions about Qwen3.6-Max-Preview

Qwen has launched Qwen 3.6 Max Preview as a new top-end proprietary flagship model.

The model shows a strong jump in coding-agent benchmarks like SkillsBench and Terminal-Bench 2.0.

Qwen is clearly trying to compete seriously at the high end against models like Claude 4.5 Opus.

This model represents a meaningful improvement in world knowledge and instruction following.

The performance jump on SWE-bench is what really sets this apart from the Plus variant.

The benchmark story is really about positioning the hosted Max Preview as distinct from the open-weight family.

We use Qwen Code pages and repo surfaces to judge the ecosystem depth beyond just the model weights.

The thinking mode is surprisingly fast compared to o1-style models from last year.

This is clearly designed for enterprise developers who need a reliable API for agentic tasks.

The multimodal vision performance is catching up to Gemini 2 in some document analysis tests.

This video introduces the Qwen3.6-Max-Preview, an early look at the next flagship model from Qwen.

It shows improved agentic coding and better real-world agent reliability compared to the Plus model.

The 1M context window is much more stable than what we saw in early Qwen 2 versions.

If you are doing a lot of coding, Qwen 3.6 Max is currently the benchmark leader.

Pricing remains very competitive even for their flagship closed-source model.

More than just prompts

Supercharge your workflow with AI Automation

Automatio combines the power of AI agents, web automation, and smart integrations to help you accomplish more in less time.

AI Agents
Web Automation
Smart Workflows

Pro Tips

Expert tips to help you get the most out of Qwen3.6-Max-Preview and achieve better results.

Enable Internal Reasoning

Set the 'enable_thinking' parameter to true in your API request to view the model's internal logic for debugging complex reasoning.
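A minimal sketch of such a request body, assuming the DashScope compatible-mode endpoint accepts `enable_thinking` as a pass-through parameter (as this page describes); the `reasoning_content` delta field mentioned in the comments is an assumption about the stream format and should be checked against the current Model Studio documentation.

```javascript
// Builds the request body for a thinking-enabled call.
// `enable_thinking` is the DashScope pass-through parameter named on
// this page; verify the exact behavior against current docs.
function buildThinkingRequest(model, prompt) {
  return {
    model,
    messages: [{ role: 'user', content: prompt }],
    enable_thinking: true,
    stream: true,
  };
}

const req = buildThinkingRequest('qwen3.6-max-preview', 'Prove that 17 is prime.');

// With an OpenAI-compatible client you would then iterate the stream,
// reading chunk.choices[0]?.delta?.reasoning_content (assumed field) for
// the internal trace and delta.content for the final answer.
```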

Preserve Long-Horizon Logic

Use the 'preserve_thinking' feature for multi-turn conversations to ensure the model maintains logical consistency across a session.
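One way to sketch this in code: keep the full message history across turns and set the flag on each request. Note that `preserve_thinking` is taken from this page's description and is hypothetical as written here; verify the exact parameter name against the Model Studio docs.

```javascript
// Minimal multi-turn history helper: returns a new history array with
// the latest turn appended, so each request sees the whole session.
function appendTurn(history, role, content) {
  return [...history, { role, content }];
}

let history = [];
history = appendTurn(history, 'user', 'Plan a migration to Postgres.');
history = appendTurn(history, 'assistant', 'Step 1: inventory the schema...');
history = appendTurn(history, 'user', 'Now estimate the downtime.');

const body = {
  model: 'qwen3.6-max-preview',
  messages: history,
  enable_thinking: true,
  preserve_thinking: true, // hypothetical flag, per this page's description
};
```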

Feed Entire Libraries

Take advantage of the 1M context window by providing full source materials instead of chunked data for better cross-file understanding.
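A simple sketch of this pattern: pack a whole directory into one prompt string, with a header per file so the model can attribute answers to sources. Real use should still check the packed total against the 1M-token context limit.

```javascript
import fs from 'node:fs';
import path from 'node:path';
import os from 'node:os';

// Concatenates every file in a directory into a single prompt string,
// prefixing each file's contents with a "=== filename ===" header.
function packDirectory(dir) {
  return fs.readdirSync(dir)
    .filter((f) => fs.statSync(path.join(dir, f)).isFile())
    .sort()
    .map((f) => `=== ${f} ===\n${fs.readFileSync(path.join(dir, f), 'utf8')}`)
    .join('\n\n');
}

// Demo against a throwaway temp directory.
const dir = fs.mkdtempSync(path.join(os.tmpdir(), 'ctx-'));
fs.writeFileSync(path.join(dir, 'a.txt'), 'alpha');
fs.writeFileSync(path.join(dir, 'b.txt'), 'beta');
const prompt = packDirectory(dir);
```

The resulting `prompt` string goes into a single user message, replacing the retrieve-then-chunk step a RAG pipeline would normally need.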

Use Compatible Endpoints

For global applications, use the Singapore or US Virginia endpoints in Alibaba Cloud to minimize regional latency for international users.
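A small helper for wiring this up without hard-coding a guessed URL: the international compatible-mode URL below is the one shown in this page's quick start, while region-specific endpoints should be copied from the Alibaba Cloud Model Studio console and supplied as an override.

```javascript
// Default to the documented international compatible-mode endpoint;
// allow an explicit override (e.g. a regional endpoint from the
// Model Studio console) via configuration.
const DEFAULT_BASE_URL = 'https://dashscope-intl.aliyuncs.com/compatible-mode/v1';

function resolveBaseURL(override) {
  return override && override.startsWith('https://') ? override : DEFAULT_BASE_URL;
}

// e.g. pass resolveBaseURL(process.env.DASHSCOPE_BASE_URL) as the
// `baseURL` option when constructing the OpenAI-compatible client.
```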

Testimonials

What Our Users Say

Join thousands of satisfied users who have transformed their workflow

Jonathan Kogan

Co-Founder/CEO, rpatools.io

Automatio is one of the most used for RPA Tools both internally and externally. It saves us countless hours of work and we realized this could do the same for other startups and so we choose Automatio for most of our automation needs.

Mohammed Ibrahim

CEO, qannas.pro

I have used many tools over the past 5 years, Automatio is the Jack of All trades.. !! it could be your scraping bot in the morning and then it becomes your VA by the noon and in the evening it does your automations.. its amazing!

Ben Bressington

CTO, AiChatSolutions

Automatio is fantastic and simple to use to extract data from any website. This allowed me to replace a developer and do tasks myself as they only take a few minutes to setup and forget about it. Automatio is a game changer!

Sarah Chen

Head of Growth, ScaleUp Labs

We've tried dozens of automation tools, but Automatio stands out for its flexibility and ease of use. Our team productivity increased by 40% within the first month of adoption.

David Park

Founder, DataDriven.io

The AI-powered features in Automatio are incredible. It understands context and adapts to changes in websites automatically. No more broken scrapers!

Emily Rodriguez

Marketing Director, GrowthMetrics

Automatio transformed our lead generation process. What used to take our team days now happens automatically in minutes. The ROI is incredible.

Related AI Models

GLM-5

Zhipu (GLM)

GLM-5 is Zhipu AI's 744B parameter open-weight powerhouse, excelling in long-horizon agentic tasks, coding, and factual accuracy with a 200k context window.

200K context
$1.00/$3.20/1M
GLM-5.1

Zhipu (GLM)

GLM-5.1 is Zhipu AI's flagship reasoning model, featuring a 202K context window and an autonomous 8-hour execution loop for complex agentic engineering.

203K context
$1.40/$4.40/1M
GPT-5.2

OpenAI

GPT-5.2 is OpenAI's flagship model for professional tasks, featuring a 400K context window, elite coding, and deep multi-step reasoning capabilities.

400K context
$1.75/$14.00/1M
Gemini 3.1 Flash-Lite

Google

Gemini 3.1 Flash-Lite is Google's fastest, most cost-efficient model. Features 1M context, native multimodality, and 363 tokens/sec speed for scale.

1M context
$0.25/$1.50/1M
Claude Opus 4.5

Anthropic

Claude Opus 4.5 is Anthropic's most powerful frontier model, delivering record-breaking 80.9% SWE-bench performance and advanced autonomous agency for coding.

200K context
$5.00/$25.00/1M
Grok-4

xAI

Grok-4 by xAI is a frontier model featuring a 2M token context window, real-time X platform integration, and world-record reasoning capabilities.

2M context
$3.00/$15.00/1M
Kimi K2 Thinking

Moonshot

Kimi K2 Thinking is Moonshot AI's trillion-parameter reasoning model. It outperforms GPT-5 on HLE and supports 300 sequential tool calls autonomously for...

256K context
$0.60/$2.50/1M
Kimi K2.5

Moonshot

Discover Moonshot AI's Kimi K2.5, a 1T-parameter open-source agentic model featuring native multimodal capabilities, a 262K context window, and SOTA reasoning.

256K context
$0.60/$3.00/1M

Frequently Asked Questions

Find answers to common questions about Qwen3.6-Max-Preview