What is the pricing for MiMo V2.5 Pro?

Pricing is $1.00 per 1M input tokens and $3.00 per 1M output tokens on the official platform. This structure is significantly more affordable than closed models like GPT-5.4 for generation tasks.

How do I access the MiMo V2.5 Pro API?

Access is provided via the Xiaomi MiMo API Open Platform with headers compatible with the OpenAI SDK. It is also available through third-party aggregators such as OpenRouter.

What is the context window size?

The model supports a context window of 1,048,576 tokens. This allows users to input entire libraries or several hours of video content in a single prompt.

How does it compare to closed frontier models?

It matches or exceeds models like Claude Opus 4.6 on SWE-Bench Verified and agentic tasks. It achieved these results while consuming approximately 40% fewer tokens per trajectory.

Does it support native multimodal input?

Yes, it is a native omnimodal agent that accepts text, image, audio, and video inputs. It reasons across these data types directly without requiring external preprocessing.

What is the model's license?

MiMo V2.5 Pro is released under the MIT License. This allows for unrestricted commercial use, modification, and redistribution.

Can I use function calling?

Yes, it supports dependable function calling and is optimized for multi-turn tool use. It maintains coherence over sequences exceeding 1,000 tool calls in software environments.

What hardware is needed for local hosting?

Local hosting requires enterprise-grade hardware due to the 1.02T parameter count. Developers should use multi-GPU clusters and FP8 precision weights for efficiency.

MiMo V2.5 Pro

MiMo V2.5 Pro is Xiaomi's open-source 1.02T parameter MoE model featuring a 1M context window, native multimodality, and elite agentic coding performance.

Open SourceAgentic AIMultimodal1M ContextXiaomi

otherMiMoApril 27, 2026

Context

1.0Mtokens

Max Output

131Ktokens

Input Price

$1.00/ 1M

Output Price

$3.00/ 1M

Modality:TextImageAudioVideo

Capabilities:VisionToolsStreamingReasoning

Benchmarks

GPQA

54%

HLE

48%

MMLU

86.7%

MMLU Pro

84.9%

SimpleQA

45%

IFEval

88%

AIME 2025

41%

MATH

75%

GSM8k

95.5%

MGSM

92%

MathVista

65%

SWE-Bench

78.9%

HumanEval

90%

LiveCodeBench

80.6%

MMMU

73%

MMMU Pro

52%

ChartQA

89%

DocVQA

93.5%

Terminal-Bench

68.4%

ARC-AGI

View API Documentation

About MiMo V2.5 Pro

Learn about MiMo V2.5 Pro's capabilities, features, and how it can help you achieve better results.

MiMo V2.5 Pro is Xiaomi's flagship open-source model. It uses a 1.02 trillion parameter Mixture-of-Experts architecture where 42 billion parameters are active during inference. The hybrid-attention design mixes Local Sliding Window Attention and Global Attention at a 6:1 ratio. This specific configuration reduces KV-cache storage requirements by nearly 7x compared to standard transformer models.

The model handles a 1-million-token context window while supporting native omnimodal inputs including text, image, audio, and video. It is optimized for long-horizon agentic tasks and autonomous tool use. Developers can run the model locally using FP8 precision weights, which balance memory usage with output throughput. The permissive MIT license allows for modification and commercial deployment without additional fees.

Use Cases

Discover the different ways you can use MiMo V2.5 Pro to achieve great results.

Autonomous Software Engineering

Resolving GitHub issues and building system components like compilers with self-correcting logic.

Long-Horizon Agent Workflows

Executing plans requiring coherence across more than 1,000 tool calls in software environments.

Native Multimodal Analysis

Directly reasoning across combined inputs of video and text without external preprocessing or frame extraction.

Large-Scale Codebase Navigation

Ingesting entire project repositories within the 1M token context window to refactor logic or find bugs.

Analog Circuit Design

Optimizing complex circuits by interacting with simulation loops to meet multi-metric specifications.

3D Web Generation

Creating sophisticated environments and physics simulations using Three.js and procedural terrain generation.

Strengths

Limitations

Low Token Consumption: Delivers intelligence matching frontier models while using 40% to 60% fewer tokens per task trajectory.

Reasoning Latency: The deep thinking mode can result in delays of several minutes before the model begins generating text.

Long-Horizon Coherence: Maintains reasoning accuracy across context windows of 1 million tokens and sequences of over 1,000 tool calls.

Complex Platform Access: The official web portal has an unstable sign-in process that users frequently describe as difficult to navigate.

Software Engineering Performance: Reaches a 78.9% score on SWE-bench Verified, indicating high proficiency in resolving GitHub-level code issues.

Safety Refusal Patterns: Occasional refusals can occur at the very end of long thinking cycles, which consumes compute time without providing output.

Permissive MIT Licensing: Allows for commercial integration and weight modification without the restrictive terms found in other open-source licenses.

Significant Hardware Requirements: Hosting the 1.02T parameter model locally requires multi-GPU clusters, making self-hosting expensive for small teams.

API Quick Start

xiaomi/mimo-v2.5-pro

View Documentation

other SDK

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.xiaomimimo.com/v1",
  apiKey: process.env.MIMO_API_KEY
});

const completion = await client.chat.completions.create({
  model: "mimo-v2.5-pro",
  messages: [{ role: "user", content: "Identify logic errors in this 50,000 line codebase." }],
  thinking: { type: "enabled" }
});

console.log(completion.choices[0].message.content);

Install the SDK and start making API calls in minutes.

Community Feedback

See what the community thinks about MiMo V2.5 Pro

“The speed-to-context ratio on MiMo-V2.5-Pro is unbeatable for RAG pipelines that need to scan entire codebases in one go.”

— u/DevBuilder

“China just matched USA frontier coding AI at 40-60% lower token cost. This isn't incremental; it's rewriting the game.”

— Shruti

twitter

“MiMo-V2.5-Pro solved problems that would take human experts weeks. It built a complete compiler in just over 4 hours.”

— TechCrunchy

twitter

“The model's value isn't just in benchmarks, but in its ability to sustain complex agent workflows without breaking.”

— XiaomiMiMo Team

hackernews

“The speed is actually decent for a 1T model. The MoE routing is doing a lot of heavy lifting here.”

— AIExplorer

“Finally an MIT licensed model that actually competes with the closed giants. Local deployment is the next hurdle.”

— OpenSourceFan

twitter

Supercharge your workflow with AI Automation

Automatio combines the power of AI agents, web automation, and smart integrations to help you accomplish more in less time.

AI Agents

Web Automation

Smart Workflows

Get Started Free

Pro Tips

Expert tips to help you get the most out of MiMo V2.5 Pro and achieve better results.

Manage Chain-of-Thought Latency

Add 'don't overthink' to your prompt to reduce reasoning latency for simple technical queries.

Preserve Reasoning Content

Pass back the previous reasoning_content in multi-turn conversations to maintain agentic performance.

Define Environment Affordances

Specify tool environment capabilities clearly as the model is optimized for harness awareness.

Optimize Local Deployment

Use FP8 mixed precision weights to balance memory efficiency with high output throughput.

Testimonials

What Our Users Say

Join thousands of satisfied users who have transformed their workflow

Jonathan Kogan

Co-Founder/CEO, rpatools.io

Automatio is one of the most used for RPA Tools both internally and externally. It saves us countless hours of work and we realized this could do the same for other startups and so we choose Automatio for most of our automation needs.

Mohammed Ibrahim

CEO, qannas.pro

I have used many tools over the past 5 years, Automatio is the Jack of All trades.. !! it could be your scraping bot in the morning and then it becomes your VA by the noon and in the evening it does your automations.. its amazing!

Ben Bressington

CTO, AiChatSolutions

Automatio is fantastic and simple to use to extract data from any website. This allowed me to replace a developer and do tasks myself as they only take a few minutes to setup and forget about it. Automatio is a game changer!

Sarah Chen

Head of Growth, ScaleUp Labs

We've tried dozens of automation tools, but Automatio stands out for its flexibility and ease of use. Our team productivity increased by 40% within the first month of adoption.

David Park

Founder, DataDriven.io

The AI-powered features in Automatio are incredible. It understands context and adapts to changes in websites automatically. No more broken scrapers!

Emily Rodriguez

Marketing Director, GrowthMetrics

Automatio transformed our lead generation process. What used to take our team days now happens automatically in minutes. The ROI is incredible.

Jonathan Kogan

Co-Founder/CEO, rpatools.io

Mohammed Ibrahim

CEO, qannas.pro

Ben Bressington

CTO, AiChatSolutions

Sarah Chen

Head of Growth, ScaleUp Labs

We've tried dozens of automation tools, but Automatio stands out for its flexibility and ease of use. Our team productivity increased by 40% within the first month of adoption.

David Park

Founder, DataDriven.io

The AI-powered features in Automatio are incredible. It understands context and adapts to changes in websites automatically. No more broken scrapers!

Emily Rodriguez

Marketing Director, GrowthMetrics

Automatio transformed our lead generation process. What used to take our team days now happens automatically in minutes. The ROI is incredible.

Related AI Models

DeepSeek-V3.2-Speciale

DeepSeek

DeepSeek-V3.2-Speciale is a reasoning-first LLM featuring gold-medal math performance, DeepSeek Sparse Attention, and a 131K context window. Rivaling GPT-5...

131K context

$0.28/$0.42/1M

MiniMax M2.5

minimax

MiniMax M2.5 is a SOTA MoE model featuring a 1M context window and elite agentic coding capabilities at disruptive pricing for autonomous agents.

1M context

$0.15/$1.20/1M

Kimi K2.7 Code

Moonshot

Kimi K2.7 Code is a 1T parameter MoE model from Moonshot AI. It features a 262k context window and 30% more efficient reasoning for software engineering.

262K context

$0.95/$4.00/1M

GLM-4.7

Zhipu (GLM)

GLM-4.7 by Zhipu AI is a flagship 358B MoE model featuring a 200K context window, elite 73.8% SWE-bench performance, and native Deep Thinking for agentic...

200K context

$0.60/$2.20/1M

Qwen3-Coder-Next

alibaba

Qwen3-Coder-Next is Alibaba Cloud's elite Apache 2.0 coding model, featuring an 80B MoE architecture and 256k context window for advanced local development.

262K context

$0.12/$0.75/1M

GPT-4o mini

OpenAI

OpenAI's most cost-efficient small model, GPT-4o mini offers multimodal intelligence and high-speed performance at a significantly lower price point.

128K context

$0.15/$0.60/1M

Kimi K3

Moonshot

Kimi K3 is Moonshot AI's 2.8T MoE model with a 1M token context window, native multimodal vision, and frontier-tier coding performance for complex agents.

1M context

$3.00/$15.00/1M

GLM-5.2

Zhipu (GLM)

GLM-5.2 is Zhipu AI's flagship open-weight model featuring a 1M context window and specialized agentic coding capabilities under an MIT license.

1M context

$1.40/$4.40/1M

Frequently Asked Questions

Find answers to common questions about MiMo V2.5 Pro

MiMo V2.5 Pro

About MiMo V2.5 Pro

Use Cases

Autonomous Software Engineering

Long-Horizon Agent Workflows

Native Multimodal Analysis

Large-Scale Codebase Navigation

Analog Circuit Design

3D Web Generation

Strengths

Limitations

API Quick Start

Community Feedback

Related Videos

Supercharge your workflow with AI Automation

Pro Tips

Manage Chain-of-Thought Latency

Preserve Reasoning Content

Define Environment Affordances

Optimize Local Deployment

What Our Users Say

Related AI Models

DeepSeek-V3.2-Speciale

MiniMax M2.5

Kimi K2.7 Code

GLM-4.7

Qwen3-Coder-Next

GPT-4o mini

Kimi K3

GLM-5.2

Frequently Asked Questions

What is the pricing for MiMo V2.5 Pro?

How do I access the MiMo V2.5 Pro API?

What is the context window size?

How does it compare to closed frontier models?

Does it support native multimodal input?

What is the model's license?

Can I use function calling?

What hardware is needed for local hosting?