
MiMo V2.5 Pro
MiMo V2.5 Pro 是小米的 open-source 1.02T parameters MoE model,具备 1M context window、原生多模态能力和顶尖的 agentic 编程性能。
关于 MiMo V2.5 Pro
了解 MiMo V2.5 Pro 的功能、特性以及它如何帮助您获得更好的效果。
MiMo V2.5 Pro 是小米的旗舰级 open-source model。它采用 1.02 万亿 parameters 的 Mixture-of-Experts 架构,在 inference 期间有 420 亿 parameters 处于激活状态。其混合注意力设计以 6:1 的比例融合了局部滑动窗口注意力和全局注意力。与标准 transformer model 相比,这种特殊配置将 KV-cache 存储需求降低了近 7 倍。
该 model 支持 100 万 tokens 的 context window,并支持包括文本、图像、音频和视频在内的原生 omnimodal 输入。它针对长周期 agentic 任务和自主工具使用进行了优化。开发者可以使用 FP8 精度权重在本地运行该 model,从而平衡内存使用和输出 throughput。宽松的 MIT 许可允许在无需额外费用的情况下进行修改和商业部署。

MiMo V2.5 Pro 的使用案例
发现使用 MiMo V2.5 Pro 获得出色效果的不同方式。
自主软件工程
通过自我纠正逻辑来解决 GitHub 问题并构建编译器等系统组件。
长周期 Agent 工作流
执行需要在软件环境中保持超过 1,000 次工具调用连贯性的任务。
原生 Multimodal 分析
直接对视频和文本的组合输入进行 reasoning,无需外部预处理或帧提取。
大规模代码库导航
在 1M tokens 的 context window 内摄取整个项目仓库,以重构逻辑或查找 bug。
模拟电路设计
通过与模拟循环交互来优化复杂电路,以满足多指标规范。
3D Web 生成
使用 Three.js 和程序化地形生成技术创建复杂的环境和物理模拟。
优势
局限性
API快速入门
xiaomi/mimo-v2.5-pro
import OpenAI from "openai";
const client = new OpenAI({
baseURL: "https://api.xiaomimimo.com/v1",
apiKey: process.env.MIMO_API_KEY
});
const completion = await client.chat.completions.create({
model: "mimo-v2.5-pro",
messages: [{ role: "user", content: "Identify logic errors in this 50,000 line codebase." }],
thinking: { type: "enabled" }
});
console.log(completion.choices[0].message.content);安装SDK并在几分钟内开始进行API调用。
人们对 MiMo V2.5 Pro 的评价
看看社区对 MiMo V2.5 Pro 的看法
“对于需要一次性扫描整个代码库的 RAG 流水线,MiMo-V2.5-Pro 的速度与 context 比率是无敌的。”
“中国刚刚以 40-60% 的 token 成本匹配了美国的 frontier coding AI。这不仅仅是增量改进;这是在重写规则。”
“MiMo-V2.5-Pro 解决了原本需要人类专家花费数周才能解决的问题。它仅用了 4 小时多一点就构建了一个完整的编译器。”
“该 model 的价值不仅在于 benchmark,还在于它能够持续执行复杂的 agent 工作流而不崩溃。”
“对于一个 1T 的 model 来说,速度确实不错。MoE 路由在这里发挥了很大作用。”
“终于有一款 MIT 许可的 model 可以真正与 closed-source 巨头竞争了。本地部署是下一个挑战。”
关于 MiMo V2.5 Pro 的视频
观看关于 MiMo V2.5 Pro 的教程、评测和讨论
“我从未在结果中见过这种细节水平……看看那木板地板的细节。”
“当你提供特定的技术错误消息时,该 model 非常自信且有效。”
“它处理整个代码库 context 时,不会出现通常的文档中间信息丢失。”
“思考过程是透明的,确切展示了它如何评估各种工具选项。”
“该 model 在 JSON 输出的严格指令遵循方面表现优于其前身。”
“它旨在处理复杂的多步工作流,支持数千次工具调用。”
“在相似的性能下,它比 GPT-5.4 或 Claude Opus 4.6 等 model 少用了 40% 到 60% 的 tokens。”
“小米凭借此次发布震撼了 open-source AI 领域。”
“原生的多模态特性意味着它不需要单独的视频 vision encoder。”
“通过提供正确的环境钩子(hooks),你可以有效地构建整个 OS 组件。”
“Mimo 的定价意在压制所有人……编码计划的第一个月仅需 6 美元。”
“Benchmark 只能说明部分情况;我希望它们是真正的构建者,能把屋顶修好。”
“在处理长 reasoning 链时,它比早期的 V2 版本稳定得多。”
“其原生 API 的定价非常激进,很可能是为了抢占开发者市场。”
“它在处理极高频音频时稍显吃力,但在处理对话语音时完美无缺。”
MiMo V2.5 Pro专业提示
专家提示助您充分利用MiMo V2.5 Pro。
管理 chain-of-thought 延迟
在 prompt 中加入“不要过度思考(don't overthink)”,以减少简单技术查询的 reasoning 延迟。
保留 reasoning 内容
在多轮对话中传回之前的 reasoning_content,以保持 agentic 性能。
定义环境功能(Affordances)
清晰地指定工具环境的能力,因为该 model 针对环境感知(harness awareness)进行了优化。
优化本地部署
使用 FP8 混合精度权重,以平衡内存效率和高输出 throughput。
用户评价
用户怎么说
加入数千名已改变工作流程的满意用户
Jonathan Kogan
Co-Founder/CEO, rpatools.io
Automatio is one of the most used for RPA Tools both internally and externally. It saves us countless hours of work and we realized this could do the same for other startups and so we choose Automatio for most of our automation needs.
Mohammed Ibrahim
CEO, qannas.pro
I have used many tools over the past 5 years, Automatio is the Jack of All trades.. !! it could be your scraping bot in the morning and then it becomes your VA by the noon and in the evening it does your automations.. its amazing!
Ben Bressington
CTO, AiChatSolutions
Automatio is fantastic and simple to use to extract data from any website. This allowed me to replace a developer and do tasks myself as they only take a few minutes to setup and forget about it. Automatio is a game changer!
Sarah Chen
Head of Growth, ScaleUp Labs
We've tried dozens of automation tools, but Automatio stands out for its flexibility and ease of use. Our team productivity increased by 40% within the first month of adoption.
David Park
Founder, DataDriven.io
The AI-powered features in Automatio are incredible. It understands context and adapts to changes in websites automatically. No more broken scrapers!
Emily Rodriguez
Marketing Director, GrowthMetrics
Automatio transformed our lead generation process. What used to take our team days now happens automatically in minutes. The ROI is incredible.
Jonathan Kogan
Co-Founder/CEO, rpatools.io
Automatio is one of the most used for RPA Tools both internally and externally. It saves us countless hours of work and we realized this could do the same for other startups and so we choose Automatio for most of our automation needs.
Mohammed Ibrahim
CEO, qannas.pro
I have used many tools over the past 5 years, Automatio is the Jack of All trades.. !! it could be your scraping bot in the morning and then it becomes your VA by the noon and in the evening it does your automations.. its amazing!
Ben Bressington
CTO, AiChatSolutions
Automatio is fantastic and simple to use to extract data from any website. This allowed me to replace a developer and do tasks myself as they only take a few minutes to setup and forget about it. Automatio is a game changer!
Sarah Chen
Head of Growth, ScaleUp Labs
We've tried dozens of automation tools, but Automatio stands out for its flexibility and ease of use. Our team productivity increased by 40% within the first month of adoption.
David Park
Founder, DataDriven.io
The AI-powered features in Automatio are incredible. It understands context and adapts to changes in websites automatically. No more broken scrapers!
Emily Rodriguez
Marketing Director, GrowthMetrics
Automatio transformed our lead generation process. What used to take our team days now happens automatically in minutes. The ROI is incredible.
相关 AI Models
DeepSeek-V3.2-Speciale
DeepSeek
DeepSeek-V3.2-Speciale is a reasoning-first LLM featuring gold-medal math performance, DeepSeek Sparse Attention, and a 131K context window. Rivaling GPT-5...
MiniMax M2.5
minimax
MiniMax M2.5 is a SOTA MoE model featuring a 1M context window and elite agentic coding capabilities at disruptive pricing for autonomous agents.
GLM-4.7
Zhipu (GLM)
GLM-4.7 by Zhipu AI is a flagship 358B MoE model featuring a 200K context window, elite 73.8% SWE-bench performance, and native Deep Thinking for agentic...
Qwen3-Coder-Next
alibaba
Qwen3-Coder-Next is Alibaba Cloud's elite Apache 2.0 coding model, featuring an 80B MoE architecture and 256k context window for advanced local development.
GPT-4o mini
OpenAI
OpenAI's most cost-efficient small model, GPT-4o mini offers multimodal intelligence and high-speed performance at a significantly lower price point.
Qwen 3.7 Max
alibaba
Qwen 3.7 Max is Alibaba’s flagship AI model for deep reasoning and autonomous agent tasks, featuring a 256k context window and top-tier coding performance.
Qwen3.5-Omni
alibaba
Qwen3.5-Omni is a natively omnimodal AI by Alibaba Cloud, offering seamless audio-visual reasoning, real-time voice chat, and 256k context for low-latency apps.
DeepSeek v4
DeepSeek
DeepSeek v4 is a 1.6T parameter MoE model featuring a 1M token context window and native multimodal support for text, vision, and video at disruptive prices.
关于MiMo V2.5 Pro的常见问题
查找关于MiMo V2.5 Pro的常见问题答案