MiMo API
MiMo pay-as-you-go API with separate pricing for V2.5 text, ASR, TTS, and web search, independent from Token Plan quota
Core models
This is the current flagship pay-as-you-go text model in the official docs. It is priced domestically at ¥0.025 for cache hits, ¥3 for uncached input, and ¥6 for output per 1M tokens, or $0.0036, $0.435, and $0.87 overseas, which makes it better suited to complex engineering, critical content generation, and high-value flows where per-call quality matters more.
This is the current mainline standard pay-as-you-go text model, priced domestically at ¥0.02 for cache hits, ¥1 for uncached input, and ¥2 for output per 1M tokens, or $0.0028, $0.14, and $0.28 overseas. Compared with the flagship tier, it is better suited to day-to-day large-scale text usage, knowledge assistants, and routine coding support.
The docs list this as the current ASR model, and unlike text models it is billed by input audio duration at ¥0.5/hour domestically or $0.074/hour overseas. For meeting notes, voice input, and support-call analysis, it acts as the key speech-ingestion capability inside the MiMo platform.
VoiceClone is one of the explicitly listed TTS-series models in the official docs and is currently temporarily free. It is more suitable for turning text results into speech that feels more personalized, character-driven, or closer to a specific vocal identity, which makes it valuable for voice assistants, companion-style interactions, and dubbing workflows.
VoiceDesign is also a currently supported and temporarily free TTS-series model. It is better suited to scenarios that need speech-style design, narrated product experiences, or voice-product prototypes, showing that MiMo’s pay-as-you-go platform covers not only text and ASR but also the speech-output layer.
Additional core model names still appear above, with full details on the latest official page.
Plan details
mimo-v2.5
Main text modelRecommendedmimo-v2.5 as one of the current main pay-as-you-go text models, priced domestically at ¥0.02 for cache hits, ¥1 for uncached input, and ¥2 for output per 1M tokens, making it the more scalable entry point for cost-sensitive, high-frequency text workloads.mimo-v2.5-pro
Flagship text modelmimo-v2.5-pro as the current flagship pay-as-you-go text model, priced domestically at ¥0.025 for cache hits, ¥3 for uncached input, and ¥6 for output per 1M tokens, which is clearly above the standard model and better suited to smaller volumes of high-value workloads.mimo-v2.5, which makes it more reasonable for complex software engineering, critical result generation, and multi-step agent work where per-call quality matters more.mimo-v2.5-asr is explicit: ¥0.5/hour domestically and $0.074/hour overseas, prorated from second-level measurement, which means it follows a different cost model than text generation and is easier to budget separately as a voice-input layer.TTS 系列
Temporarily freeTTS 系列
Temporarily freemimo-v2.5-tts, mimo-v2.5-tts-voiceclone, and mimo-v2.5-tts-voicedesign as temporarily free, which gives MiMo’s pay-as-you-go platform a notably favorable window for trying and integrating voice output.Notes
- Domestic pricing is in CNY per 1M tokens and overseas pricing is in USD per 1M tokens; web search plugins cost ¥16/1000 calls domestically and $5/1000 calls overseas, billed separately from token pricing.
- The TTS family and cache writes are currently temporarily free with no published end date; ASR costs ¥0.5/hour domestically and $0.074/hour overseas. MiMo-V2.5 price cuts took effect on 2026-05-27.
- New integrations should use the V2.5 family directly. V2 migration: `mimo-v2-pro` and `mimo-v2-omni` auto-route to V2.5 pricing since 2026-06-01; `mimo-v2-flash` and `mimo-v2-tts` from 2026-06-18; the V2 family retires 2026-06-30.
Supported coding tools
Pricing and model data sourced from official vendor websites