Particle.news

Xiaomi Debuts MiMo‑V2.5 AI Family, Unifying Text, Image, Audio and Video

Lower prices and a 1M‑token context window at no extra fee aim to cut developer costs.

Overview

  • Xiaomi launched MiMo‑V2.5 and MiMo‑V2.5‑Pro, which fold vision, speech, and video into one model family, with access live through the MiMo API and limited availability in AI Studio.
  • The company reports near top‑tier results on coding and agent tasks, including a 57.2% score on SWE‑bench Pro, a test in which models fix real software bugs, though the models still trail on the hardest reasoning exams.
  • Both models support a 1M‑token context window at no extra fee. MiMo‑V2.5‑Pro costs $1.00 per million input tokens and $3.00 per million output tokens; the base model costs $0.40 per million input tokens and $2.00 per million output tokens.
  • Xiaomi says MiMo‑V2.5‑Pro uses about 42% fewer tokens than Kimi K2.6 at similar scores, and that the base model uses nearly half the tokens of Muse Spark, which can lower bills for large, long‑running workflows.
  • Xiaomi highlights a rapid release cadence backed by an $8.7 billion AI commitment, growing usage on OpenRouter after a Hermes free‑access push, and plans to open‑source the models as it trains the next generation.
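
To make the pricing above concrete, here is a minimal sketch of how a bill could be estimated from the listed per‑million‑token rates. The dictionary keys and the `estimate_cost` helper are hypothetical names chosen for illustration, not part of the MiMo API, and the sketch assumes billing is a simple linear function of input and output tokens with no other fees.

```python
# Listed prices in USD per million tokens, taken from the overview above.
# The keys are illustrative labels, not confirmed API model identifiers.
PRICES_PER_MILLION = {
    "MiMo-V2.5-Pro": {"input": 1.00, "output": 3.00},
    "MiMo-V2.5": {"input": 0.40, "output": 2.00},
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of a workload, assuming purely linear pricing."""
    price = PRICES_PER_MILLION[model]
    input_cost = input_tokens / 1_000_000 * price["input"]
    output_cost = output_tokens / 1_000_000 * price["output"]
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical workload: 2M input tokens and 500K output tokens.
    for name in PRICES_PER_MILLION:
        cost = estimate_cost(name, 2_000_000, 500_000)
        print(f"{name}: ${cost:.2f}")
```

Under this assumption, output cost scales directly with the number of tokens generated, which is why a model that reaches similar scores with fewer tokens can lower the bill for long‑running workflows.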