1 vs 0 benchmarks won
Mistral Mistral Medium 3.5 | Moonshot AI Kimi K2 Thinking | |
|---|---|---|
| Overview | ||
| Company | Mistral | Moonshot AI |
| Release date | Apr 29 2026 | Nov 6 2025 |
| Model type | — | — |
| Open source | Yes | Yes |
| Specifications | ||
Parameters | 128B | 1T |
Context window | 256k | 256k |
| Benchmarks | ||
Science reasoning GPQA Diamond | — | — |
Software engineering SWE-Bench Verified | 77.6% | 71.3% |
Multimodal understanding MMMU | — | — |
| Timeline | ||
| Release gap | Kimi K2 Thinking shipped 174 days before Mistral Medium 3.5 | |
Mistral Medium 3.5 leads Kimi K2 Thinking on 1 of the tracked benchmarks (GPQA Diamond, SWE-Bench Verified, MMMU). Kimi K2 Thinking shipped 174 days before Mistral Medium 3.5, so benchmark comparisons should account for the intervening progress.
Mistral Medium 3.5 has 128B parameters, while Kimi K2 Thinking has 1T. Context windows are 256k (Mistral Medium 3.5) vs 256k (Kimi K2 Thinking).
On SWE-Bench Verified, Mistral Medium 3.5 scores 77.6%, 6.3 points above Kimi K2 Thinking at 71.3%.