1 vs 1 benchmarks won
| | Anthropic Claude Opus 4.1 | Mistral Mistral Medium 3.5 |
|---|---|---|
| Overview | | |
| Company | Anthropic | Mistral |
| Release date | Aug 5 2025 | Apr 29 2026 |
| Model type | — | — |
| Open source | No | Yes |
| Specifications | | |
| Parameters | — | 128B |
| Context window | — | 256k |
| Benchmarks | | |
| Science reasoning (GPQA Diamond) | 80.9% | — |
| Software engineering (SWE-Bench Verified) | 74.5% | 77.6% |
| Multimodal understanding (MMMU) | — | — |
| Timeline | | |
| Release gap | Claude Opus 4.1 shipped 267 days before Mistral Medium 3.5 | |
Claude Opus 4.1 and Mistral Medium 3.5 have only one listed benchmark with scores for both models, SWE-Bench Verified; GPQA Diamond has a score only for Claude Opus 4.1, and MMMU has none for either. Claude Opus 4.1 shipped 267 days before Mistral Medium 3.5, so benchmark comparisons should account for the intervening progress.
Mistral Medium 3.5 is an open-source / open-weight model; Claude Opus 4.1 is proprietary.
On SWE-Bench Verified, Mistral Medium 3.5 scores 77.6%, 3.1 points above Claude Opus 4.1 at 74.5%.
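The release gap and the SWE-Bench Verified margin quoted above can be rechecked with a few lines of Python. This is only a sketch of the arithmetic: the dates and scores are copied from the table, and the variable names are illustrative, not part of any published tooling.

```python
from datetime import date

# Release dates as listed in the comparison table.
opus_release = date(2025, 8, 5)      # Claude Opus 4.1
medium_release = date(2026, 4, 29)   # Mistral Medium 3.5

# Days between the two releases.
release_gap_days = (medium_release - opus_release).days

# SWE-Bench Verified scores (percent) as listed in the table.
swe_bench = {"Claude Opus 4.1": 74.5, "Mistral Medium 3.5": 77.6}

# Margin in percentage points, rounded to one decimal to avoid float noise.
lead_points = round(
    swe_bench["Mistral Medium 3.5"] - swe_bench["Claude Opus 4.1"], 1
)

print(f"Release gap: {release_gap_days} days")
print(f"SWE-Bench Verified margin: {lead_points} points")
```

The margin is a difference in percentage points, not a relative improvement, which is how the summary sentence above states it.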