Claude Opus 4.6 vs Kimi K2.5
Rows where the models differ are highlighted in warm gray. Each number carries its as-of date and primary source.
Specs
| Field | A: Claude Opus 4.6 | B: Kimi K2.5 |
|---|---|---|
| Released | 2026-02-05 | 2026-01-27 |
| Developer | Anthropic | Moonshot AI |
| Openness | Proprietary | Open |
| License | Proprietary | Modified MIT |
| OSI-approved | no | no |
| Data released | no | no |
| Training code | no | no |
| Architecture | unknown | MoE |
| Total params | — | 1T |
| Active params | — | 32B |
| Experts | — | 384 (8 active) |
| Context window | 1.0M | 256K |
| Attention | unknown | MLA |
| Position enc. | unknown | RoPE |
| Pretraining tokens | — | 15.0T |
| Post-training | RLHF, Constitutional AI | SFT, RLHF |
| Training hardware | — | — |
| Price per 1M input tokens (USD) | $5.00 | $0.58 |
| Price per 1M output tokens (USD) | $25.00 | $3.00 |
| Output speed (tokens/sec) | 43.7 | 33.9 |
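The price and throughput rows are easier to compare as ratios. The following is a minimal sketch that derives them from the table values; the dictionary layout and the names `SPECS`, `ratio`, and `seconds_to_generate` are illustrative assumptions, not part of either vendor's tooling.

```python
# Figures copied from the Specs table; all names here are illustrative.
SPECS = {
    "Claude Opus 4.6": {"usd_in": 5.00, "usd_out": 25.00, "tok_per_sec": 43.7},
    "Kimi K2.5": {"usd_in": 0.58, "usd_out": 3.00, "tok_per_sec": 33.9},
}


def ratio(field: str, a: str = "Claude Opus 4.6", b: str = "Kimi K2.5") -> float:
    """How many times larger model a's value is than model b's for one field."""
    return SPECS[a][field] / SPECS[b][field]


def seconds_to_generate(model: str, output_tokens: int) -> float:
    """Streaming time at the listed throughput (ignores time to first token)."""
    return output_tokens / SPECS[model]["tok_per_sec"]


# Kimi K2.5 sparsity: share of parameters active per token (32B of 1T).
active_fraction = 32e9 / 1e12

print(f"input price ratio:  {ratio('usd_in'):.2f}x")
print(f"output price ratio: {ratio('usd_out'):.2f}x")
print(f"active parameters:  {active_fraction:.1%}")
for name in SPECS:
    print(f"{name}: {seconds_to_generate(name, 1000):.1f} s per 1,000 output tokens")
```

At the listed figures, Claude Opus 4.6 costs roughly 8.6 times as much per input token and 8.3 times as much per output token, while Kimi K2.5 activates about 3.2% of its parameters per token. The throughput estimate is only a lower bound on wall-clock time, since it leaves out queueing and time to first token.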
Benchmarks
Missing scores are marked as not reported and are never inferred. Bold highlights the leader per benchmark.
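General reasoning
| Benchmark | A: Claude Opus 4.6 | B: Kimi K2.5 |
|---|---|---|
| GPQA-Diamond | 84.0 (as of 2026-05-21) | **87.6** (as of 2026-01-27) |
Code
| Benchmark | A: Claude Opus 4.6 | B: Kimi K2.5 |
|---|---|---|
| SWE-Bench Verified | — | 76.8 (as of 2026-01-27) |
| LiveCodeBench | — | 85.0 (as of 2026-01-27) |
Math
| Benchmark | A: Claude Opus 4.6 | B: Kimi K2.5 |
|---|---|---|
| AIME 2025 | — | 96.1 (as of 2026-01-27) |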
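The "never inferred" rule can be made concrete with a small sketch. It assumes that a benchmark with only one reported score has no leader; the page does not say how it treats that case, so this is one plausible reading. The names `leader` and `SCORES` are hypothetical.

```python
from typing import Optional

# Scores copied from the benchmark tables; None stands for "not reported".
SCORES = {
    "GPQA-Diamond": (84.0, 87.6),
    "SWE-Bench Verified": (None, 76.8),
    "LiveCodeBench": (None, 85.0),
    "AIME 2025": (None, 96.1),
}


def leader(score_a: Optional[float], score_b: Optional[float]) -> Optional[str]:
    """Return "A" or "B" for the higher score, or None if undecidable."""
    if score_a is None or score_b is None:
        return None  # assumption: a missing score is never filled in or outranked
    if score_a == score_b:
        return None  # a tie has no leader
    return "A" if score_a > score_b else "B"


leaders = {name: leader(a, b) for name, (a, b) in SCORES.items()}
for name, who in leaders.items():
    print(f"{name}: {who or 'no leader (score not reported)'}")
```

Under this reading only GPQA-Diamond has a leader (B). The two GPQA-Diamond scores also carry different as-of dates, so the comparison is not strictly like for like.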
Context · A
The first Opus-class model with a 1M-token context window (in beta), released February 5, 2026. Its headline feature was agent teams in Claude Code, which let multiple agents work in parallel and coordinate autonomously. A long-context pricing premium applies above 200K input tokens ($10 input / $37.50 output per million tokens).
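The effect of the long-context premium on a single request can be sketched as follows. The rates and the 200K threshold come from this page; the assumption that the premium rate applies to every token in a request once its input exceeds the threshold (rather than only to the excess) is not stated here and should be checked against the vendor's pricing documentation. The function name `opus_request_cost` is hypothetical.

```python
LONG_CONTEXT_THRESHOLD = 200_000  # input tokens, from the text above

# (input, output) prices in USD per million tokens.
STANDARD_RATES = (5.00, 25.00)
LONG_CONTEXT_RATES = (10.00, 37.50)


def opus_request_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request.

    Assumption: once input exceeds the threshold, the premium rates
    apply to the whole request, not just the tokens above 200K.
    """
    if input_tokens > LONG_CONTEXT_THRESHOLD:
        rate_in, rate_out = LONG_CONTEXT_RATES
    else:
        rate_in, rate_out = STANDARD_RATES
    return (input_tokens * rate_in + output_tokens * rate_out) / 1_000_000


print(f"${opus_request_cost(200_000, 10_000):.2f}")  # at the threshold: standard rates
print(f"${opus_request_cost(300_000, 10_000):.2f}")  # above it: premium rates
```

Under that assumption, crossing the threshold changes the per-token price for the entire request, so keeping input at or below 200K tokens avoids the premium altogether.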
Context · B
A natively multimodal Kimi model with 1T total / 32B active parameters and MoonViT, a 400M-parameter vision encoder, trained on roughly 15T mixed visual and text tokens. It has a 256K context window and a Modified MIT license, and was released January 27, 2026. It adds visual reasoning, video understanding, and UI-to-code workflows.