The Open-Source AI Stack
RSS
All models

Models · Compare

Kimi K2.5 vs Claude Opus 4.6

Rows highlighted in warm gray are where the models differ. Numbers carry their as-of date and primary source.

Specs

Field A: Kimi K2.5 B: Claude Opus 4.6
Released 2026-01-272026-02-05
Developer Moonshot AIAnthropic
Openness OpenProprietary
License Modified MITProprietary
OSI-approved nono
Data released nono
Training code nono
Architecture moeunknown
Total params 1T
Active params 32B
Experts 384 (8 active)
Context window 256K1.0M
Attention mlaunknown
Position enc. ropeunknown
Pretraining tokens 15.0T
Post-training sft, rlhfrlhf, constitutional
Training hardware
$/M input $0.58$5.00
$/M output $3.00$25.00
Output tok/sec 33.943.7

Benchmarks

Missing scores render as not reported; never inferred. Bold highlights the leader per benchmark.

General reasoning

GPQA-Diamond 87.6 2026-01-27 84.0 2026-05-21

Code

SWE-Bench Verified 76.8 2026-01-27
LiveCodeBench 85.0 2026-01-27

Math

AIME 2025 96.1 2026-01-27

Context · A

Native multimodal Kimi at 1T total / 32B active with MoonViT (400M-parameter vision encoder), trained on roughly 15T mixed visual and text tokens. 256K context, Modified MIT license, January 27 2026 release. Adds visual reasoning, video understanding, and UI-to-code workflows.

Context · B

First Opus-class model with a 1M-token context window (in beta), released February 5 2026. Headline feature was agent teams in Claude Code, letting multiple agents work in parallel and coordinate autonomously. Long-context pricing premium kicks in above 200K input tokens ($10 / $37.50 per Mtok).

Kimi K2.5 detail → · Claude Opus 4.6 detail →