Models · Compare
DeepSeek-V3.2 vs Claude Opus 4.5
Rows highlighted in warm gray are where the models differ. Numbers carry their as-of date and primary source.
Specs
| Field | A: DeepSeek-V3.2 | B: Claude Opus 4.5 |
|---|---|---|
| Released | — | 2025-11-24 |
| Developer | DeepSeek | Anthropic |
| Openness | Open | Proprietary |
| License | MIT | Proprietary |
| OSI-approved | yes | no |
| Data released | no | no |
| Training code | no | no |
| Architecture | moe | unknown |
| Total params | — | — |
| Active params | — | — |
| Experts | — | — |
| Context window | — | — |
| Attention | dsa | unknown |
| Position enc. | rope-yarn | unknown |
| Pretraining tokens | — | — |
| Post-training | sft, rlhf | rlhf, constitutional |
| Training hardware | — | — |
| $/M input | $0.50 | $5.00 |
| $/M output | $1.60 | $25.00 |
| Output tok/sec | 0 | 51.8 |
Benchmarks
Missing scores render as not reported; never inferred. Bold highlights the leader per benchmark.
General reasoning
| MMLU-Pro | 83.7 2026-05-21 | 88.9 2026-05-21 |
| GPQA-Diamond | 75.1 2026-05-21 | 81.0 2026-05-21 |
Code
| LiveCodeBench | 59.3 2026-05-21 | 73.8 2026-05-21 |
Math
| AIME 2025 | 59.0 2026-05-21 | 62.7 2026-05-21 |
Context · A
Successor to V3.2-Exp, released December 1 2025 with DeepSeek Sparse Attention for long-context efficiency. Integrated reasoning with tool use through a new chat template, and a Speciale variant claimed gold-level finishes on IMO, CMO, ICPC, and IOI 2025.
Context · B
Anthropic's first Opus to cross 80 percent on SWE-Bench Verified per the lab's own numbers, released November 24 2025 at a two-thirds price cut versus Opus 4.1 ($5 / $25 per Mtok). Added an effort parameter for adjustable reasoning intensity, with medium-effort runs matching Sonnet 4.5 using 76 percent fewer output tokens.