The Open-Source AI Stack
RSS
All models

Models · Compare

DeepSeek-V3.2 vs Claude Opus 4.5

Rows highlighted in warm gray are where the models differ. Numbers carry their as-of date and primary source.

Specs

Field A: DeepSeek-V3.2 B: Claude Opus 4.5
Released 2025-11-24
Developer DeepSeekAnthropic
Openness OpenProprietary
License MITProprietary
OSI-approved yesno
Data released nono
Training code nono
Architecture moeunknown
Total params
Active params
Experts
Context window
Attention dsaunknown
Position enc. rope-yarnunknown
Pretraining tokens
Post-training sft, rlhfrlhf, constitutional
Training hardware
$/M input $0.50$5.00
$/M output $1.60$25.00
Output tok/sec 051.8

Benchmarks

Missing scores render as not reported; never inferred. Bold highlights the leader per benchmark.

General reasoning

MMLU-Pro 83.7 2026-05-21 88.9 2026-05-21
GPQA-Diamond 75.1 2026-05-21 81.0 2026-05-21

Code

LiveCodeBench 59.3 2026-05-21 73.8 2026-05-21

Math

AIME 2025 59.0 2026-05-21 62.7 2026-05-21

Context · A

Successor to V3.2-Exp, released December 1 2025 with DeepSeek Sparse Attention for long-context efficiency. Integrated reasoning with tool use through a new chat template, and a Speciale variant claimed gold-level finishes on IMO, CMO, ICPC, and IOI 2025.

Context · B

Anthropic's first Opus to cross 80 percent on SWE-Bench Verified per the lab's own numbers, released November 24 2025 at a two-thirds price cut versus Opus 4.1 ($5 / $25 per Mtok). Added an effort parameter for adjustable reasoning intensity, with medium-effort runs matching Sonnet 4.5 using 76 percent fewer output tokens.

DeepSeek-V3.2 detail → · Claude Opus 4.5 detail →