Models · Compare

Claude Sonnet 4.5 vs GLM-4.6

Rows highlighted in warm gray are where the models differ. Numbers carry their as-of date and primary source.

Specs

Field	A: Claude Sonnet 4.5	B: GLM-4.6
Released	2025-09-29	2025-09-30
Developer	Anthropic	Zhipu AI
Openness	Proprietary	Open
License	Proprietary	MIT
OSI-approved	no	yes
Data released	no	no
Training code	no	no
Architecture	unknown	moe
Total params	—	357B
Active params	—	—
Experts	—	—
Context window	—	128K
Attention	unknown	unknown
Position enc.	unknown	unknown
Pretraining tokens	—	—
Post-training	rlhf, constitutional	sft, rlhf
Training hardware	—	—
$/M input	$3.00	$0.60
$/M output	$15.00	$2.20
Output tok/sec	48.8	30.7

Benchmarks

Missing scores render as not reported; never inferred. Bold highlights the leader per benchmark.

General reasoning

MMLU-Pro	86.0 2026-05-21	78.4 2026-05-21
GPQA-Diamond	72.7 2026-05-21	63.2 2026-05-21

Code

SWE-Bench Verified	77.2 2025-09-29	—
LiveCodeBench	59.0 2026-05-21	56.1 2026-05-21

Math

AIME 2025

37.0 2026-05-21

44.3 2026-05-21

Context · A

September 29 2025 incremental update of the Sonnet line at the same $3 / $15 price point as Sonnet 4. Shipped alongside Claude Code checkpoints, a native VS Code extension, context editing and memory tools for long-running agents, and the Claude Agent SDK. OSWorld score of 61.4% positioned it as the strongest computer-use model from Anthropic at launch.

Context · B

September 30 2025 refresh, 357B MoE with 200K context (up from 128K in 4.5) and 128K maximum output. Zhipu reports a 27 percent coding improvement over 4.5 and parity with Claude Sonnet 4 on 8 public benchmarks. MIT licensed.

Claude Sonnet 4.5 detail → · GLM-4.6 detail →