The Open-Source AI Stack
RSS
All models

Models · Compare

Claude Sonnet 4.5 vs GLM-4.6

Rows highlighted in warm gray are where the models differ. Numbers carry their as-of date and primary source.

Specs

Field A: Claude Sonnet 4.5 B: GLM-4.6
Released 2025-09-292025-09-30
Developer AnthropicZhipu AI
Openness ProprietaryOpen
License ProprietaryMIT
OSI-approved noyes
Data released nono
Training code nono
Architecture unknownmoe
Total params 357B
Active params
Experts
Context window 128K
Attention unknownunknown
Position enc. unknownunknown
Pretraining tokens
Post-training rlhf, constitutionalsft, rlhf
Training hardware
$/M input $3.00$0.60
$/M output $15.00$2.20
Output tok/sec 48.830.7

Benchmarks

Missing scores render as not reported; never inferred. Bold highlights the leader per benchmark.

General reasoning

MMLU-Pro 86.0 2026-05-21 78.4 2026-05-21
GPQA-Diamond 72.7 2026-05-21 63.2 2026-05-21

Code

SWE-Bench Verified 77.2 2025-09-29
LiveCodeBench 59.0 2026-05-21 56.1 2026-05-21

Math

AIME 2025 37.0 2026-05-21 44.3 2026-05-21

Context · A

September 29 2025 incremental update of the Sonnet line at the same $3 / $15 price point as Sonnet 4. Shipped alongside Claude Code checkpoints, a native VS Code extension, context editing and memory tools for long-running agents, and the Claude Agent SDK. OSWorld score of 61.4% positioned it as the strongest computer-use model from Anthropic at launch.

Context · B

September 30 2025 refresh, 357B MoE with 200K context (up from 128K in 4.5) and 128K maximum output. Zhipu reports a 27 percent coding improvement over 4.5 and parity with Claude Sonnet 4 on 8 public benchmarks. MIT licensed.

Claude Sonnet 4.5 detail → · GLM-4.6 detail →