Models · Compare
GPT-5 vs GLM-4.5
Rows highlighted in warm gray are where the models differ. Numbers carry their as-of date and primary source.
Specs
| Field | A: GPT-5 | B: GLM-4.5 |
|---|---|---|
| Released | — | 2025-07-28 |
| Developer | OpenAI | Zhipu AI |
| Openness | Proprietary | Open |
| License | Proprietary | MIT |
| OSI-approved | no | yes |
| Data released | no | no |
| Training code | no | no |
| Architecture | unknown | moe |
| Total params | — | 355B |
| Active params | — | 32B |
| Experts | — | — |
| Context window | — | 128K |
| Attention | unknown | unknown |
| Position enc. | unknown | unknown |
| Pretraining tokens | — | 23.0T |
| Post-training | rlhf | sft, rlhf |
| Training hardware | — | — |
| $/M input | $1.25 | — |
| $/M output | $10.00 | — |
| Output tok/sec | 72 | — |
Benchmarks
Missing scores render as not reported; never inferred. Bold highlights the leader per benchmark.
General reasoning
| MMLU-Pro | 87.1 2026-05-21 | — |
| GPQA-Diamond | 85.4 2026-05-21 | — |
Code
| SWE-Bench Verified | — | 64.2 2025-07-28 |
| LiveCodeBench | 84.6 2026-05-21 | — |
Math
| MATH | 99.4 2026-05-21 | — |
| AIME 2024 | 95.7 2026-05-21 | 91.0 2025-07-28 |
Context · A
OpenAI's first unified model, fusing the fast GPT-series with the deeper o-series reasoning track behind a real-time router that picks per-turn. Launched August 7 2025 via livestream; immediately default in ChatGPT, available in API and on the GitHub Models Playground. Ships in main, mini, nano, thinking, and thinking-mini variants with low/medium/high/minimal effort settings.
Context · B
Zhipu's first ARC-focused flagship (agentic, reasoning, coding), released July 28 2025 as 355B total / 32B active MoE under MIT. 23T-token pretraining run. Hybrid thinking / non-thinking modes. GLM-4.5-Air (106B / 12B) released alongside. Zhipu reports 90.6 percent tool-calling success vs Claude 4 Sonnet's 89.5 percent.