Models · Compare
DeepSeek-V3.1 vs GPT-5
Rows highlighted in warm gray are where the models differ. Numbers carry their as-of date and primary source.
Specs
| Field | A: DeepSeek-V3.1 | B: GPT-5 |
|---|---|---|
| Released | — | — |
| Developer | DeepSeek | OpenAI |
| Openness | Open | Proprietary |
| License | MIT | Proprietary |
| OSI-approved | yes | no |
| Data released | no | no |
| Training code | no | no |
| Architecture | moe | unknown |
| Total params | 671B | — |
| Active params | 37B | — |
| Experts | — | — |
| Context window | 128K | — |
| Attention | mla | unknown |
| Position enc. | rope-yarn | unknown |
| Pretraining tokens | — | — |
| Post-training | sft, rlhf | rlhf |
| Training hardware | — | — |
| $/M input | $0.56 | $1.25 |
| $/M output | $1.67 | $10.00 |
| Output tok/sec | 0 | 72 |
Benchmarks
Missing scores render as not reported; never inferred. Bold highlights the leader per benchmark.
General reasoning
| MMLU-Pro | 83.3 2026-05-21 | 87.1 2026-05-21 |
| GPQA-Diamond | 73.5 2026-05-21 | 85.4 2026-05-21 |
Code
| LiveCodeBench | 57.7 2026-05-21 | 84.6 2026-05-21 |
Math
| MATH | — | 99.4 2026-05-21 |
| AIME 2024 | — | 95.7 2026-05-21 |
| AIME 2025 | 49.7 2026-05-21 | — |
Context · A
Hybrid V3.1 with one model serving both thinking and non-thinking modes, released August 21 2025. DeepSeek added 840B tokens of continued pretraining for long-context extension (32K and 128K phases) and shipped under MIT. Pricing update took effect September 5 2025.
Context · B
OpenAI's first unified model, fusing the fast GPT-series with the deeper o-series reasoning track behind a real-time router that picks per-turn. Launched August 7 2025 via livestream; immediately default in ChatGPT, available in API and on the GitHub Models Playground. Ships in main, mini, nano, thinking, and thinking-mini variants with low/medium/high/minimal effort settings.