Models · Compare
GPT-4 Turbo vs Command R+
Rows highlighted in warm gray are where the models differ. Numbers carry their as-of date and primary source.
Specs
| Field | A: GPT-4 Turbo | B: Command R+ |
|---|---|---|
| Released | 2024-04-09 | 2024-04-04 |
| Developer | OpenAI | Cohere |
| Openness | Proprietary | Source-available |
| License | Proprietary | CC BY-NC 4.0 |
| OSI-approved | no | no |
| Data released | no | no |
| Training code | no | no |
| Architecture | unknown | dense |
| Total params | — | — |
| Active params | — | — |
| Experts | — | — |
| Context window | 128K | — |
| Attention | unknown | gqa |
| Position enc. | unknown | rope |
| Pretraining tokens | — | — |
| Post-training | rlhf | sft, dpo |
| Training hardware | — | — |
| $/M input | $10.00 | — |
| $/M output | $30.00 | — |
| Output tok/sec | 27.8 | — |
Benchmarks
Missing scores render as not reported; never inferred. Bold highlights the leader per benchmark.
General reasoning
| MMLU-Pro | 69.4 2026-05-21 | — |
Code
| LiveCodeBench | 29.1 2026-05-21 | — |
Math
| MATH | 73.7 2026-05-21 | — |
| AIME 2024 | 15.0 2026-05-21 | — |
Context · A
Announced at OpenAI DevDay on November 6, 2023 as a 128K-context, cheaper successor to the original GPT-4 endpoint. The gpt-4-turbo-2024-04-09 revision shipped as the general-availability version with vision support and a knowledge cutoff through December 2023.
Context · B
Cohere released Command R+ as a 104B dense model with 128K context, optimized for retrieval-augmented generation, multi-step tool use, and grounded citation. Production weights remain proprietary, but Cohere Labs published the research weights on Hugging Face under CC BY-NC 4.0, making it source-available for non-commercial use.