Models · Compare
Command A vs Gemini 2.5 Pro
Rows highlighted in warm gray are where the models differ. Numbers carry their as-of date and primary source.
Specs
| Field | A: Command A | B: Gemini 2.5 Pro |
|---|---|---|
| Released | 2025-03-13 | 2025-03-25 |
| Developer | Cohere | Google DeepMind |
| Openness | Open weights | Proprietary |
| License | CC-BY-NC-4.0 | Proprietary |
| OSI-approved | no | no |
| Data released | no | no |
| Training code | no | no |
| Architecture | dense | unknown |
| Total params | 111B | — |
| Active params | — | — |
| Experts | — | — |
| Context window | 256K | 1.0M |
| Attention | unknown | unknown |
| Position enc. | unknown | unknown |
| Pretraining tokens | — | — |
| Post-training | sft, rlhf | rlhf |
| Training hardware | — | — |
| $/M input | $2.50 | $1.25 |
| $/M output | $10.00 | $10.00 |
| Output tok/sec | 44.8 | 127.3 |
Benchmarks
Missing scores render as not reported; never inferred. Bold highlights the leader per benchmark.
General reasoning
| MMLU-Pro | 71.2 2026-05-21 | 86.2 2026-05-21 |
| GPQA-Diamond | 52.7 2026-05-21 | — |
Code
| LiveCodeBench | 28.7 2026-05-21 | 80.1 2026-05-21 |
Math
| MATH | 81.9 2026-05-21 | 96.7 2026-05-21 |
| AIME 2024 | 9.7 2026-05-21 | — |
| AIME 2025 | 13.0 2026-05-21 | 87.7 2026-05-21 |
Held-out / arena
Context · A
Cohere's most-performant model at release, replacing Command R+ as the enterprise flagship. 111B parameters, 256K context, 23-language support, and 150 percent throughput vs Command R+ on two A100s or H100s. Released March 13 2025.
Context · B
The first Gemini release to clearly lead on LMArena Elo and on hard reasoning benchmarks. Native 1M-token context with reported 2M expansion in pipeline.